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PREFACE 




This is a textbook on gravitation physics (Einstein’s “general relativity” or “geo- 
metrodynamics”). It supplies two tracks through the subject. The first track is focused 
on the key physical ideas. It assumes, as mathematical prerequisite, only vector 
analysis and simple partial-differential equations. It is suitable for a one-semester 
course at the junior or senior level or in graduate school; and it constitutes—in the 
opinion of the authors—the indispensable core of gravitation theory that every 
advanced student of physics should learn. The Track-1 material is contained in those 
pages of the book that have a 1 outlined in gray in the upper outside corner, by 
which the eye of the reader can quickly pick out the Track-1 sections. In the con¬ 
tents, the same purpose is served by a gray bar beside the section, box, or figure 
number. 

The rest of the text builds up Track 1 into Track 2. Readers and teachers are 
invited to select, as enrichment material, those portions of Track 2 that interest them 
most. With a few exceptions, any Track-2 chapter can be understood by readers 
who have studied only the earlier Track-1 material. The exceptions are spelled out 
explicitly in “dependency statements” located at the beginning of each Track-2 
chapter, or at each transition within a chapter from Track 1 to Track 2. 

The entire book (all of Track 1 plus all of Track 2) is designed for a rigorous, 
full-year course at the graduate level, though many teachers of a full-year course 
I may prefer a more leisurely pace that omits some of the Track-2 material. The full 
book is intended to give a competence in gravitation physics comparable to that 
\ which the average Ph.D. has in electromagnetism. When the student achieves this 
| competence, he knows the laws of physics in flat spacetime (Chapters 1-7). He can 

; predict orders of magnitude. He can also calculate using the principal tools of modern 

differential geometry (Chapters 8-15), and he can predict at all relevant levels of 
precision. He understands Einstein’s geometric framework for physics (Chapters 
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16-22). He knows the applications of greatest present-day interest: pulsars and 
neutron stars (Chapters 23-26); cosmology (Chapters 27-30); the Schwarzschild 
geometry and gravitational collapse (Chapters 31-34); and gravitational waves 
(Chapters 35-37). He has probed the experimental tests of Einstein’s theory (Chap¬ 
ters 38-40). He will be able to read the modern mathematical literature on differential 
geometry, and also the latest papers in the physics and astrophysics journals about 
geometrodynamics and its applications. If he wishes to go beyond the field equations, 
the four major applications, and the tests, he will find at the end of the book 
(Chapters 41-44) a brief survey of several advanced topics in general relativity. 
Among the topics touched on here, superspace and quantum geometrodynamics 
receive special attention. These chapters identify some of the outstanding physical 
issues and lines of investigation being pursued today. 

Whether the department is physics or astrophysics or mathematics, more students 
than ever ask for more about general relativity than mere conversation. They want 
to hear its principal theses clearly stated. They want to know how to “work the 
handles of its information pump” themselves. More universities than ever respond 
with a serious course in Einstein’s standard 1915 geometrodynamics. What a contrast 
to Maxwells standard 1864 electrodynamics! In 1897, when Einstein was a student 
at Zurich, this subject was not on the instructional calendar of even half the 
universities of Europe. 1 “We waited in vain for an exposition of Maxwell’s theory,” 
says one of Einstein’s classmates. “Above all it was Einstein who was disappointed,” 2 
for he rated electrodynamics as “the most fascinating subject at the time” 3 —as many 
students rate Einstein’s theory today! 

Maxwell’s theory recalls Einstein’s theory in the time it took to win acceptance. 
Even as.late as 1904 a book could appear by so great an investigator as William 
Thomson, Lord Kelvin, with the words, “The so-called ‘electromagnetic theory of 
light’ has not helped us hitherto ... it seems to me that it is rather a backward 
step ... the one thing about it that seems intelligible to me, I do not think is 
admissible . . . that there should be an electric displacement perpendicular to the 
line of propagation.” 4 Did the pioneer of the Atlantic cable in the end contribute 
so richly to Maxwell electrodynamics—from units, and principles of measurement, 
to the theory of waves guided by wires—because of his own early difficulties with 
the subject? Then there is hope for many who study Einstein’s geometrodynamics 
today! By the 1920’s the weight of developments, from Kelvin’s cable to Marconi’s 
wireless, from the atom of Rutherford and Bohr to the new technology of high- 
frequency circuits, had produced general conviction that Maxwell was right. Doubt 
dwindled. Confidence led to applications, and applications led to confidence. 

Many were slow to take up general relativity in the beginning because it seemed 
to be poor in applications. Einstein’s theory attracts the interest of many today 
because it is rich in applications. No longer is attention confined to three famous 
but meager tests: the gravitational red shift, the bending of light by the sun, and 


’G. Holton (1965). 3 A. Einstein (1949a). 

2 L. Kolbros (1956). 4 W. Thomson (1904). 

Citations for references will be found in the bibliography. 
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the precession of the perihelion of Mercury around the sun. The combination of 
radar ranging and general relativity is, step by step, transforming the solar-system 
celestial mechanics of an older generation to a new subject, with a new level of 
precision, new kinds of effects, and a new outlook. Pulsars, discovered in 1968, find 
no acceptable explanation except as the neutron stars predicted in 1934, objects with 
a central density so high (^10 14 g/cm 3 ) that the Einstein predictions of mass differ 
from the Newtonian predictions by 10 to 100 per cent. About further density increase 
and a final continued gravitational collapse, Newtonian theory is silent. In contrast, 
Einstein’s standard 1915 geometrodynamics predicted in 1939 the properties of a 
completely collapsed object, a “frozen star” or “black hole.” By 1966 detailed digital 
calculations were available describing the formation of such an object in the collapse 
of a star with a white-dwarf core. Today hope to discover the first black hole is 
not least among the forces propelling more than one research: How does rotation 
influence the properties of a black hole? What kind of pulse of gravitational radiation 
comes off when such an object is formed? What spectrum of x-rays emerges when 
gas from a companion star piles up on its way into a black hole? 5 All such investi¬ 
gations and more base themselves on Schwarzschild’s standard 1916 static and 
spherically symmetric solution of Einstein’s field equations, first really understood 
in the modern sense in 1960, and in 1963 generalized to a black hole endowed with 
angular momentum. 

Beyond solar-system tests and applications of relativity, beyond pulsars, neutron 
stars, and black holes, beyond geometrostatics (compare electrostatics!) and station¬ 
ary geometries (compare the magnetic field set up by a steady current!) lies geo¬ 
metrodynamics in the full sense of the word (compare electrodynamics!). Nowhere 
does Einstein’s great conception stand out more clearly than here, that the geometry 
of space is a new physical entity, with degrees of freedom and a dynamics of its 
own. Deformations in the geometry of space, he predicted in 1918, can transport 
energy from place to place. Today, thanks to the initiative of Joseph Weber, detectors 
of such gravitational radiation have been constructed and exploited to give upper 
limits to the flux of energy streaming past the earth at selected frequencies. Never 
before has one realized from how many kinds of processes significant gravitational 
radiation can be anticipated. Never before has there been more interest in picking 
up this new kind of signal and using it to diagnose faraway events. Never before 
has there been such a drive in more than one laboratory to raise instrumental 
sensitivity until gravitational radiation becomes a workaday new window on the 
universe. 

The expansion of the universe is the greatest of all tests of Einstein’s geometro¬ 
dynamics, and cosmology the greatest of all applications. Making a prediction too 
fantastic for its author to credit, the theory forecast the expansion years before it 
was observed (1929). Violating the short time-scale that Hubble gave for the expan¬ 
sion, and in the face of “theories” (“steady state”; “continuous creation”) manufac¬ 
tured to welcome and utilize this short time-scale, standard general relativity 
resolutely persisted in the prediction of a long time-scale, decades before the astro- 


r> As of April 1973, there are significant indications that Cygnus X-l and other compact x-ray sources 
may be black holes. 
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physical discovery (1952) that the Hubble scale of distances and times was wrong, 
and had to be stretched by a factor of more than five. Disagreeing by a factor of 
the order of thirty with the average density of mass-energy in the universe deduced 
from astrophysical evidence as recently as 1958, Einstein’s theory now as in the past 
argues for the higher density, proclaims “the mystery of the missing matter,” and 
encourages astrophysics in a continuing search that year by year turns up new 
indications of matter in the space between the galaxies. General relativity forecast 
the primordial cosmic fireball radiation, and even an approximate value for its 
present temperature, seventeen years before the radiation was discovered. This 
radiation brings information about the universe when it had a thousand times smaller 
linear dimensions, and a billion times smaller volume, than it does today. Quasistellar 
objects, discovered in 1963, supply more detailed information from a more recent 
era, when the universe had a quarter to half its present linear dimensions. Telling 
about a stage in the evolution of galaxies and the universe reachable in no other 
way, these objects are more than beacons to light up the far away and long ago. 
They put out energy at a rate unparalleled anywhere else in the universe. They eject 
matter with a surprising directivity. They show a puzzling variation with time, 
different between the microwave and the visible part of the spectrum. Quasistellar 
objects on a great scale, and galactic nuclei nearer at hand on a smaller scale, voice 
a challenge to general relativity: help clear up these mysteries! 

If its wealth of applications attracts many young astrophysicists to the study of 
Einstein’s geometrodynamics, the same attraction draws those in the world of physics 
who are concerned with physical cosmology, experimental general relativity, gravi¬ 
tational radiation, and the properties of objects made out of superdense matter. Of 
quite another motive for study of the subject, to contemplate Einstein’s inspiring 
vision of geometry as the machinery of physics, we shall say nothing here because 
it speaks out, we hope, in every chapter of this book. 

Why a new book? The new applications of general relativity, with their extraor¬ 
dinary physical interest, outdate excellent textbooks of an earlier era, among them 
even that great treatise on the subject written by Wolfgang Pauli at the age of 
twenty-one. In addition, differential geometry has undergone a transformation of 
outlook that isolates the student who is confined in his training to the traditional 
tensor calculus of the earlier texts. For him it is difficult or impossible either to read 
the writings of his up-to-date mathematical colleague or to explain the mathematical 
content of his physical problem to that friendly source of help. We have not seen 
any way to meet our responsibilities to our students at our three institutions except 
by a new exposition, aimed at establishing a solid competence in the subject, con¬ 
temporary in its mathematics, oriented to the physical and astrophysical applications 
of greatest present-day interest, and animated by belief in the beauty and simplicity 
of nature. 


High Island 
South Bristol, Maine 
September 4, 1972 


Charles W Misner 

Kip S. Thorne 

John Archibald Wheeler 
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SPACETIME PHYSICS 

Wherein the reader is led, once quickly (§ 1. 1), 
then again more slowly, down the highways and 
a few byways of Einstein's geometrodynamics — 
without benefit of a good mathematical compass. 




CHAPTER 


GEOMETRODYNAMICS 


IN BRIEF 


§1.1. THE PARABLE OF THE APPLE 

One day in the year 1666 Newton had gone to the country, 
and seeing the fail of an apple, as his niece told me, let himself 
be led into a deep meditation on the cause which thus 
draws every object along a line whose extension would pass 
almost through the center of the Earth. 

VOLTAIRE (1738) 

Once upon a time a student lay in a garden under an apple tree reflecting on the 
difference between Einstein’s and Newton’s views about gravity. He was startled 
by the fall of an apple nearby. As he looked at the apple, he noticed ants beginning 
to run along its surface (Figure 1.1). His curiosity aroused, he thought to investigate 
the principles of navigation followed by an ant. With his magnifying glass, he noted 
one track carefully, and, taking his knife, made a cut in the apple skin one mm 
above the track and another cut one mm below it. He peeled off the resulting little 
highway of skin and laid it out on the face of his book. The track ran as straight 
as a laser beam along this highway. No more economical path could the ant have 
found to cover the ten cm from start to end of that strip of skin. Any zigs and 
zags or even any smooth bend in the path on its way along the apple peel from 
starting point to end point would have increased its length. 

“What a beautiful geodesic.” the student commented. 

His eye fell on two ants starting off from a common point P in slightly different 
directions. Their routes happened to carry them through the region of the dimple 
at the top of the apple, one on each side of it. Each ant conscientiously pursued 
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Figure 1.1. 

The Riemannian geometry of the spacetime of general relativity is here symbolized by the two-dimen¬ 
sional geometry of the surface of an apple. The geodesic tracks followed by the ants on the apple’s 
surface symbolize the world line followed through spacetime by a free particle. In any sufficiently localized 
region of spacetime, the geometry can be idealized as flat, as symbolized on the apple’s two-dimensional 
surface by the straight-line course of the tracks viewed in the magnifying glass (“local Lorentz character” 
of geometry of spacetime). In a region of greater extension, the curvature of the manifold (four-dimen¬ 
sional spacetime in the case of the real physical world; curved two-dimensional geometry in the case 
of the apple) makes itself felt. Two tracks (I and ftf, originally diverging from a common point later 
approach, cross, and go off in very different directions. In Newtonian theory this effect is ascribed to 
gravitation acting at a distance from a center of attraction, symbolized here by the stem of the apple. 
According to Einstein a particle gets its moving orders locally, from the geometry of spacetime right 
where it is. Its instructions are simple: to follow the straightest possible track (geodesic). Physics is as 
simple as it could be locally. Only because spacetime is curved in the large do the tracks cross. Geome- 
trodynamics, in brief, is a double story of the effect of geometry on matter (causing originally divergent 
geodesics to cross) and the effect of matter on geometry (bending of spacetime initiated by concentration 
of mass, symbolized by effect of stem on nearby surface of apple). 


Einstein's local view of 
physics contrasted with 
Newton's "action at a 
distance" 


Physics is simple only when 
analyzed locally 


his geodesic. Each went as straight on his strip of appleskin as he possibly could. 
Yet because of the curvature of the dimple itself, the two tracks not only crossed 
but emerged in very different directions. 

“What happier illustration of Einstein’s geometric theory of gravity could one 
possibly ask?” murmured the student. “The ants move as if they were attracted 
by the apple stem. One might have believed in a Newtonian force at a distance. 
Yet from nowhere does an ant get his moving orders except from the local geometry 
along his track. This is surely Einstein’s concept that all physics takes place by 
‘local action.’ What a difference from Newton’s ‘action at a distance’ view of physics! 
Now I understand better what this book means.” 

And so saying, he opened his book and read, “Don’t try to describe motion 
relative to faraway objects. Physics is simple only when analyzed locally . And locally 



§1.2. SPACETIME WITH AND WITHOUT COORDINATES 

the world line that a satellite follows [in spacetime, around the Earth] is already 
as straight as any world line can be. Forget all this talk about ‘deflection’ and ‘force 
of gravitation.’ I’m inside a spaceship. Or I’m floating outside and near it. Do I 
feel any ‘force of gravitation’? Not at all. Does the spaceship ‘feel’ such a force? 

No. Then why talk about it? Recognize that the spaceship and I traverse a region 
of spacetime free of all force. Acknowledge that the motion through that region 
is already ideally straight” 

The dinner bell was ringing, but still the student sat, musing to himself. “Let me 
see if I can summarize Einstein’s geometric theory of gravity in three ideas: (1) 
locally, geodesics appear straight; (2) over more extended regions of space and time, 

geodesics originally receding from each other begin to approach at a rate governed Space tells matter how to 
by the curvature of spacetime, and this effect of geometry on matter is what we move 
mean today by that old word ‘gravitation’; (3) matter in turn warps geometry. The 

dimple arises in the apple because the stem is there. I think I see how to put the Matter tells space how to 

whole story even more briefly: Space acts on matter, telling it how to move. In turn, curve 

matter reacts back on space, telling it how to curve. In other words, matter here,” 

he said, rising and picking up the apple by its stem, “curves space here. To produce 

a curvature in space here is to force a curvature in space there,” he went on, as 

he watched a lingering ant busily following its geodesic a finger’s breadth away from 

the apple’s stem. “Thus matter here influences matter there. That is Einstein’s 

explanation for ‘gravitation.’” 

Then the dinner bell was quiet, and he was gone, with book, magnifying glass—and 
apple. 

§1.2. SPACETIME WITH AND WITHOUT COORDINATES 

Now it came to me: . . . the independence of the 
gravitational acceleration from the nature of the falling 
substance , may be expressed as follows: In a 
gravitational field (of small spatial extension) things 
behave as they do in a space free of gravitation. . . . This 
happened in 1908. Why were another seven years required 
for the construction of the general theory of relativity? 

The mam reason lies in the fact that it is not so easy to 
free oneself from the idea that coordinates must have an 

immediate metrical meaning. 

ALBERT EINSTEIN [in Schilpp (1949), pp. 65-67.] 

Nothing is more distressing on first contact with the idea of “curved spacetime” than Problem: how to measure in 

the fear that every simple means of measurement has lost its power in this unfamiliar curved spacetime 

context. One thinks of oneself as confronted with the task of measuring the shape 

ot a gigantic and fantastically sculptured iceberg as one stands with a meter stick 

in a tossing rowboat on the surface of a heaving ocean. Were it the rowboat itself 

whose shape were to be measured, the procedure would be simple enough. One 

would draw it up on shore, turn it upside down, and drive tacks in lightly at stratesic 

points here and there on the surface. The measurement of distances from tack to 
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Figure 1.2. 

The crossing of straws in a barn full of hay is a symbol for the world lines that fill up spacetime. By 
their crossings and bends, these world lines mark events with a uniqueness beyond all need of coordinate 
systems or coordinates. Typical events symbolized in the diagram, from left to right (black dots), are: 
absorption of a photon; reemission of a photon; collision between a particle and a particle; collision 
between a photon and a particle; another collision between a photon and a particle; explosion of a 
firecracker; and collision of a particle from outside with one of the fragments of that firecracker. 


tack would record and reveal the shape of the surface. The precision could be made 
arbitrarily great by making the number of tacks arbitrarily large. It takes more daring 
to think of driving several score pitons into the towering iceberg. But with all the 
daring in the world, how is one to drive a nail into spacetime to mark a point? 
Happily, nature provides its own way to localize a point in spacetime, as Einstein 
Resolution: characterize was the first to emphasize. Characterize the point by what happens there! Give a 

events by what happens point in spacetime the name “event.” Where the event lies is defined as clearly and 

there sharply as where two straws cross each other in a barn full of hay (Figure 1.2). To 

say that the event marks a collision of such and such a photon with such and such 
a particle is identification enough. The world lines of that photon and that particle 
are rooted in the past and stretch out into the future. They have a rich texture of 
connections with nearby world lines. These nearby world lines in turn are linked 
in a hundred ways with world lines more remote. How then does one tell the location 
of an event? Tell first what world lines participate in the event. Next follow each 
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Figure 1.3. 

Above: Assigning “telephone numbers” to events by way of a system of coordinates. To say that the 

coordinate system is “smooth is to say that events which are almost in the same place have almost 

the same coordinates. Below: Putting the same set of events into equally good order by wav of a different 
system of coordinates. Picked out specially here are two neighboring events: an event named with 
coordinates (.vVv 1 ) = (77.2.22.6) and (.v n . ,v T ) = (18.5. 51.4^ and anevent named "v"' with coordinates 
(a .a 1 ) = (79.9. 20.1) and (.v**. .v 1 ) = (18.4. 47.1). Events and i? are connected bv the separation "vector” 
£• 1 Precise definition of a vector in a curved spacetime demands going to the mathematical limit in 
which the two points have an indefinitely small separation [.Y-fold reduction of the separation — c 1 ]. 
and. in the resultant locally flat space, multiplying the separation up again by the factor A’[lim V x: 
“tangent space’*: "langent vector”]. Forego here that proper wav of stating matters, and forego complete 
accuracy: hence the quote around the word “vector”.) In each coordinate system the separation vector 
{ is characterized by "components” (differences in coordinate values between 'i* and c’)* 

(f’-f 1 ) = (79.9 - 77.2. 20.1 - 22.6) = (2.7. - 2.5). 

(£’. £ t ) = (18.4 — 18.5. 47.1 - 51.4) — (-0.1. -4.3). 

See Box 1.1 for further discussion of events, coordinates, and vectors. 
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The name of an event can 
even be arbitrary 


Coordinates provide a 
convenient naming system 


of these world lines. Name the additional events that they encounter. These events 
pick out further world lines. Eventually the whole barn of hay is catalogued. Each 
event is named. One can find one’s way as surely to a given intersection as the city 
dweller can pick his path to the meeting of St. James Street and Piccadilly. No 
numbers. No coordinate system. No coordinates. 

That most streets in Japan have no names, and most houses no numbers, illustrates 
one's ability to do without coordinates. One can abandon the names of two world 
lines as a means to identify the event where they intersect. Just as one could name 
a Japanese house after its senior occupant, so one can and often does attach arbitrary 
names to specific events in spacetime, as in Box 1.1. 

Coordinates, however, are convenient. How else from the great thick catalog of 
events, randomly listed, can one easily discover that along a certain world line one 
will first encounter event Trinity, then Baker, then Mike, then Argus—but not the 
same events in some permuted order? 

To order events, introduce coordinates! (See Figure 1.3.) Coordinates are four 
indexed numbers per event in spacetime; on a sheet of paper, only two. Trinity 
acquires coordinates 


(JC°, JC 1 , JC 2 , JC 3 ) = (77, 23,64, 11). 


Coordinates generally do not 
measure length 


Several coordinate systems 
can be used at once 


Vectors 


In christening events with coordinates, one demands smoothness but foregoes every 
thought of mensuration. The four numbers for an event are nothing but an elaborate 
kind of telephone number. Compare their “telephone” numbers to discover whether 
two events are neighbors. But do not expect to learn how many meters separate 
them from the difference in their telephone numbers! 

Nothing prevents a subscriber from being served by competing telephone systems, 
nor an event from being catalogued by alternative coordinate systems (Figure 1.3). 
Box 1.1 illustrates the relationships between one coordinate system and another, as 
well as the notation used to denote coordinates and their transformations. 

Choose two events, known to be neighbors by the nearness of their coordinate 
values in a smooth coordinate system. Draw a little arrow from one event to the 
other. Such an arrow is called a vector. (It is a well-defined concept in flat spacetime, 
or in curved spacetime in the limit of vanishingly small length; for finite lengths 
in curved spacetime, it must be refined and made precise, under the new name 
“tangent vector,” on which see Chapter 9.) This vector, like events, can be given 
a name. But whether named “John” or “Charles” or “Kip,” it is a unique, well- 
defined geometrical object. The name is a convenience, but the vector exists even 
without it. 

Just as a quadruple of coordinates 


(a: 0 , jc 1 , x 2 , x 3 ) = (77, 23, 64, 11) 

is a particularly useful name for the event “Trinity” (it can be used to identify what 
other events are nearby), so a quadruple of “components” 


(«°,€ 1 ,« 2 ,€ 3 ) = 0-2, -0.9, 0,2.1) 




Box 1.1 MATHEMATICAL NOTATION FOR EVENTS, COORDINATES, AND VECTORS 


Events are denoted by capital script, one-letter Latin names such as 
Sometimes subscripts are used: 

Coordinates of an event g? are denoted by 
or by 

or more abstractly by 

where it is understood that Greek indices can take on any value 0, 1, 
2, or 3. 

Time coordinate (when one of the four is picked to play this role) 

Space coordinates are 

and are sometimes denoted by 

It is to be understood that Latin indices take on values 1, 2 S or 3. 

Shorthand notation: One soon tires of writing explicitly the functional depen¬ 
dence of the coordinates, x^(yp)\ so one adopts the shorthand notation 
for the coordinates of the event and 

for the space coordinates. One even begins to think of as representing 
the event yP itself, but must remind oneself that the values of x°, x 1 , x*, 
x 3 depend not only on the choice of yP but also on the arbitrary choice 
of coordinates! 

Other coordinates for the same event yP may be denoted 


example: In Figure 1.3 (a 0 , a 1 ) = (77.2. 22.6) and (a (1 .a t ) = (18.5.51.4) 
refer to the same event. The bars, primes, and hats distinguish one 
coordinate system from another; by putting them on the indices rather 
than on the a’s, we simplify later notation. 

Transformation from one coordinate system to another is achieved by the four 
functions 


which are denoted more succinctly 

Separation vector* (little arrow) reaching from one event i? to neighboring event 
yP can be denoted abstractly by 

It can also be characterized by the coordinate-value dilferencesf between 
yP and £ (called "components” of the vector) 


Transformation of components of a vector from one coordinate system to another 
is achieved by partial derivatives of transformation equations 

since = x Tl {rf) - a“(l 7 ) = (?.v r * Ta^aV) - A'VlJ.t 

Einstein summation convention is used here: 

any index that is repeated in a product is automatically summed on 


2 , (! , frJ. 

^ 6 * 

t(‘P). x(yP). }'(&), z(yp), 
x 0 ( L f), a 1 ^/ 7 ), ,V 2 (^), 
a 3 (^), 

x^(yP) or x a (iP), 


x l (iP ), a 2 (‘?), a 3 (^) 
x*(yP) or x k (yP) or .. 


A' a (^) or just A a , 
A‘ a {yP) or just x a ‘, 
x a {yP) or just x a . 


a°(a°, a 1 , a 2 , a 3 ), 

A*(A°. A 1 . A 2 . A 3 ), 
A-(A°, A 1 , A 2 . A 3 ), 
x\\ A \ A 1 , A 2 . A 3 ). 

x%x- 8 ). 


u or v or <f. or yP — 
Z"=x f '(yP) - X«(i*), 

f* = X n (yP ) — A rt (-). 


fhi> definition of a vector is valid onlv in flat spacetime. The refined definition (’'tangent vector") in curved spacetime 
is not spelled out here (see Chapter 9). hut flat-geometry ideas applv with good approximation even in a curved »eometr\ 
when the two points are sufticientlv close. 

t These formulas are precisely accurate only « hen the region of spacetime under consideration is flat and when in addition 
e coordinates are Lorentzian. Otherwise they are approximate—though they become arbitrarily good when the separation 
between points and the length of the vector become arhitrarifv small. 
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Coordinate singularities 
normally unavoidable 


Continuity of spacetime 


The mathematics of 
manifolds applied to the 
physics of spacetime 


Dimensionality of spacetime 


is a convenient name for the vector “John” that reaches from 

(jc 0 ,* 1 ,* 2 ,* 3 ) = (77,23,64, 11) 


to 


(x°,x\x 2 , x 3 ) = (78.2,22.1,64.0, 13.1). 

How to work with the components of a vector is explored in Box LI. 

There are many ways in which a coordinate system can be imperfect. Figure 1.4 
illustrates a coordinate singularity. For another example of a coordinate singularity, 
run the eye over the surface of a globe to the North Pole. Note the many meridians 
that meet there (“collapse of cells of egg crates to zero content”). Can’t one do better? 
Find a single coordinate system that will cover the globe without singularity? A 
theorem says no. Two is the minimum number of “coordinate patches” required 
to cover the two-sphere without singularity (Figure 1.5). This circumstance empha¬ 
sizes anew that points and events are primary, whereas coordinates are a mere 
bookkeeping device. 

Figures 1.2 and 1.3 show only a few world lines and events. A more detailed 
diagram would show a maze of world lines and of light rays and the intersections 
between them. From such a picture, one can in imagination step to the idealized 
limit: an infinitely dense collection of light rays and of world lines of infinitesimal 
test particles. With this idealized physical limit, the mathematical concept of a 
continuous four-dimensional “manifold” (four-dimensional space with certain 
smoothness properties) has a one-to-one correspondence; and in this limit continu¬ 
ous, differentiable (i.e., smooth) coordinate systems operate. The mathematics then 
supplies a tool to reason about the physics. 

A simple countdown reveals the dimensionality of the manifold. Take a point 9? 
in an ^-dimensional manifold. Its neighborhood is an n-dimensional ball (i.e., the 
interior of a sphere whose surface has n — 1 dimensions). Choose this ball so that 
its boundary is a smooth manifold. The dimensionality of this manifold is (n — 1). 
In this (n — l)-dimensional manifold, pick a point 2. Its neighborhood is an 
(n — l)-dimensional ball. Choose this ball so that..., and so on. Eventually one 
comes by this construction to a manifold that is two-dimensional but is not yet known 
to be two-dimensional (two-sphere). In this two-dimensional manifold, pick a point 
Its neighborhood is a two-dimensional ball (“disc”). Choose this disc so that 
its boundary is a smooth manifold (circle). In this manifold, pick a point 9%. Its 
neighborhood is a one-dimensional ball, but is not yet known to be one-dimensional 
(“line segment”). The boundaries of this object are two points. This circumstance 
tells that the intervening manifold is one-dimensional; therefore the previous mani¬ 
fold was two-dimensional; and so on. The dimensionality of the original manifold 
is equal to the number of points employed in the construction. For spacetime, the 
dimensionality is 4. 

This kind of mathematical reasoning about dimensionality makes good sense at 
the everyday scale of distances, at atomic distances (10~ 8 cm), at nuclear dimensions 
(10~ 13 cm), and even at lengths smaller by several powers of ten, if one judges by 
the concord between prediction and observation in quantum electrodynamics at high 



How a mere coordinate singularity arises. Above: A coordinate system becomes singular when the “cells 
in the egg crate” are squashed to zero volume. Below: An example showing such a singularity in the 
Schwarzschild coordinates r, t often used to describe the geometry around a black hole (Chapter 31). 
For simplicity the angular coordinates 6 , <> have been suppressed. The singularity shows itself in two 
ways. First, all the points along the dotted line, while quite distinct one from another, are designated 
by the same pair of (r, t) values; namely, r = 2m, t = x. The coordinates provide no way to distinguish 
these points. Second, the “cells in the egg crate,” of which one is shown grey in the diagram, collapse 
to zero content at the dotted line. In summary, there is nothing strange about the geometry at the dotted 
line; all the singularity lies in the coordinate system (“poor system of telephone numbers”). No confusion 
should be permitted to arise from the accidental circumstance that the t coordinate attains an infinite 
value on the dotted line. No such infinity would occur if / were replaced by the new coordinate?, defined 
by 

(t/2m) = tan(7/2 m). 

When t — x, the new coordinate t is t = rrm. The r, t coordinates still provide no way to distinguish 
the points along the dotted line. They still give “cells in the egg crate” collapsed to zero content alone 
the dotted line. 







Figure 1.5. 

Singularities in familiar coordinates on the two-sphere can be eliminated by covering the sphere with 
two overlapping coordinate patches. A. Spherical polar coordinates, singular at the North and South 
Poles, and discontinuous at the international date line. B. Projection of the Euclidean coordinates of 
the Euclidean two-plane, tangent at the North Pole, onto the sphere via a line running to the South 
Pole; coordinate singularity at the South Pole. C. Coverage of two-sphere by two overlapping coordinate 
patches. One, constructed as in B, covers without singularity the northern hemisphere and also the 
southern tropics down to the Tropic of Capricorn. The other (grey) also covers without singularity all 
of the tropics and the southern hemisphere besides. 


Breakdown in smoothness of 
spacetime at Planck length 

L* = (fiG/c 3 ) 1/2 

= [(1,054 x 10“ 27 g cm 2 /sec)(6.670 x 10" 8 cm 3 /g sec 2 )] 1/2 X 

X (2.998 X 10 10 cm/sec)“ 3/2 (1.1) 

= 1.616 x 10“ 33 cm. 

No one has found any way to escape this prediction. As nearly as one can estimate, 
these fluctuations give space at small distances a “multiply connected” or “foamlike” 
character. This lack of smoothness may well deprive even the concept of dimension¬ 
ality itself of any meaning at the Planck scale of distances. The further exploration 
of this issue takes one to the frontiers of Einstein’s theory (Chapter 44). 

If spacetime at small distances is far from the mathematical model of a continuous 
manifold, is there not also at larger distances a wide gap between the mathematical 


energies (corresponding de Broglie wavelength 10 -16 cm). Moreover, classical general 
relativity thinks of the spacetime manifold as a deterministic structure, completely 
well-defined down to arbitrarily small distances. Not so quantum general relativity 
or “quantum geometrodynamics.” It predicts violent fluctuations in the geometry 
at distances on the order of the Planck length, 


§1.3. WEIGHTLESSNESS 
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idealization and the physical reality? The infinitely dense collection of light rays 
and of world lines of infinitesimal test particles that are to define all the points of 
the manifold: they surely are beyond practical realization. Nobody has ever found 
a particle that moves on timelike world lines (finite rest mass) lighter than an electron. 
A collection of electrons, even if endowed with zero density of charge (e + and e - 
world lines present in equal numbers) will have a density of mass. This density will 
curve the very manifold under study. Investigation in infinite detail means unlimited 
density, and unlimited disturbance of the geometry. 

However, to demand investigatability in infinite detail in the sense just described 
is as out of place in general relativity as it would be in electrodynamics or gas 
dynamics. Electrodynamics speaks of the strength of the electric and magnetic field 
at each point in space and at each moment of time. To measure those fields, it is 
willing to contemplate infinitesimal test particles scattered everywhere as densely 
as one pleases. However, the test particles do not have to be there at all to give 
the field reality. The field has everywhere a clear-cut value and goes about its 
deterministic dynamic evolution willy-nilly and continuously, infinitesimal test 
particles or no infinitesimal test particles. Similarly with the geometry of space. 

In conclusion, when one deals with spacetime in the context of classical physics, 
one accepts (1) the notion of “infinitesimal test particle” and (2) the idealization 
that the totality of identifiable events forms a four-dimensional continuous manifold. 
Only at the end of this book will a look be taken at some of the limitations placed 
by the quantum principle on one’s way of speaking about and analyzing spacetime. 


Difficulty in defining 
geometry even at classical 
distances? 


No; one must accept 
geometry at classical 
distances as meaningful 


§1.3. WEIGHTLESSNESS 


“Gravity is a great mystery. Drop a stone. See it fall. Hear it hit. No one understands 
why.” What a misleading statement! Mystery about fall? What else should the stone 
do except fall? To fall is normal. The abnormality is an object standing in the way 
of the stone. If one wishes to pursue a “mystery,” do not follow the track of the 
falling stone. Look instead at the impact, and ask what was the force that pushed 
the stone away from its natural “world line,” (i.e., its natural track through space- 
time). That could lead to an interesting issue of solid-state physics, but that is not 
the topic of concern here. Fall is. Free fall is synonymous with weightlessness: 
absence of any force to drive the object away from its normal track through space- 
time. Travel aboard a freely falling elevator to experience weightlessness. Or travel 
aboard a spaceship also falling straight toward the Earth. Or, more happily, travel 
aboard a spaceship in that state of steady fall toward the Earth that marks a circular 
orbit. In each case one is following a natural track through spacetime. 

The traveler has one chemical composition, the spaceship another; yet they travel 
together, the traveler weightless in his moving home. Objects of such different nuclear 
constitution as aluminum and gold fall with accelerations that agree to better than 
one part in 10 11 , according to Roll, Krotkov, and Dicke (1964), one of the most 
important null experiments in all physics (see Figure 1.6). Individual molecules fall 
in step, too, with macroscopic objects [Estermann, Simpson, and Stern (1938)]; and 
so do individual neutrons [Dabbs, Harvey, Paya, and Horstmann (1965)], individual 

- -- - - - - .{continued on page 16) 


Free fall is the natural state 
of motion 


All objects fall with the same 
acceleration 



14 


1. GEOMETRODYNAMICS IN BRIEF 


Figure 1.6. 

Principle of the Roll-Krotkov-Dicke experiment, which showed that the gravitational accelerations of 
gold and aluminum are equal to 1 part in 10 U or better (Princeton, 1964). In the upper lefthand corner, 
equal masses of gold and aluminum hang from a supporting bar. This bar in turn is supported at its 
midpoint. If both objects fall toward the sun with the same acceleration of g = 0.59 cm/sec 2 , the bar 
does not turn. If the Au mass receives a higher acceleration, g + 8g. then the gold end of the bar starts 
to turn toward the sun in the Earth-fixed frame. Twelve hours later the sun is on the other side, pulling 
the other way. The alternating torque lends itself to recognition against a background of noise because 
of its precise 24-hour period. Unhappily, any substantial mass nearby, such as an experimenter, located 
at M. will produce a torque that swamps the effect sought. Therefore the actual arrangement was as 
showm in the body of the figure. One gold weight and two aluminum weights were supported at the 
three corners of a horizontal equilateral triangle, 6 cm on a side (three-fold axis of symmetry, giving 
zero response to all the simplest nonuniformities in the gravitational field). Also, the observers performed 
all operations remotely to eliminate their own gravitational effects*. To detect a rotation of the torsion 
balance as small as — lO*" 9 rad without disturbing the balance. Roll, Krotkov, and Dicke reflected a 
very weak light beam from the optically flat back face of the quartz triangle. The image of the source 
slit fell on a wire of about the same size as the slit image. The light transmitted past the wire fell on 
a photomultiplier. A separate oscillator circuit drove the wire back and forth across the image at 3,000 
hertz. When the image was centered perfectly, only even harmonics of the oscillation frequency appeared 
in the light intensity. However, when the image was displaced slightly to one side, the fundamental 
frequency appeared in the light intensity. The electrical output of the photomultiplier then contained 
a 3,000-hertz component. The magnitude and sign of this component were determined automatically. 
Equally automatically a proportional d.c. voltage was applied to the electrodes shown in the diagram. 
It restored the torsion balance to its zero position. The d.c. voltage required to restore the balance to 
its zero position w ? as recorded as a measure of the torque acting on the pendulum. This torque was 
Fourier-analyzed over a period of many days. The magnitude of the Fourier component of 24-hour 
period indicated a ratio 8g/g = (0.96 ± 1.04) x 10 -11 . Aluminum and gold thus fall with the same 
acceleration, despite their important differences summarized in the table. 


Ratios 

A! 

Au 

Number of neutrons 



Number of proions 

1.08 

1.5 

Mass of kinetic energy of K-electron 

0.005 

0.16 

Rest mass of electron 

Electrostatic mass-energv of nucleus 

0.001 

0.004 

Mass of atom 


The theoretical implications of this experiment will be discussed in greater detail in Chapters 16 and 38. 

Braginsky and Panov (1971) at Moscow University performed an experiment identical in principle 
to that of Dicke-Roll-Krotkov, but with a modified experimental set-up. Comparing the accelerations 
of platinum and aluminum rather than of gold and aluminum, they say that 

8g/g < 1 x 10-“ 


* Other perturbations had to be, and were, guarded against. (1) A bit of iron on the torsion balance 
as big as 10 -3 cm on a side would have contributed, in the Earth’s magnetic field, a torque a hundred 
times greater than the measured torque. (2) The unequal pressure of radiation on the two sides of a 
mass would have produced an unacceptably large perturbation if the temperature difference between 
these two sides had exceeded 10 -4 °K. (3) Gas evolution from one side of a mass would have propelled 
it like a rocket. If the rate of evolution were as great as 10" 8 g/day, the calculated force would have 
been — 10' 7 g cm/sec 2 , enough to affect the measurements. (4) The rotation was measured with respect 
to the pier that supported the equipment. As a guarantee that this pier did not itself rotate, it was anchored 
to bed rock. (5) Electrostatic forces were eliminated; otherwise they would have perturbed the balance. 
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electrons [Witteborn and Fairbank (1967)] and individual mu mesons [Beall (1970)]. 
What is more, not one of these objects has to see out into space to know how to 
move. 

Contemplate the interior of a spaceship, and a key, penny, nut, and pea by accident 
or design set free inside. Shielded from all view of the world outside by the walls 
of the vessel, each object stays at rest relative to the vessel. Or it moves through 
the room in a straight line with uniform velocity. That is the lesson which experience 
shouts out. 

Forego talk of acceleration! That, paradoxically, is the lesson of the circumstance 
that “all objects fall with the same acceleration.” Whose fault were those accelera¬ 
tions, after all? They came from allowing a groundbased observer into the act. The 


r 


Box 1.2 MATERIALS OF THE MOST DIVERSE COMPOSITION FALL WITH 
THE SAME ACCELERATION ("STANDARD WORLD LINE") 


Aristotle: “the downward movement of a mass of 
gold or lead, or of any other body endowed with 
weight, is quicker in proportion to its size.” 

Pre-Galilean literature: metal and wood weights 
fall at the same rate. 

Galileo: (1) “the variation of speed in air between 
balls of gold, lead, copper, porphyry, and other 
heavy materials is so slight that in a fall of 100 
cubits [about 46 meters] a ball of gold would surely 
not outstrip one of copper by as much as four 
fingers. Having observed this, I came to the con¬ 
clusion that in a medium totally void of resistance 
all bodies would fall with the same speed.” (2) 
later experiments of greater precision “diluting 
gravity” and finding same time of descent for 
different objects along an inclined plane. 

Newton: inclined plane replaced by arc of pendu¬ 
lum bob; “time of fall” for bodies of different 
composition determined by comparing time of 
oscillation of pendulum bobs of the two materials. 
Ultimate limit of precision in such experiments 
limited by problem of determining effective length 
of each pendulum: (acceleration) = ( 27 r/pe- 
riod) 2 (length). 


Lorand von Eotvos, Budapest, 1889 and 1922: 
compared on the rotating earth the vertical defined 
by a plumb bob of one material with the vertical 
defined by a plumb bob of other material. The 
two hanging masses, by the two unbroken threads 
that support them, were drawn along identical 
world lines through spacetime (middle of the labo¬ 
ratory of Eotvos!). If cut free, would they also 
follow identical tracks through spacetime (“normal 
world line of test mass”)? If so, the acceleration 
that draws the actual world line from the normal 
free-fall world line will have a standard value, a. 
The experiment of Eotvos did not try to test agree¬ 
ment on the magnitude of a between the two 
masses. Doing so would have required (1) cutting 
the threads and (2) following the fall of the two 
masses. Eotvos renounced this approach in favor 
of a static observation that he could make with 
greater precision, comparing the direction of a for 
the two masses. The direction of the supporting 
thread, so his argument ran, reveals the direction 
in which the mass is being dragged away from its 
normal world line of “free fall” or “weightless¬ 
ness.” This acceleration is the vectorial resultant 
of (1) an acceleration of magnitude g , directed 
outward against so-called gravity, and (2) an ac¬ 
celeration directed toward the axis of rotation of 
the earth, of magnitude co 2 R sin 8 (w, angular ve- 





§1.3. WEIGHTLESSNESS 
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push of the ground under his feet was driving him away from a natural world line. 
Through that flaw in his arrangements, he became responsible for all those accelera¬ 
tions. Put him in space and strap rockets to his legs. No difference!* Again the 
responsibility for what he sees is his. Once more he notes that “all objects fall with 

* No difference” spelled out amounts to Einstein’s (1911) principle of the local equivalence between a 
gravitational field and an acceleration: “ We arrive at a very satisfactory interpretation of this law of 
experience, if we assume that the systems K and K' are physically exactly equivalent, that is, if we assume 
that we may just as well regard the system K as being in a space free from gravitational fields, if we then 
regard K as uniformly accelerated. This assumption of exact physical equivalence makes it impossible for 
us to speak of the absolute acceleration of the system of reference, just as the usual theory of relativity 
forbids us to talk of the absolute velocity of a system; and it makes the equal falling of all bodies in a 
gravitational field seem a matter of coursed* 


locity; R , radius of earth; 8 , polar angle measured 
from North Pole to location of experiment). This 
centripetal acceleration has a vertical component 
— to 2 R sin 2 0 too small to come into discussion. 
The important component is co 2 R sin 8 cos 8 , di¬ 
rected northward and parallel to the surface of the 
earth. It deflects the thread by the angle 

horizontal acceleration 
vertical acceleration 


co 2 R sin 8 cos 8 

8 


3.4 cm/sec 2 
980 cm/sec 2 


sin 8 cos 8 


= 1.7 X 10 3 radian at 8 = 45 ° 


from the straight line connecting the center of the 
earth to the point of support. A difference, 8g , of 
one part in 10 8 between g for the two hanging 
substances would produce a difference in angle of 
hang of plumb bobs equal to 1.7 x 10~ n radian 
at Budapest (8 = 42.5°). Eotvos reported 8g/g less 
than a few parts in 10 9 . 


Roll, Krotkov, and Dicke, Princeton, 1964: em¬ 
ployed as fiducial acceleration, not the 1.7 cm/sec 2 
steady horizontal acceleration, produced by the 
earth’s rotation at 8 = 45% but the daily alternat¬ 


ing 0.59 cm/sec 2 produced by the sun’s attraction. 
Reported |g(Au) - g(Al)\/g less than 1 X 10" u . 
See Figure 1.6. 

Braginsky and Panov, Moscow, 1971: like Roll, 
Krotkov, and Dicke, employed Sun’s attraction as 
fiducial acceleration. Reported |g(Pt) — g(Al)|/g 
less than 1 X 10“ 12 . 


Beall, 1970: particles that are deflected less by the 
Earth’s or the sun’s gravitational field than a pho¬ 
ton would be, effectively travel faster than light. 
If they are charged or have other electromagnetic 
structure, they would then emit Cerenkov radia¬ 
tion, and reduce their velocity below threshold in 
less than a micron of travel. The threshold is at 
energies around 10 3 me 2 . Ultrarelativistic particles 
in cosmic-ray showers are not easily identified, but 
observations of 10 13 eV muons show that muons 
are not “too light” by as much as 5 X 10“ 5 . Con¬ 
versely, a particle P bound more strongly than 
photons by gravity will transfer the momentum 
needed to make pair production y P + P occur 
within a submicron decay length. The existence of 
photons with energies above 10 13 eV shows that 
e ± are not “too heavy” by 5 parts in 10 9 , pr not 
by 2 in 10% A, Z% Q~ not by a few per cent. 



Figure 1.7. 

“Weightlessness” as test for a local inertial frame of reference (“Lorentz frame”). Each spring-driven 
cannon succeeds in driving its projectile, a steel ball bearing, through the aligned holes in the sheets 
of lucite, and into the woven-mesh pocket, when the frame of reference is free of rotation and in free 
fall (“normal world line through spacetime”). A cannon would fail (curved and ricocheting trajectory 
at bottom of drawing) if the frame were hanging as indicated when the cannon went off (“frame drawn 
away by pull of rope from its normal world line through spacetime”). Harold Waage at Princeton has 
constructed such a model for an inertial reference frame with lucite sheets about 1 m square. The “fuses” 
symbolizing time delay were replaced by electric relays. Penetration fails if the frame (1) rotates, (2) 
accelerates, or (3) does any combination of the two. It is difficult to cite any easily realizable device 
that more fully illustrates the meaning of the term “local Lorentz frame.” 


Eliminate the acceleration by 
use of a local inertial frame 


the same acceleration.” Physics looks as complicated to the jet-driven observer as 
it does to the man on the ground. Rule out both observers to make physics look 
simple. Instead, travel aboard the freely moving spaceship. Nothing could be more 
natural than what one sees: every free object moves in a straight line with uniform 
velocity. This is the way to do physics! Work in a very special coordinate system: 
a coordinate frame in which one is weightless; a local inertial frame of reference. 
Or calculate how things look in such a frame. Or—if one is constrained to a ground- 
based frame of reference—use a particle moving so fast, and a path length so limited, 
that the ideal, freely falling frame of reference and the actual ground-based frame 
get out of alignment by an amount negligible on the scale of the experiment. [Given 
a 1,500-m linear accelerator, and a 1 GeV electron, time of flight ~ (1.5 X 10 5 cm)/ 
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(3 X 10 10 cm/sec) = 0.5 X 10 -5 sec; fall in this time = (490 cm/sec 2 )(0.5 X 

10 -5 sec) 2 ~ 10 -8 cm.] 

In analyzing physics in a local inertial frame of reference, or following an ant 
on his little section of apple skin, one wins simplicity by foregoing every reference 
to what is far away. Physics is simple only when viewed locally: that is Einstein’s 
great lesson. 

Newton spoke differently: “Absolute space, in its own nature, without relation 
to anything external, remains always similar and immovable.” But how does one 
give meaning to Newton’s absolute space, find its cornerstones, mark out its straight 
lines? In the real world of gravitation, no particle ever follows one of Newton’s 
straight lines. His ideal geometry is beyond observation. “A comet going past the 
sun is deviated from an ideal straight line.” No. There is no pavement on which 
to mark out that line. The “ideal straight line” is a myth. It never happened, and 
it never will. 

“It required a severe struggle [for Newton] to arrive at the concept of independent 
and absolute space, indispensible for the development of theory.... Newton’s decision 
was, in the contemporary state of science, the only possible one, and particularly the 
only fruitful one. But the subsequent development of the problems, proceeding in a 
roundabout way which no one could then possibly foresee, has shown that the resistance 
of Leibniz and Huygens, intuitively well-founded but supported by inadequate argu¬ 
ments, was actually justified.... It has required no less strenuous exertions subsequently 
to overcome this concept [of absolute space]” 

[A. EINSTEIN (1954)]. 

What is direct and simple and meaningful, according to Einstein, is the geometry 
in every local inertial reference frame. There every particle moves in a straight line 
with uniform velocity. Define the local inertial frame so that this simplicity occurs 
for the first few particles (Figure 1.7). In the frame thus defined, every other free 
particle is observed also to move in a straight line with uniform velocity. Collision 
and disintegration processes follow the laws of conservation of momentum and 
energy of special relativity. That all these miracles come about, as attested by tens 
of thousands of observations in elementary particle physics, is witness to the inner 
workings of the machinery of the world. The message is easy to summarize: (1) 
physics is always and everywhere locally Lorentzian; i.e., locally the laws of special 
relativity are valid; (2) this simplicity shows most clearly in a local Lorentz frame 
of reference (“inertial frame of reference”; Figure 1.7); and (3) to test for a local 
Lorentz frame, test for weightlessness! 

§1.4. LOCAL LORENTZ GEOMETRY, 

WITH AND WITHOUT COORDINATES 

On the surface of an apple within the space of a thumbprint, the geometry is 
Euclidean (Figure 1.1; the view in the magnifying glass). In spacetime, within a 
limited region, the geometry is Lorentzian. On the apple the distances between point 
and point accord with the theorems of Euclid. In spacetime the intervals (“proper 
distance,” “proper time”) between event and event satisfy the corresponding theo¬ 
rems of Lorentz-Minkowski geometry (Box 1.3). These theorems lend themselves 

{continued on page 23) 



Newton's absolute space is 
unobservable, nonexistent 


But Einstein's local inertial 
frames exist, are simple 


In local inertial frames, 
physics is Lorentzian 


Local Lorentz geometry is the 
spacetime analog of local 
Euclidean geometry. 
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Box 1.3 LOCAL LORENTZ GEOMETRY AND LOCAL EUCLIDEAN GEOMETRY 
WITH AND WITHOUT COORDINATES 


I. Local Euclidean Geometry 

What does it mean to say that the geometry of 
a tiny thumbprint on the apple is Euclidean? 

A. Coordinate-free language (Euclid): 

Given a line (16. Extend it by an equal 
distance 62. Let id be a point not on 62 
but equidistant from 6 and 2. Then 

O 2 - r 2 _l_ n 2 

■Vtf — ‘Vc’ * * 

(Theorem of Pythagoras; also other theo¬ 
rems of Euclidean geometry.) 

B. Language of coordinates (Descartes): 

From any point 6 to any other point id 
there is a distance s given in suitable (Eucli¬ 
dean) coordinates by 

W = - x\a)f + [* 2 ($) - x\a )\ 2 . 

If one succeeds in finding any coordinate 
system where this is true for all points 6 
and id in the thumbprint, then one is guar¬ 
anteed that (i) this coordinate system is 
locally Euclidean, and (ii) the geometry of 
the apple’s surface is locally Euclidean. 




<N m cn 


* H * * 


II. Local Lorentz Geometry 

What does it mean to say that the geometry of 
a sufficiently limited region of spacetime in the 
real physical world is Lorentzian? 

A. Coordinate-free language (Robb 1936): 

Let 62 be the world line of a free particle. 
Let $ be an event not on this world line. 
Let a light ray from id strike 62 at the 
event 2. Let a light ray take off from such 
an earlier event 9 along 62 that it reaches 
Then the proper distance (spacelike 
separation) or proper time t^ tJ (timelike 
separation) is given by 



2 _ 2 
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Proof of above criterion for local Lorentz 
geometry, using coordinate methods in the 
local Lorentz frame where particle remains 
at rest: 

Vff ? 2 ~ t 1 ~ x 1 - {t - x){t 4 - x) 

= 7 a*? 7 a £- 



B. Language of coordinates (Lorentz, Poincare, 
Minkowski, Einstein): 

From any event (I to any other nearby 
event fg, there is a proper distance or 
proper time given in suitable (local 
Lorentz) coordinates by 


W = “W = - x°{d)f 

+ [x\@) - x\d)f 
+ [* 2 («) - x 2 (d)Y 
+ [* 3 ($) — x\tf)] 2 . 

If one succeeds in finding any coordinate 
system where this is locally true for all 
neighboring events (i and fg, then one is 
guaranteed that (i) this coordinate system 
is locally Lorentzian, and (ii) the geometry 
of spacetime is locally Lorentzian. 



III. Statements of Fact 

The geometry of an apple’s surface is locally Eu¬ 
clidean everywhere. The geometry of spacetime is 
locally Lorentzian everywhere. 


v. 




Box 1.3 (continued) 


IV. Local Geometry in the Language of 
Modern Mathematics 


local Euclidean coordinates (on apple) or 
local Lorentz coordinates (in spacetime). 


A. The metric for any manifold: 

At each point on the apple, at each event 
of spacetime, indeed, at each point of any 
“Riemannian manifold,” there exists a geo¬ 
metrical object called the metric tensor g. 
It is a machine with two input slots for the 
insertion of two vectors; 

slot 1 slot 2 

g( , )• 

Ifone inserts the same vector u into both slots, 
one gets out the square of the length of u: 

g{u , u) = u 2 . 

If one inserts two different vectors, u and v 
(it matters not in which order!), one gets out 
a number called the “scalar product of u on 
v” and denoted u • v: 



Let f be the separation vector reaching from 
Cf to Its components in the local Eucli¬ 
dean (Lorentz) coordinates are 

!<* = *«(#)■- x a (cf) 

(cf. Box 1.1). Then the squared length ofu ^ yJ , 
which is the same as the squared distance 
from Cf tomust be (cf. I.B. and II.B. above) 

(■ ( = g(C () = g a ^ 

= W = (I 1 ) 2 + (I 2 ) 2 on apple 

= - a 0 ) 2 + a 1 ) 2 + ( i 2 ) 2 + « 3 ) 2 

in spacetime. 


g(u, v) = g(v, u) ~ u ’ v — v' u. 

The metric is a linear machine: 

g(2u + 3w, v) = 2g(u, v) + 3 g(w, v), 
g(u , av + bw) = ag(u , v) + bg(u, w). 

Consequently, in a given (arbitrary) coordi¬ 
nate system, its operation on two vectors can 
be written in terms of their components as a 
bilinear expression: 

g(u, v) = 

(implied summation on a, ft) 

= gn ul v l + g\2 ul v 2 + £ 21 wV + • • • • 

The quantities g a p = g^ a (a and ft running 
from 0 to 3 in spacetime, from 1 to 2 on the 
apple) are called the “components of g in the 
given coordinate system.” 

B. Components of the metric in local Lorentz and 
local Euclidean frames: 

To connect the metric with our previous de¬ 
scriptions of the local geometry, introduce 


Consequently, the components of the met¬ 
ric are 

Sn = §22 ~ ^ §12 ~ §21 — 
i e -. gap = 8 ap on apple, in 

local Euclidean 
coordinates; 

£bo §0k §jk ^jk 

in spacetime, in 

local Lorentz 
coordinates. 

These special components of the metric in 
local Lorentz coordinates are written here 
and hereafter as or r] a/3 , by analogy 
with the Kronecker delta 8 a p. In matrix 
notation: 


— ft —- 

0 12 3 


1 0 

-1 

0 

0 

0 

a 1 

0 

1 

0 

0 

i 2 

0 

0 

1 

0 

* 3 

0 

0 

0 

1 
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to empirical test in the appropriate, very special coordinate systems: Euclidean 
coordinates in Euclidean geometry; the natural generalization of Euclidean coordi¬ 
nates (local Lorentz coordinates; local inertial frame) in the local Lorentz geometry 
of physics. However, the theorems rise above all coordinate systems in their content. 
They refer to intervals or distances. Those distances no more call on coordinates 
for their definition in our day than they did in the time of Euclid. Points in the 
great pile of hay that is spacetime; and distances between these points: that is 
geometry! State them in the coordinate-free language or in the language of coordi¬ 
nates: they are the same (Box 1.3). 


§1.5. TIME 

Time is defined so that motion looks simple. 


Time is awake when all things sleep. 
Time stands straight when all things fall. 

Time shuts in all and will not be shut. 
Is, was, and shall be are Time's children. 
0 Reasoning, be witness, be stable. 

VYASA, the Mahabarata (ca. A.D. 400) 


Relative to a local Lorentz frame, a free particle “moves in a straight line with 
uniform velocity.” What “straight” means is clear enough in the model inertial 
reference frame illustrated in Figure 1.7. But where does the “uniform velocity” come 
in? Or where does “velocity” show itself? There is not even one clock in the drawing! 

A more fully developed model of a Lorentz reference frame will have not only 
holes, as in Fig. 1.7, but also clock-activated shutters over each hole. The projectile 
can reach its target only if it (1) travels through the correct region in space and 
(2) gets through that hole in the correct interval of time (“window in time”). How 
then is time defined? Time is defined so that motion looks simple! 

No standard of time is more widely used than the day, the time from one high 
noon to the next. Take that as standard, however, and one will find every good clock 
or watch clashing with it, for a simple reason. The Earth spins on its axis and also 
revolves in orbit about the sun. The motion of the sun across the sky arises from 
neither effect alone, but from the two in combination, different in magnitude though 
they are. The fast angular velocity of the Earth on its axis (roughly 366.25 complete 
turns per year) is wonderfully uniform. Not so the apparent angular velocity of the 
sun about the center of the Earth (one turn per year). It is greater than average 
by 2 per cent when the Earth in its orbit (eccentricity 0.017) has come 1 per cent 
closer than average to the sun (Kepler’s law) and lower by 2 per cent when the 
Earth is 1 per cent further than average from the sun. In the first case, the momentary 
rate of rotation of the sun across the sky, expressed in turns per year, is approximately 


The time coordinate of a 
local Lorentz frame is so 
defined that motion looks 
simple 


366.25 - (1 + 0.02); 
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in the other, 


366.25 - (1 - 0.02). 

Taking the “mean solar day” to contain 24 X 3.600 = 86,400 standard seconds, one 
sees that, when the Earth is 1 per cent closer to (or further from) the sun than average, 
then the number of standard seconds from one high noon to the next is greater 
(or less) than normal by 


0.02 (drop in turns per year) 
365.25 (turns per year on average) 


86,400 sec — 4.7 sec. 


This is the bookkeeping on time from noon to noon. No standard of time that varies 
so much from one month to another is acceptable. If adopted, it would make the 
speed of light vary from month to month! 

This lack of uniformity, once recognized (and it was already recognized by the 
ancients), forces one to abandon the solar day as the standard of time; that day 
does not make motion look simple. Turn to a new standard that eliminates the motion 
of the Earth around the sun and concentrates on the spin of the Earth about its 
axis: the sidereal day, the time between one arrival of a star at the zenith and the 
next arrival of that star at the zenith. Good! Or good, so long as one’s precision 
of measurement does not allow one to see changes in the intrinsic angular velocity 
of the Earth. What clock was so bold as first to challenge the spin of the Earth for 
accuracy? The machinery of the heavens. 

Halley (1693) and later others, including Kant (1754), suspected something was 
amiss from apparent discrepancies between the paths of totality in eclipses of the 
sun, as predicted by Newtonian gravitation theory using the standard of time then 
current, and the location of the sites where ancient Greeks and Romans actually 
recorded an eclipse on the day in question. The moon casts a moving shadow in 
space. On the day of a solar eclipse, that shadow paints onto the disk of the spinning 
Earth a black brush stroke, often thousands of kilometers in length, but of width 
generally much less than a hundred kilometers. He who spins the globe upon the 
table and wants to make the shadow fall rightly on it must calculate back meticu¬ 
lously to determine two key items: (1) where the moon is relative to Earth and sun 
at each moment on the ancient day in question; and (2) how much angle the Earth 
has turned through from then until now. Take the eclipse of Jan. 14, a.d. 484, as 
an example (Figure 1.8), and assume the same angular velocity for the Earth in 
the intervening fifteen centuries as the Earth had in 1900 (astronomical reference 
point). One comes out wrong. The Earth has to be set back by 30° (or the moon 
moved from its computed position, or some combination of the two effects) to make 
the Athens observer fall under the black brush. To catch up those 30° (or less, if 
part of the effect is due to a slow change in the angular momentum of the moon), 
the Earth had to turn faster in the past than it does today. Assigning most of the 
discrepancy to terrestrial spin-down (rate of spin-down compatible with modern 
atomic-clock evidence), and assuming a uniform rate of slowing from then to now 




HHH B: Computed track if 

length of day constant 


Figure 1.8. 

Calculated path of totality for the eclipse of January 14, a.d. 484 (left; calculation based on no spin-down 
of Earth relative to its 1900 angular velocity) contrasted with the same path as set ahead enough to 
put the center of totality (at sunrise) at Athens [displacement very close to 30°; actual figure of deceleration 
adopted in calculations, 32.75 arc sec/(century) 2 ]. This is “undoubtedly the most reliable of all ancient 
European eclipses,” according to Dr. F. R. Stephenson, of the Department of Geophysics and Planetary 
Physics of the University of Newcastle upon Tyne, who most kindly prepared this diagram especially 
for this book. He has also sent a passage from the original Greek biography of Proclus of Athens (died 
at Athens a.d. 485) by Marinus of Naples, reading, “Nor were there portents wanting in the year which 
preceded his death; for example, such a great eclipse of the Sun that night seemed to fall by day. For 
a profound darkness arose so that stars even appeared in the sky. This happened in the eastern sky 
when the Sun dwelt in Capricorn” [from Westermann and Boissonade (1878)]. 

Does this 30° for this eclipse, together with corresponding amounts for other eclipses, represent the 
“right” correction? “Right” is no easy word. From one total eclipse of the sun in the Mediterranean 
area to another is normally many years. The various provinces of the Greek and Roman worlds were 
far from having a uniform level of peace and settled life, and even farther from having a uniform standard 
of what it is to observe an eclipse and put it down for posterity. If the scores of records of the past 
are unhappily fragmentary, even more unhappy has been the willingness of a few uncritical “investigators” 
in recent times to rush in and identify this and that historical event with this and that calculated eclipse. 
Fortunately, by now a great literature is available on the secular deceleration of the Earth’s rotation, 
in the highest tradition of critical scholarship, both astronomical and historical. In addition to the books 
of 0. Neugebauer (1959) and Munk and MacDonald (1960). the paper of Curott (1966), and items cited 
by these workers, the following are key items. (For direction to them, we thank Professor Otto Neuge¬ 
bauer no relation to the other Neugebauer cited below!) For the ancient records, and for calculations 
of the tracks of ancient eclipses, F. K. Ginzel (1882, 1883, 1884); for an atlas of calculated eclipse tracks, 
Oppolzer (1887) and Ginzel (1899); and for a critical analysis of the evidence. P. V. Neugebauer (1927, 
1929, and 1930). This particular eclipse was chosen rather than any other because of the great reliability 
of the historical record of it. 




Good clocks make spacetime 
trajectories of free particles 
look straight 


Our choice of unit for 
measuring time; the 
geometrodynamic centimeter. 
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(angular velocity correction proportional to first power of elapsed time: angle cor¬ 
rection itself proportional to square of elapsed time), one estimates from a correction 
of 


30° or 2 hours 1,500 years ago 
the following corrections for intermediate times: 


30° /10 2 , or 1.2 min 150 years ago, 

30°/10 4 , or 0.8 sec 15 years ago. 

Thus one sees the downfall of the Earth as a standard of time and its replacement 
by the orbital motions of the heavenly bodies as a better standard: a standard that 
does more to “make motion look simple.” Astronomical time is itself in turn today 
being supplanted by atomic time as a standard of reference (see Box 1.4, “Time 
Today”). 

Look at a bad clock for a good view of how time is defined. Let t be time on 
a “good” clock (time coordinate of a local inertial frame); it makes the tracks of 
free particles through the local region of spacetime look straight. Let T(t) be the 
reading of the “bad” clock; it makes the world lines of free particles through the 
local region of spacetime look curved (Figure 1.9). The old value of the acceleration, 
translated into the new (“bad”) time, becomes 

0 — d 2 x _ d I dT dx \ _ d 2 T dx ldT\ 2 d 2 x 

~ dt 2 ~ dt\dt dT) ~ dt 2 dT \dt / dT 2 * 

To explain the apparent accelerations of the particles, the user of the new time 
introduces a force that one knows to be fictitious: 



It is clear from this example of a “bad” time that Newton thought of a “good” time 
when he set up the principle that “Time flows uniformly” ( d 2 T/dt 2 = 0). Time is 
defined to make motion look simple! 

The principle of uniformity, taken by itself, leaves free the scale of the time 
variable. The quantity T — at -f b satisfies the requirement as well as t itself. The 
history of timekeeping discloses many choices of the unit and origin of time. Each 
one required some human action to give it sanction, from the fiat of a Pharaoh to 
the communique of a committee. In this book the amount of time it takes light to 
travel one centimeter is decreed to be the unit of time. Spacelike intervals and 
timelike intervals are measured in terms of one and the same geometric unit: the 
centimeter. Any other decision would complicate in analysis what is simple in nature. 
No other choice would live up to Minkowski’s words, “Henceforth space by itself, 
and time by itself, are doomed to fade away into mere shadows, and only a kind 
of union of the two will preserve an independent reality.” 
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Figure 1.9. 

Good clock (left) vs. bad clock (right) as seen in the maps they give of the same free particles moving 
through the same region of spacetime. The world lines as depicted at the right give the impression that 
a force is at work. The good definition of time eliminates such fictitious forces. The dashed lines connect 
corresponding instants on the two time scales. 


One can measure time more accurately today than distance. Is that an argument 
against taking the elementary unit to be the centimeter? No, provided that this 
definition of the centimeter is accepted: the geometrodynamic standard centimeter 
is the fraction 

1/(9.460546 X 10 17 ) (1.3) 

of the interval between the two “effective equinoxes ” that bound the tropical year 
1900.0. The tropical year 1900.0 has already been recognized internationally as the 
fiducial interval by reason of its definiteness and the precision with which it is known. 
Standards committees have defined the ephemeris second so that 31,556,925.974 sec 
make up that standard interval. Were the speed of light known with perfect precision, 
the standards committees could have given in the same breath the number of 
centimeters in the standard interval. But it isn’t; it is known to only six decimals. 
Moreover, the international centimeter is defined in terms of the orange-red wave¬ 
length of Kr 86 to only nine decimals (16,507.6373 wavelengths). Yet the standard 
second is given to 11 decimals. We match the standard second by arbitrarily defining 
the geometrodynamic standard centimeter so that 

9.4605460000 x 10 17 

such centimeters are contained in the standard tropical year 1900.0. The speed of 
light then becomes exactly 

9.4605460000 x 10 17 

- 3 Y 556 925,974 -geometrodynamic cm/sec. (1.4) 

This is compatible with the speed of light, as known in 1967, in units of “international 
cm/sec”: 


29,979,300,000 ± 30,000 international cm/sec. 


Box 1.4 TIME TODAY 


Prior to 1956 the second was defined as the frac¬ 
tion 1/86,400 of the mean solar day. 

From 1956 to 1967 the “second’’ meant the 
ephemeris second, defined as the fraction 
1/(31,556,925.9747) of the tropical year 
OOhOOmOOs December 31, 1899. 

Since 1967 the standard second has been the 
SI (Systeme International) second, defined as 
9,192,631,770 periods of the unperturbed micro- 
wave transition between the two hyperfine levels 
of the ground state of Cs 133 . 

Like the foregoing evolution of the unit for the 
time interval, the evolution of a time coordinate 
has been marked by several stages. 

Universal time, UTO, is based on the count of 
days as they actually occurred historically; in other 
words, on the actual spin of the earth on its axis; 
historically, on mean solar time (solar position as 
corrected by the “equation of time”; i.e., the faster 
travel of the earth when near the sun than when 
far from the sun) as determined at Greenwich 
Observatory. 

UT1, the “navigator’s time scale,” is the same 
time as corrected for the wobble of the earth on 
its axis (At — 0.05 sec). 

UT2 is UT1 as corrected for the periodic fluc¬ 
tuations of unknown origin with periods of one- 
half year and one year (At — 0.05 sec; measured 
to 3 ms in one day). 

Ephemeris Time, ET (as defined by the theory 
of gravitation and by astronomical observations 
and calculations), is essentially determined by the 
orbital motion of the earth around the sun. 
“Measurement uncertainties limit the realization 
of accurate ephemeris time to about 0.05 sec for 
a nine-year average.” 

Coordinated Universal Time (UTC) is broadcast 
on stations such as WWV. It was adopted interna¬ 
tionally in February 1971 to become effective Jan¬ 
uary 1, 1972. The clock rate is controlled by atomic 
clocks to be as uniform as possible for one year 
(atomic time is measured to —0.1 microsec in 1 
min, with diffusion rates of 0.1 microsec per day 
for ensembles of clocks), but is changed by the 
infrequent addition or deletion of a second—called 
a “leap second”—so that UTC never differs more 
than 0.7 sec from the navigator’s time scale, UT1. 


THE TIMES 
Wednesday 
June 21 1972 



The foregoing account is abstracted from J. A. 
Barnes (1971). The following is extracted from a 
table (not official at time of receipt), kindly sup¬ 
plied by the Time and Frequency Division of the 
U.S. National Bureau of Standards in Boulder, 
Colorado. 

Timekeeping capabilities of some familiar clocks 
are as follows: 

Tuning fork wrist watch (I960), 

1 min/mo. 

Quartz crystal clock (1921-1930), 

1 jusec/day, 

1 sec/yr. 

Quartz crystal wrist watch (1971), 

0.2 sec/2 mos., 

1 sec/yr. 

Cesium beam (atomic resonance, Cs 133 ), (1952- 
1955), 

0.1 jusec/day, 

0.5 jusec/mo. 

Rubidium gas cell (Rb 87 resonance), (1957), 

0.1 jusec/day, 

1-5 jusec/mo. 

Hydrogen maser (1960), 

0.01 jusec/2 hr, 

0.1 jusec/day. 

Methane stabilized laser (1969), 

0.01 jusec/100 sec. 
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Recent measurements [Evenson et al. (1972)] change the details of the foregoing 
1967 argument, but not the principles. 


§1.6. CURVATURE 


Gravitation seems to have disappeared. Everywhere the geometry of spacetime is 
locally Lorentzian. And in Lorentz geometry, particles move in a straight line with 
constant velocity. Where is any gravitational deflection to be seen in that? For 
answer, turn back to the apple (Figure 1.1). Inspect again the geodesic tracks of 
the ants on the surface of the apple. Note the reconvergence of two nearby geodesics 
that originally diverged from a common point. What is the analog in the real world 
of physics? What analogous concept fits Einstein’s injunction that physics is only 
simple when analyzed locally? Don’t look at the distance from the spaceship to the 
Earth. Look at the distance from the spaceship to a nearby spaceship! Or, to avoid 
any possible concern about attraction between the two ships, look at two nearby 
test particles in orbit about the Earth. To avoid distraction by the nonlocal element 
(the Earth) in the situation, conduct the study in the interior of a spaceship, also 
in orbit about the Earth. But this region has already been counted as a local inertial 
frame! What gravitational physics is to be seen there? None. Relative to the spaceship 
and therefore relative to each other, the two test particles move in a straight line 
with uniform velocity, to the precision of measurement that is contemplated (see 
Box 1.5, Test tor Flatness ). Now the key point begins to appear: precision of 
measurement. Increase it until one begins to discern the gradual acceleration of the 
test particles away from each other, if they lie along a common radius through the 
center of the Earth; or toward each other, if their separation lies perpendicular to 
that line. In Newtonian language, the source of these accelerations is the tide-pro¬ 
ducing action of the Earth. To the observer in the spaceship, however, no Earth 
is to be seen. And following Einstein, he knows it is important to analyze motion 
locally. He represents the separation of the new test particle from the fiducial test 
particle by the vector £ fc (k = 1,2, 3; components measured in a local Lorentz frame). 
For the acceleration of this separation, one knows from Newtonian physics what 
he will find: if the Cartesian z-axis is in the radial direction, then 


_Gm S21 
dt 2 c 2 r 3 


d 2 ^ _ Gm, 
dt 2 ~ T 2 ! 


d 2 ^ 

dt 2 


2Gm„ 




(1.5) 


Proof: In Newtonian physics the acceleration of a single particle toward the center 
of the Earth in conventional units of time is Gm coav /r 2 , where G is the Newtonian 
constant of gravitation, 6.670 x 10- 8 cm 3 /g sec 2 and m conv is the mass of the Earth 
in conventional units of grams. In geometric units of time (cm of light-travel time). 


Gravitation is manifest in 
relative acceleration of 
neighboring test particles 
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Relative acceleration is 
caused by curvature 


the acceleration is Gm conv /c 2 r 2 . When the two particles are separated by a distance 
£ perpendicular to r, the one downward acceleration vector is out of line with the 
other by the angle |/r. Consequently one particle accelerates toward the other by 
the stated amount. When the separation is parallel to r, the relative acceleration 
is given by evaluating the Newtonian acceleration at r and at r + |, and taking the 
difference (£ times d/dr) q.e.d. In conclusion, the “local tide-producing acceleration” 
of Newtonian gravitation theory provides the local description of gravitation that 
Einstein bids one to seek. 

What has this tide-producing acceleration to do with curvature? (See Box 1.6.) 
Look again at the apple or, better, at a sphere of radius a (Figure 1.10). The 
separation of nearby geodesics satisfies the “equation of geodesic deviation,” 

dH/ds 2 + R£ = 0. (1.6) 

Here R = 1 /a 2 is the so-called Gaussian curvature of the surface. For the surface 
of the apple, the same equation applies, with the one difference that the curvature 
R varies from place to place. 


Box 1.5 TEST FOR FLATNESS 

L Specify the extension in space L (cm or m) 
and extension in time T (cm or m of light travel 
time) of the region under study. 

2. Specify the precision with which one can 
measure the separation of test particles in this 
region. 

3. Follow the motion of test particles moving 
along initially parallel world lines through this 
region of spacetime. 

4. When the world lines remain parallel to the 
precision for ail directions of travel, then one 
says that “in a region so limited and to a precision 
so specified, spacetime is fiat.” 

example: Region just above the surface of the 
earth, 100 m X 100 m x 100 m (space extension), 
followed for 10 9 m of light-travel time (T conv — 
3 sec). Mass of Earth, w conv = 5.98 x 10 27 g, 
m = (0.742 x 10" 28 cm/g) x (5.98 X 10 27 g) = 
0.444 cm [see eq. (1.12)]. Tide-producing accelera¬ 
tion R z 0z o (relative acceleration in z-direction of 
two test particles initially at rest and separated 
from each other by 1 cm of vertical elevation) is 


( d/dr)(m/r 2 ) = — 2m/r 3 

— —0.888 cm/(6.37 x 10 8 cm) 3 
= —3.44 x 10 -27 cm -2 

(“cm of relative displacement per cm of light- 
travel time per cm of light-travel time per cm of 
vertical separation”). Two test particles with a ver¬ 
tical separation £* = 10 4 cm acquire in the time 
i — 10 u cm (difference between time and proper 
time negligible for such slowly moving test parti¬ 
cles) a relative displacement 

*£* = -\R W 2 ** 

= 1.72 X 10~ 27 cm -2 (10 11 cm) 2 10 4 cm 
= 1.72 mm. 

(Change in relative separation less for other direc¬ 
tions of motion). When the minimum uncertainty 
attainable in a measurement over a 100 m 
spacing is “worse” than this figure (exceeds 1.72 
mm), then to this level of precision the region of 
spacetime under consideration can be treated as 
flat. When the uncertainty in measurement is 
“better” (less) than 1.72 mm, then one must limit 
attention to a smaller region of space or a shorter 
interval of time or both, to find a region of space- 
time that can be regarded as fiat to that precision. 
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Figure 1.10. 

Curvature as manifested in the “acceleration of the separation” of two 
nearby geodesics. Two geodesics, originally parallel, and separated by the 
distance (“geodesic deviation”) £ 0 , are no longer parallel when followed 
a distance s. The separation is £ = £ 0 cos <p = £ 0 cos ( s/a) y where a is 
the radius of the sphere. The separation follows the equation of simple 
harmonic motion, cF^/ds 2 + (1/a 2 ) £ = 0 (“equation of geodesic devia¬ 
tion”). 

The direction of the separation vector, f, is fixed fully by its orthogon¬ 
ality to the fiducial geodesic. Hence, no reference to the direction of f 
is needed or used in the equation of geodesic deviation; only the magni¬ 
tude £ of f appears there, and only the magnitude, not direction, of the 
relative acceleration appears. 


In a space of more than two dimensions, an equation of the same general form 
applies, with several differences. In two dimensions the direction of acceleration of 
one geodesic relative to a nearby, fiducial geodesic is fixed uniquely by the demand 
that their separation vector, f, be perpendicular to the fiducial geodesic (see Figure 
1.10). Not so in three dimensions or higher. There f can remain perpendicular to 
the fiducial geodesic but rotate about it (Figure 1.11). Thus, to specify the relative 
acceleration uniquely, one must give not only its magnitude, but also its direction. 

The relative acceleration in three dimensions and higher, then, is a vector. Call 
it “D 2 f/<ft 2 ,” and call its four components u D 2 £ a /ds 2 ” Why the capital D? Why 
not “d^/ds 2 ”*! Because our coordinate system is completely arbitrary (cf. § 1.2). The 
twisting and turning of the coordinate lines can induce changes from point to point 
in the components of f, even if the vector f is not changing at all. Consequently, 
the accelerations of the components d 2 g*/ds 2 are generally not equal to the compo¬ 
nents D 2 % x /ds 2 of the acceleration! 

How, then, in curved spacetime can one determine the components D 2 g*/ds 2 of 
the relative acceleration? By a more complicated version of the equation of geodesic 

deviation (1.6). Differential geometry (Part III of this book) provides us with a Curvature is characterized by 
geometrical object called the Riemann curvature tensor , “ Riemann ” Riemann is Riemann tensor 

{continued on page 34) 




Figure 1.11. 

The separation vector f between two geodesics in a curved three- 
dimensional manifold. Here f can not only change its length from 
point to point, but also rotate at a varying rate about the fiducial 
geodesic. Consequently, the relative acceleration of the geodesics must 
be characterized by a direction as well as a magnitude; it must be 
a vector, D 2 (/ds 2 . 




Box 1.6 CURVATURE OF WHAT? 


Nothing seems more attractive at first glance than 
the idea that gravitation is a manifestation of the 
curvature of space (A), and nothing more ridicu¬ 
lous at a second glance (B). How can the tracks 
of a bail and of a bullet be curved so differently 
if that curvature arises from the geometry of 
space? No wonder that great Riemann did not give 
the world a geometric theory of gravity. Yes, at 
the age of 28 (June 10, 1854) he gave the world 
the mathematical machinery to define and calcu¬ 
late curvature (metric and Riemannian geometry). 
Yes, he spent his dying days at 40 working to find 
a unified account of electricity and gravitation. But 
if there was one reason more than any other why 
he failed to make the decisive connection between 
gravitation and curvature, it was this, that he 
thought of space and the curvature of space, not 


of spacetime and the curvature of spacetime. To 
make that forward step took the forty years to 
special relativity (1905: time on the same footing 
as space) and then another ten years (1915: gen¬ 
eral relativity). Depicted in spacetime (C), the 
tracks of ball and bullet appear to have compara¬ 
ble curvature. In fact, however, neither track has 
any curvature at all. They both look curved in (C) 
only because one has forgotten that the spacetime 
they reside in is itself curved—curved precisely 
enough to make these tracks the straightest lines 
in existence (“geodesics”). 

If it is at first satisfying to see curvature, and 
curvature of spacetime at that, coming to the fore 
in so direct a way, then a little more reflection 
produces a renewed sense of concern. Curvature 
with respect to what? Not with respect to the labo- 





is curved (actual travel takes place in spacetime rather 
than space; correct deflection is twice that given by 
above elementary picture). Deflection inversely propor¬ 
tional to angular separation between star and center of 
sun. See Box 40.1 for actual deflections observed at time 
of an eclipse. 


[32] 











ratory. The earth-bound laboratory has no simple 
status whatsoever in a proper discussion. First, it 
is no Lorentz frame. Second, even to mention the 
earth makes one think of an action-at-a-distance 
version of gravity (distance from center of earth 
to ball or bullet). In contrast, it was the whole 
point of Einstein that physics looks simple only 
when analyzed locally. To look at local physics, 
however, means to compare one geodesic of one 
test particle with geodesics of other test particles 
traveling (1) nearby with (2) nearly the same di¬ 
rections and (3) nearly the same speeds. Then one 
can “look at the separations between these nearby 
test particles and from the second time-rate of 
change of these separations and the ‘equation of 
geodesic deviation’ (equation 1.8) read out the 
curvature of spacetime.” 



C. Tracks of ball and bullet through spacetime, as re¬ 
corded in laboratory, have comparable curvatures. 
Track compared to arc of circle: (radius) = (horizontal 
distance) 2 /8 (rise). 



B. Tracks of ball and bullet through space as seen in 
laboratory have very different curvatures. 
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the higher-dimensional analog of the Gaussian curvature R of our apple’s surface. 
Riemann is the mathematical embodiment of the bends and warps in spacetime. 
And Riemann is the agent by which those bends and warps (curvature of spacetime) 
produce the relative acceleration of geodesics. 

Riemann , like the metric tensor g of Box 1.3, can be thought of as a family of 
machines, one machine residing at each event in spacetime. Each machine has three 
slots for the insertion of three vectors: 

slot 1 slot 2 slot 3 

\ \ \ 

Riemann ( , , ). 

Choose a fiducial geodesic (free-particle world line) passing through an event i?, 
and denote its unit tangent vector (particle 4-velocity) there by 

u = dx/dr; components, if* = dx a /dr. (1.7) 

Choose another, neighboring geodesic, and denote by f its perpendicular separation 
from the fiducial geodesic. Then insert u into the first slot of Riemann at 2, f into 
the second slot, and u into the third. Riemann will grind for awhile; then out will 
pop a new vector, 


Riemann (i/, f, u). 


Riemann tensor, through 
equation of geodesic 
deviation, produces relative 
accelerations 


The equation of geodesic deviation states that this new vector is the negative of 
the relative acceleration of the two geodesics: 

D 2 £/dr 2 + Riemann (i/, f, u) — 0. (1.8) 


The Riemann tensor, like the metric tensor (Box 1.3), and like all other tensors, 
is a linear machine. The vector it puts out is a linear function of each vector inserted 
into a slot: 


Riemann (2i/, aw + bv , 3 r) 

= 2 X a x 3 Riemann (i/, iv, r) + 2 x b X 3 Riemann ( u , v , r). (1.9) 

Consequently, in any coordinate system the components of the vector put out can 
be written as a “trilinear function” of the components of the vectors put in: 

r — Riemann (i /, v , w) r a = R a 0yS u 13 v y w 6 . (1.10) 

(Here there is an implied summation on the indices /?, y, 5; cf. Box 1.1.) The 
4 x 4 x 4 x 4 = 256 numbers R a p y s are called the “components of the Riemann 
tensor in the given coordinate system.” In terms of components, the equation of 
geodesic deviation states 


D 2 t« 
dr 2 


+ Ra 0y 8 


dx& dx h 

~3r ~3T 


= 0 . 


(1.80 



§1.6. CURVATURE 


35 


In Einstein s geometric theory of gravity, this equation of geodesic deviation Equation of geodesic 
summarizes the entire effect of geometry on matter. It does for gravitation physics deviation is analog of Lorentz 
what the Lorentz force equation, force law 


does for electromagnetism. See Box 1.7. 

The units of measurement of the curvature are cm -2 just as well in spacetime 
as on the surface of the apple. Nothing does so much to make these units stand 
out clearly as to express mass in “geometrized units”: 

m(cm) = (G/c 2 )m CODY (g) 

= (°- 742 X !0- 28 cm/g)w C0nv (g). (1.12) 


Geometrized units 


Box 1.7 EQUATION OF MOTION UNDER THE INFLUENCE OF A GRAVITATIONAL FIELD 
AND AN ELECTROMAGNETIC FIELD, COMPARED AND CONTRASTED 


Acceleration is defined for 
one particle? 

Acceleration defined how? 


Acceleration depends on all 
four components of the 
4-velocity of the particle? 

Universal acceleration for all 
test particles in same 
locations with same 
4-velocity? 

Driving field 

Ostensible number of distinct 
components of driving 
field 

Actual number when allowance 
is made for symmetries of 
tensor 

Names for more familiar of 
these components 


Electromagnetism 
[Lorentz force, equation {1.11)] 


Actual world line compared to 
w'orld line of uncharged 
“fiducial” test particle 
passing through same point 
with same 4-velocity. 


No: is proportional to e/m 
Electromagnetic field 


4x4= 16 


3 electric 
3 magnetic 


Gravitation [Equation of 
geodesic deviation {1.8')] 


Already an uncharged test 
particle, which can’t 
accelerate relative to 
itself! Acceleration 
measured relative to a 
nearby test particle as 
fiduciary standard. 


Riemann curvature tensor 


4 4 = 256 


6 components of local 
Newtonian tide-producing 
acceleration 
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This conversion from grams to centimeters by means of the ratio 

G/c 2 = 0.742 x 10~ 28 cm/g 

is completely analogous to converting from seconds to centimeters by means of the 
ratio 


9.4605460000 X 10 17 cm 
31,556,925.974 sec 


Components of Riemann 
tensor evaluated from relative 
accelerations of slowly 
moving particles 


(see end of §1.5). The sun, which in conventional units has m conv = 1.989 x 10 33 g, 
has in geometrized units a mass m = 1.477 km. Box 1.8 gives further discussion. 

Using geometrized units, and using the Newtonian theory of gravity, one can 
readily evaluate nine of the most interesting components of the Riemann curvature 
tensor near the Earth or the sun. The method is the gravitational analog of deter¬ 
mining the electric field strength by measuring the acceleration of a slowly moving 
test particle. Consider the separation between the geodesics of two nearby and slowly 
moving (u <c) particles at a distance r from the Earth or sun. In the standard, nearly 
inertial coordinates of celestial mechanics, all components of the 4-velocity of the 


— 

Box 1.8 GEOMETRIZED UNITS 

Throughout this book, we use “geometrized units,” 
in which the speed of light c, Newton’s gravita¬ 
tional constant G , and Boltzman’s constant k are 
all equal to unity. The following alternative ways 
to express the number 1.0 are of great value: 

1.0 — c — 2.997930 ■ • • X 10 10 cm/sec 
1.0 = G/c 2 = 0.7425 X 10” 28 cm/g; 

1.0 = G/c 4 = 0.826 X 10 -49 cm/erg; 

1.0 — Gk/c 4 = 1.140 X 10' 65 cm/K; 

1.0 = c 2 /G in = 3.48 x 10 24 cm/gauss" 1 . 

One can multiply a factor of unity, expressed in 
any one of these ways, into any term in any equa¬ 
tion without affecting the validity of the equation. 
Thereby one can convert one’s units of measure 


\ 

from grams to centimeters to seconds to ergs to 
. ... For example: 

Mass of sun = M e = 1.989 x 10 33 g 

= (1.989 x 10 33 g) x (G/c 2 ) 

= 1.477 x 10 5 cm 

= (1.9 89 x 10 33 g) x (c 2 ) 

= 1.788 X 10 54 ergs. 

The standard unit, in terms of which everything 
is measured in this book, is centimeters. However, 
occasionally conventional units are used; in such 
cases a subscript “conv” is sometimes, but not 
always, appended to the quantity measured: 

* 4 ,, = 1-9S9 x 10’= g. 


V 


J 
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fiducial test particle can be neglected except dx°/dr = 1. The space components of 
the equation of geodesic deviation read 


dH k /di* + R k 0j0 & = 0. (1.13) 

Comparing with the conclusions of Newtonian theory, equations (1.5), we arrive at 
the following information about the curvature of spacetime near a center of mass: 


Rx 0x6 

Woio 

^ 0x0 

= 

m/r 3 

0 

0 

RX 6y0 

Rv m 

^ OyO 

= 

0 

m/r 3 

0 


R\o 

^ 0x0 

= 

0 

0 

-2m/r 3 


(units cm" 2 ). Here and henceforth the caret or “hat” is used to indicate the compo¬ 
nents of a vector or tensor in a local Lorentz frame of reference (“physical compo¬ 
nents,” as distinguished from components in a general coordinate system). Einstein’s 
theory will determine the values of the other components of curvature (e.g., 
Rx ziz = -wA* 3 ); but these nine terms are the ones of principal relevance for 
many applications of gravitation theory. They are analogous to the components 
of the electric field in the Lorentz equation of motion. Many of the terms not 
evaluated are analogous to magnetic field components—ordinarily weak unless the 
source is in rapid motion. 

This ends the survey of the effect of geometry on matter (“effect of curvature 
of apple in causing geodesics to cross”—especially great near the dimple at the top, 
just as the curvature of spacetime is especially large near a center of gravitational 
attraction). Now for the effect of matter on geometry (“effect of stem of apple in 
causing dimple”)! 
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The weight of any heavy body of known weight at a particular 
distance from the center of the world varies according to the 
variation of its distance therefrom; so that as often as it is 
removed from the center, it becomes heavier, and when brought 
near to it, is lighter. On this account, the relation of gravity to 
gravity is as the relation of distance to distance from the center. 

AL KHAZINT (Merv , A.D. 1115), Book of the Balance of Wisdom 


Figure 1.12 shows a sphere of the same density, p — 5.52 g/cm 3 , as the average 
density of the Earth. A hole is bored through this sphere. Two test particles, A and 
B , execute simple harmonic motion in this hole, with an 84-minute period. Therefore 
their geodesic separation however it may be oriented, undergoes a simple periodic 
motion with the same 84-minute period: 

d'^’/d - 2 = - (~ pj £>, j = * or y or z. 


( 1 . 15 ) 
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Box 1.9 GALILEO GALILEI 

Pisa, February 15, 1564—Arcetri, Florence, January 8, 1642 



Uffizi Gallery, Florence 


"In questions of science the authority 
of a thousand is not worth the humble 
reasoning of a single individual. " 

GALILEO GALILEI (1632) 

"The spaces described by a body falling from rest 
with a uniformly accelerated motion are to each other 
as the squares of the time intervals employed in 
traversing these distances." 

GALILEO GALILEI (1638) 


"Everything that has been said before and imagined by other people [about the 
tides ] is in my opinion completely invalid. But among the great men who have 
philosophised about this marvellous effect of nature the one who surprised me the 
most is Kepler. More than other people he was a person of independent genius, 
sharp, and had in his hands the motion of the earth. He later pricked up his ears 
and became interested in the action of the moon on the water, and in other occult 

phenomena, and simitar childishness. " 

GALILEO GALILEI (1632) 

"It is a most beautiful and delightful sight to behold [with the new telescope] the 
body of the Moon . . . the Moon certainly does not possess a smooth and polished 
surface, but one rough and uneven . . . full of vast protuberances, deep chasms 
and sinuosities . . . stars in myriads, which have never been seen before and 
which surpass the old, previously known, stars in number more than ten times. / 
have discovered four planets, neither known nor observed by any one of the 
astronomers before my time . . . got rid of disputes about the Galaxy or Milky 
Way, and made its nature dear to the very senses, not to say to the 
understanding . . . the galaxy is nothing else than a mass of luminous stars 
planted together in clusters . . . the number of small ones is quite beyond 
determination—the stars which have been called by every one of the astronomers 
up to this day nebulous are groups of small stars set thick together in a wonderful 

way ." 

GALILEO GALILEI IN SIDEREUS NUNC/US (1610) 


"So the principles which are set forth in this treatise will, when taken up by 
thoughtful minds, lead to many another more remarkable result; and it is to be 
believed that it will be so on account of the nobility of the subject, which is 

superior to any other in nature." 

GALILEO GALILEI {1638) 
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5.52 g/cm 3 


Figure 1.12. 

Test particles A and B move up and down a hole bored through 
the Earth, idealized as of uniform density. At radius r, a parti¬ 
cle feels Newtonian acceleration 

— = _L d~r 

** c2 d UoJ 

_ G (mass inside radius r) 



Consequently, each particle oscillates in simple harmonic mo¬ 
tion with precisely the same angular frequency as a satellite, 
grazing the model Earth, traverses its circular orbit: 


co-(cm“ 2 ) = ~~ p(cm -2 ). 
w2 conv(sec’ 2 ) - Pconv (g/cm 3 ). 


Comparing this actual motion with the equation of geodesic deviation (1.13) for 
slowly moving particles in a nearly inertial frame, we can read off some of the 
curvature components for the interior of this model Earth. 


A 0*0 

R y ~.. 

A 0*0 

A 0*0 


1 

0 

0 


R 

A OyO 

R v* -« 

A OyO 

Rz.. . 
A Oj/O 

= (477p/3) 

0 

1 

0 


Rz, „ 

A 0*0 

Ry. 

A 0*0 

^0*6 


0 

0 

1 



This example illustrates how the curvature of spacetime is connected to the distribu¬ 
tion of matter. 

Let a gravitational wave from a supernova pass through the Earth. Idealize the 
Earth s matter as so nearly incompressible that its density remains practically un¬ 
changed. The wave is characterized by ripples in the curvature of spacetime, propa¬ 
gating with the speed of light. The ripples will show up in the components R\ m 
of the Riemann tensor, and in the relative acceleration of our two test particles. 
The left side of equation (1.16) will ripple: but the right side will not. Equation 
(1.16) will break down. No longer will the Riemann curvature be generated directly 
and solely by the Earth’s matter. 

Nevertheless. Einstein tells us. a part of equation (1.16) is undisturbed by the 


The Riemann tensor inside 
the Earth 


Effect of gravitational wave 
on Riemann tensor 
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waves: its trace 


Rob = R x bio + R v bib + R ‘m = P■ (1.17) 

Even in the vacuum outside the Earth this is valid; there both sides vanish [cf. (1.14)]. 
More generally, a certain piece of the Riemann tensor, called the Einstein tensor 
Einstein tensor introduced and denoted Einstein or 6 , is always generated directly by the local distribution 

of matter. Einstein is the geometric object that generalizes R gg, the lefthand side 


Box 1.10 ISAAC IMEWTOIM 

Woolsthorpe, Lincolnshire, England, December 25, 1642— 
Kensington, London, March 20, 1726 



"The description of right lines and circles, upon which geometry 
is founded, belongs to mechanics. Geometry does not teach 
us to draw these lines, but requires them to be drawn ." 

[FROM P. 1 OF NEWTON'S PREFACE TO 
THE FIRST (1687) EDITION OF THE PRINCIPIA ] 

"Absolute space, in its own nature, 
without relation to anything external, remains 
always simitar and immovable 
"Absolute, true, and mathematical time, 
of itself, and from its own nature, flows 
equably without relation to anything external." 

[FROM THE SCHOLIUM IN THE PRINCIPIA] 


/ have not been able to discover the cause of those properties of gravity from 
phenomena, and / frame no hypotheses; for whatever is not reduced from the 
phenomena is to be called an hypothesis; and hypotheses . . . have no place in 
experimental philosophy. . . . And to us it is enough that gravity does really exist, 
and act according to the laws which we have explained, and abundantly serves to 
account for all the motions of the celestial bodies, and of our sea. " 

[FROM THE GENERAL SCHOLIUM ADDED AT THE END OF THE THIRD BOOK OF THE PRINCIPIA IN 
THE SECOND EDITION OF 1713; ESPECIALLY FAMOUS FOR THE PHRASE OFTEN QUOTED FROM 

NEWTON'S ORIGINAL LATIN, " HYPOTHESES NON F!NGO.“] 

And the same year [ 1665 or 1666] / began to think of gravity extending to the 
orb of the Moon, and having found out. . . . AH this was in the two plague years 
of 1665 and 1666, for in those days / was in the prime of my age for invention, 
and minded Mathematicks and Philosophy more than at any time since ." 

[FROM MEMORANDUM IN NEWTON’S HANDWRITING ABOUT HIS DISCOVERIES ON FLUXIONS, THE 
BINOMIAL THEOREM, OPTICS, DYNAMICS, AND GRAVITY, BELIEVED TO HAVE BEEN WRITTEN 
ABOUT 1714, AND FOUND BY ADAMS ABOUT 1887 IN THE "PORTSMOUTH COLLECTION" OF 

NEWTON PAPERS] 
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of equation (1.17). Like R 55 , Einstein is a sort of average of Riemann over all 

directions. Generating Einstein and generalizing the righthand side of (1.16) is a 

geometric object called the stress-energy tensor of the matter. It is denoted T. No Stress-energy tensor 

coordinates are need to define Einstein , and none to define T; like the Riemann introduced 

tensor, Riemann , and the metric tensor, g, they exist in the complete absence of 

coordinates. Moreover, in nature they are always equal, aside from a factor of 877 : 

Einstein = G — 87 tT. (1.18) 




"For hypotheses ought ... to explain the properties of things and not attempt to 
predetermine them except in so far as they can be an aid to experiments. " 

[FROM LETTER OF NEWTON TO I. M. PARDIES, 1672, AS QUOTED IN THE CAJORI NOTES AT THE 

END OF NEWTON (1687), P. 673] 

"That one body may act upon another at a distance through a vacuum, without 
the mediation of any thing else, by and through which their action and force may 
be conveyed from one to another, is to me so great an absurdity, that / believe no 
man, who has in philosophical matters a competent faculty of thinking, can ever 

fall into it." 

[PASSAGE OFTEN QUOTED BY MICHAEL FARADAY FROM LETTERS OF NEWTON TO RICHARD 
BENTLY, 1692-1693, AS QUOTED IN THE NOTES OF THE CAJORI EDITION OF NEWTON {1687). P. 

643] 

"The attractions of gravity, magnetism, and electricity, reach to very sensible 
distances, and so have been observed. . . ; and there may be others which reach 
to so small distances as hitherto escape observation ; . . . some force, which in 
immediate contract is exceeding strong, at small distances performs the chemical 
operations above-mentioned, and reaches not far from the particles with any 

sensible effect ." 

[FROM QUERY 31 AT THE END OF NEWTON'S OPTICKS (1730)] 

"What is there in places almost empty of matter, and whence is it that the sun 
and planets gravitate towards one another, without dense matter between them? 
Whence is it that nature doth nothing in vain; and whence arises all that order and 
beauty which we see in the world? To what end are comets, and whence is it that 
planets move all one and the same way in orbs concentrick, while comets move all 
manner of ways in orbs very excentrick; and what hinders the fixed stars from 

falling upon one another?" 

[FROM QUERY 28] 


"He is not eternity or infinity, but eternal and infinite; He is not duration or space, 
but He endures and is present. He endures forever, and is everywhere present; and 
by existing always and everywhere. He constitutes duration and space. . . . And 
thus much concerning God; to discourse of whom from the appearances of things, 

does certainly belong to natural philosophy. " 

[FROM THE GENERAL SCHOLIUM AT THE END OF THE PRINCIPIA (1687)] 
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1. geometrodynamics in brief 


Einstein field equation: how 
matter generates curvature 


Consequences of Einstein 
field equation 


This Einstein field equation, rewritten in terms of components in an arbitrary coordi¬ 
nate system, reads 

G ap = 8irT ap . ( 1 . 19 ) 

The Einstein field equation is elegant and rich. No equation of physics can be 
written more simply. And none contains such a treasure of applications and conse¬ 
quences. 

The field equation shows how the stress-energy of matter generates an average 
curvature ( Einstein = G) in its neighborhood. Simultaneously, the field equation 
is a propagation equation for the remaining, anisotropic part of the curvature: it 
governs the external spacetime curvature of a static source (Earth); it governs the 
generation of gravitational waves (ripples in curvature of spacetime) by stress-energy 
in motion; and it governs the propagation of those waves through the universe. The 
field equation even contains within itself the equations of motion (“Force = 
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mass X acceleration”) for the matter whose stress-energy generates the curvature. 

Those were some consequences of G = 877 T. Now for some applications. 

The field equation governs the motion of the planets in the solar system; it governs 
the deflection of light by the sun; it governs the collapse of a star to form a black 
hole; it determines uniquely the external spacetime geometry of a black hole (“a 
black hole has no hair”); it governs the evolution of spacetime singularities at the 
end point of collapse; it governs the expansion and recontraction of the universe. 
And more; much more. 

In order to understand how the simple equation G = 877 7" can be so all powerful, 
it is desirable to backtrack, and spend a few chapters rebuilding the entire picture 
of spacetime, of its curvature, and of its laws, this time with greater care, detail, 
and mathematics. 

Thus ends this survey of the effect of geometry on matter, and the reaction of 
matter back on geometry, rounding out the parable of the apple. 



Applications of Einstein field 
equation 


A 


"What really interests me is whether God had any choice in the creation of the 

world " 

EINSTEIN TO AN ASSISTANT, AS QUOTED BY G. HOLTON (1971), P 20 

"But the years of anxious searching in the dark , with their intense longing, their 
alternations of confidence and exhaustion, and the final emergence into the 
light—only those who have experienced it can understand that" 

EINSTEIN, AS QUOTED BY ATI. KLEIN (1971), P. 1315 

Of all the communities available to us there is not one I would want to devote 
myself to, except for the society of the true searchers, which has very few living 

members at any time. . ." 
EINSTEIN LETTER TO BORN. QUOTED BY BORN (1971), P. 82 

"I am studying your great works and—when i get stuck anywhere—now have the 
pleasure of seeing your friendly young face before me smiling and explaining " 

EINSTEIN, LETTER OF MAY 2, 1920, AFTER MEETING NIELS BOHR 

As far as the laws of mathematics refer to reality, they are not certain; and as far 

as they are certain , they do not refer to reality. " 

EINSTEIN (1921), P. 28 


The most incomprehensible thing about the world is that it is comprehensible. " 

EINSTEIN, IN SCHILPP (1949), P 112 


J 
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EXERCISES 


Exercise 1.1. CURVATURE OF A CYLINDER 

Show that the Gaussian curvature R of the surface of a cylinder is zero by showing that 
geodesics on that surface (unroll!) suffer no geodesic deviation. Give an independent argu¬ 
ment for the same conclusion by employing the formula R — \/p t p 2 ^ where p 1 and p 2 are 
the principal radii of curvature at the point in question with respect to the enveloping 
Euclidean three-dimensional space. 

Exercise 1.2. SPRING TIDE VS. NEAP TIDE 

Evaluate (1) in conventional units and (2) in geometrized units the magnitude of the Newton¬ 
ian tide-producing acceleration R m 0n0 (m,n= 1,2,3) generated at the Earth by (1) the 
moon (w conv = 7.35 x 10 25 g, r — 3.84 x 10 10 cm) and (2) the sun (w conv = 1.9 89 x 10 33 g, 
r — 1.496 x 10 13 cm). By what factor do you expect spring tides to exceed neap tides? 

Exercise 1.3. KEPLER ENCAPSULATED 

A small satellite has a circular frequency w(cm _1 ) in an orbit of radius r about a central 
object of mass w(cm). From the known value of to, show that it is possible to determine 
neither r nor m individually, but only the effective “Kepler density” of the object as averaged 
over a sphere of the same radius as the orbit. Give the formula for to 2 in terms of this Kepler 
density. 

It is a reminder of the continuity of history that Kepler and Galileo (Box 1.9) wrote back 
and forth, and that the year that witnessed the death of Galileo saw the birth of Newton 
(Box 1.10). After Newton the first dramatically new synthesis of the laws of gravitation came 
from Einstein (Box 1.11). 


And what the dead had no speech for , when living. 
They can tell you, being dead; the communication 
Of the dead is tongued with fire beyond 

the language of the living . 

T. S. ELIOT, in UTTLE GfDOING (1942) 


/ measured the skies 
Now the shadows / measure 
Skybound was the mind 
Earthbound the body rests 

JOHANNES KEPLER, d. November 15, 1630. 

He wrote his epitaph in Latin; 
it is translated by Coleman (1967), p. 109. 


Ubi materia, ibi geometria. 

JOHANNES KEPLER 




PART 


PHYSICS IN FLAT 
SPACETIME 


Wherein the reader meets an old friend , Special Relativity , 
outfitted in new , /notf aff/re, anc/ becomes more 
intimately acquainted with her charms. 



CHAPTER 


FOUNDATIONS OF 
SPECIAL RELATIVITY 


In geometric and physical applications, it always turns out that a 
quantity is characterized not only by its tensor order, 

but also by symmetry. 

HERMAN WEYL (1925) 

Undoubtedly the most striking development of geometry during 
the last 2,000 years is the continual expansion of the concept 
geometric object. This concept began by comprising only the 
few curves and surfaces of Greek synthetic geometry; it was 
stretched, during the Renaissance, to cover the whole domain of 
those objects defined by analytic geometry; more recently, it has 
been extended to cover the boundless universe treated by 

point-set theory. 
KARL MENGER, IN SCHILPP (1949), P 466. 


§2.1. OVERVIEW 

Curvature in geometry manifests itself as gravitation. Gravitation works on the 
separation of nearby particle world lines. In turn, particles and other sources of 
mass-energy cause curvature in the geometry. How does one break into this closed 
loop of the action of geometry on matter and the reaction of matter on geometry? 

One can begin no better than by analyzing the motion of particles and the dynamics 
of fields in a region of spacetime so limited that it can be regarded as flat. (See 
“Test for Flatness,” Box 1.5). 

Chapters 2-6 develop this flat-spacetime viewpoint (special relativity). The reader, Background assumed of 
it is assumed, is already somewhat familiar with special relativity:* 4-vectors in reader 
general; the energy-momentum 4-vector; elementary Lorentz transformations; the 
Lorentz law for the force on a charged particle; at least one look at one equation 

/f'o - 

* For example, see Goldstein (1959). Leighton (1959), Jackson (1962). or. for the physical perspective O 

presented geometrically. Taylor and Wheeler (1966). 
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Every physical quantity can 
be described by a geometric 
object 


All laws of physics can be 
expressed geometrically 


in one book that refers to the electromagnetic field tensor F^ v \ and the qualitative 
features of spacetime diagrams, including such points as (1) future and past light 
cones, (2) causal relationships (“past of,” “future of,” “neutral,” or “in a spacelike 
relationship to”), (3) Lorentz contraction, (4) time dilation, (5) absence of a universal 
concept of simultaneity, and (6) the fact that the T and z axes in Box 2.4 are 
orthogonal even though they do not look so. If the reader finds anything new in 
these chapters, it will be: (i) a new viewpoint on special relativity, one emphasizing 
coordinate-free concepts and notation that generalize readily to curved spacetime 
(“geometric objects,” tensors viewed as machines—treated in Chapters 2-4); or (ii) 
unfamiliar topics in special relativity, topics crucial to the later exposition of gravita¬ 
tion theory (“stress-energy tensor and conservation laws,” Chapter 5; “accelerated 
observers,” Chapter 6). 


§2.2. GEOMETRIC OBJECTS 

Everything that goes on in spacetime has its geometric description, and almost every 
one of these descriptions lends itself to ready generalization from flat spacetime to 
curved spacetime. The greatest of the differences between one geometric object and 
another is its scope: the individual object (vector) for the momentum of a certain 
particle at a certain phase in its history, as contrasted to the extended geometric 
object that describes an electromagnetic field defined throughout space and time 
(“antisymmetric second-rank tensor field” or, more briefly, “field of 2-forms”). The 
idea that every physical quantity must be describable by a geometric object, and 
that the laws of physics must all be expressible as geometric relationships between 
these geometric objects, had its intellectual beginnings in the Erlanger program of 
Felix Klein (1872), came closer to physics in Einstein’s “principle of general covari¬ 
ance” and in the writings of Hermann Weyl (1925), seems to have first been formu¬ 
lated clearly by Veblen and Whitehead (1932), and today pervades relativity theory, 
both special and general. 

A. Nijenhuis (1952) and S.-S. Chern (1960,1966,1971) have expounded the mathe¬ 
matical theory of geometric objects. But to understand or do research in geometro- 
dynamics, one need not master this elegant and beautiful subject. One need only 
know that geometric objects in spacetime are entities that exist independently of 
coordinate systems or reference frames. A point in spacetime (“event”) is a geometric 
object. The arrow linking two neighboring events (“vector”) is a geometric object 
in flat spacetime, and its generalization, the “tangent vector” is a geometric object 
even when spacetime is curved. The “ metric” (machine for producing the squared 
length of any vector; see Box 1.3) is a geometric object. No coordinates are needed 
to define any of these concepts. 

The next few sections will introduce several geometric objects, and show the roles 
they play as representatives of physical quantities in flat spacetime. 
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Figure 2.1. 

From vector as connector of two points to vector as derivative 
(“tangent vector”; a local rather than a bilocal concept). 


§2.3. VECTORS 

Begin with the simplest idea of a vector (Figure 2.1 B): an arrow extending from 
one spacetime event a (“tail”) to another event $ (“tip”). Write this vector as 

v a9 = $ - (2 (or d<$). 

For many purposes (including later generalization to curved spacetime) other com¬ 
pletely equivalent ways to think of this vector are more convenient. Represent the 
arrow by the parametrized straight line 9{\) = a + A($ - rf), with A = 0 the tail 
of the arrow, and X = 1 its tip. Form the derivative of this simple linear expression 
for 9(k): 

( d/d\)[a + - a)] = ® - a = l) - 9(0) = (tip) - (tail) = v m . 

This result allows one to replace the idea of a vector as a 2-point object (“bilocal”) 
by the concept of a vector as a 1-point object (“tangent vector”; local): 

v a*> = (d9/d\\^. ( 2 . 1 ) 

Example: if 9(t) is the straight world line of a free particle, parametrized by its 
proper time, then the displacement that occurs in a proper time interval of one second 
gives an arrow u — 9(\ ) — 9(0). This arrow is easily drawn on a spacetime diagram. 
It accurately shows the 4-velocity of the particle. However, the derivative formula 
u = d9/dr for computing the same displacement (1) is more suggestive of the 
velocity concept and (2) lends itself to the case of accelerated motion. Thus, given 
a world line S?(t) that is not straight, as in Figure 2.2, one must first form dy’/dr, 
and only thereafter draw the straight line S?(0 ) + X(dd’/dT) 0 of the arrow u - d^/di 
to display the 4-velocity u. 


Ways of defining vector: 
As arrow 


As parametrized straightMine 


As derivative of point along 
curve 
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Figure 2.2. 

Same tangent vector derived from two very different curves. That parame¬ 
trized straight line is also drawn which best fits the two curves at t? 0 . The 
tangent vector reaches from 0 to 1 on this straight line. 


Components of a vector 


The reader may be unfamiliar with this viewpoint. More familiar may be the 
components of the 4-velocity in a specific Lorentz reference frame; 


u 


o 


A 

dr 



u’ 


dx j 

dr 



(2.2) 


where 

v j = dx j /dt — components of “ordinary velocity,” 
v 2 = ( v x ) 2 -f (u y ) 2 -f ( v z ) 2 . 


Even the components (2.2) of 4-velocity may seem slightly unfamiliar if the reader 
is acc ustom ed to having the fourth component of a vector be multiplied by a factor 
i = \/—1 • If so, he must adjust himself to new notation. (See “Farewell to ‘ict 9 9 99 
Box 2.1.) 

More fundamental than the components of a vector is the vector itself. It is a 
geometric object with a meaning independent of all coordinates. Thus a particle 
has a world line ^(t), and a 4-velocity u = dty/dr, that have nothing to do with 
any coordinates. Coordinates enter the picture when analysis on a computer is 
required (rejects vectors; accepts numbers). For this purpose one adopts a Lorentz 
Basis vectors frame with orthonormal basis vectors (Figure 2.3) e 0 , e v e 2 , and e 3 . Relative to 

the origin 6 of this frame, the world line has a coordinate description 

<3>(j) - 6 = x°(t )e 0 + x\j)e x + x 2 (r)e 2 + x 3 (r)e 3 = x^e^. 

Expressed relative to the same Lorentz frame, the 4-velocity of the particle is 

u = d^P/dr ~ (dx*/di)e^ = w°e 0 -f w 1 e 1 -f u 2 e 2 -f u z e z . (2.3) 
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| Bo* 2.1FAREWELL TO “ict 


"\ 


One sometime participant in special relativity will 
have to be put to the sword: “x 4 = let” This 
imaginary coordinate was invented to make the 
geometry of spacetime look formally as little 
different as possible from the geometry of Eu¬ 
clidean space; to make a Lorentz transformation 
look on paper like a rotation; and to spare one 
the distinction that one otherwise is forced to make 
between quantities with upper indices (such as the 
components p* of the energy-momentum vector) 
and quantities with lower indices (such as the 
components of the energy-momentum 1-form). 
However, it is no kindness to be spared this latter 
distinction. Without it, one cannot know whether 
a vector (§2.3) is meant or the very different geo¬ 
metric object that is a 1-form (§2.5). Moreover, 
there is a significant difference between an angle 
on which everything depends periodically (a rota¬ 
tion) and a parameter the increase of which gives 
rise to ever-growing momentum differences (the 
“velocity parameter” of a Lorentz transformation; 
Box 2.4). If the imaginary time-coordinate hides 
from view the character of the geometric object 
being dealt with and the nature of the parameter 
in a transformation, it also does something even 
more serious: it hides the completely different 
metric structure (§2.4) of + + + geometry and 
—I- 4. 4- geometry. In Euclidean geometry, when 
the distance between two points is zero, the two 

___ 


points must be the same point. In Lorentz-Min- 
kowski geometry, when the interval between two 
events is zero, one event may be on Earth and the 
other on a supernova in the galaxy M31, but their 
separation must be a null ray (piece of a light 
cone). The backward-pointing light cone at a given 
event contains all the events by which that event 
can be influenced. The forward-pointing light cone 
contains all events that it can influence. The multi¬ 
tude of double light cones taking off from all the 
events of spacetime forms an interlocking causal 
structure. This structure makes the machinery of 
the physical world function as it does (further 
comments on this structure in Wheeler and Feyn¬ 
man 1945 and 1949 and in Zeeman 1964). If in 
a region where spacetime is flat, one can hide this 
structure from view by writing 

{As) 2 = (Ax 1 ) 2 + {Ax 2 ) 2 + {Ax 3 ) 2 + {Ax 4 ) 2 , 

with x 4 = ict , no one has discovered a way to 
make an imaginary coordinate work in the general 
curved spacetime manifold. If “x 4 = ict” cannot 
be used there, it will not be used here. In this 
chapter and hereafter, as throughout the literature 
of general relativity, a real time coordinate is used, 
x° = t = ct CQnv (superscript 0 rather than 4 to 
avoid any possibility of confusion with the imagi¬ 
nary time coordinate). 


The components w a of any other vector w in this frame are similarly defined as Expansion of vector in terms 
the coefficients in such an expansion, of basis 


w = w a e a . (2.4) 

Notice: the subscript a on e a tells which vector, not which component! 


§2.4. THE METRIC TENSOR 

The metric tensor, one recalls from part IV of Box 1.3, is a machine for calculating 
the squared length of a single vector, or the scalar product of two different vectors. 
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Metric defined as machine 
for computing scalar 
products of vectors 


/ 



Figure 2.3. 

The 4-velocity of a particle in flat spacetime. The 4-velocity u is the unit vector 
(arrow) tangent to the particle’s world line—one tangent vector for each event on 
the world line. In a specific Lorentz coordinate system, there are basis vectors of 
unit length, which point along the four coordinate axes: e 0 , e v e 2 , e 3 . The 4-velocity, 
like any vector, can be expressed as a sum of components along the basis vectors: 

u = u°e 0 4- u l e 1 + u 2 e 2 4- u 3 e 3 = u a e a . 


More precisely, the metric tensor g is a machine with two slots for inserting vectors 

slot 1 slot 2 

g( , )• ( 2 . 5 ) 

Upon insertion, the machine spews out a real number: 

g(u , v) = “scalar product of u and v” also denoted u • v . (2.6) 

g(u , u) = “squared length of i/,” also denoted i/ 2 . 

Moreover, this number is independent of the order in which the vectors are inserted 
(“symmetry of metric tensor”), 

g(u, v) = g(v, U); ( 2 . 7 ) 

and it is linear in the vectors inserted 

g(au + bv f w) ~ g(w , au + bv) — ag(u , iv) + bg(v , iv). (2.8) 

Because the metric “machine” is linear, one can calculate its output, for any input, 
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as follows, if one knows only what it does to the basis vectors e a of a Lorentz frame. 

(1) Define the symbols (“metric coefficients”) r} a/3 by 

Tj a/J = g(e a , e p ) = e„ • e $ . (2.9) 


(2) Calculate their numerical values from the known squared length of the separation 
vector <f = Ax a e n between two events: 


(As) 2 = — (Ax 0 ) 2 + (Ax 1 ) 2 + (Ax 2 ) 2 + (Ax 3 ) 2 
= g(Ax a e a , Ax^ef) = Ax a Ax^g(e a , e $ ) 

= Ax a Ax^ for every choice of Ax tt 




-1 

0 

0 

0 


0 0 0 
1 0 0 
0 1 0 
0 0 1 


in any Lorentz frame. 


(3) Calculate the scalar product of any two vectors u and v from 


( 2 . 10 ) 


u v - g(u, v) = g(u tt e a , v^ef) = u a v< 3 g(e a , e fi )\ 
u • v = u a v^ij a p = —u°v° + mV + u 2 u 2 + u 3 v 3 . (2.11) 

That one can classify directions and vectors in spacetime into “timelike” (negative 
squared length), “spacelike” (positive squared length), and “null” ox “lightlike ”(zero 
squared length) is made possible by the negative sign on the metric coefficient ij 00 . 

Box 2.2 shows applications of the above ideas and notation to two elementary 
problems in special relativity theory. 


§2.5. DIFFERENTIAL FORMS 

Vectors and the metric tensor are geometric objects that are already familiar from 
Chapter 1 and from elementary courses in special relativity. Not so familiar, yet 
equally important, is a third geometric object: the “differential form” or “1-form.” 

Consider the 4-momentum p of a particle, an electron, for example. To spell out 
one concept of momentum, start with the 4-velocity, u = diP/dr, of this electron 
(“spacetime displacement per unit of proper time along a straightline approximation 
of the world line”). This is a vector of unit length. Multiply by the mass m of the 
particle to obtain the momentum vector 

p = mu. 

But physics gives also quite another idea of momentum. It associates a de Broglie 
wave with each particle. Moreover, this wave has the most direct possible physical 
significance. Diffract this wave from a crystal lattice. From the pattern of diffraction, 
one can determine not merely the length of the de Broglie waves, but also the pattern 
in space made by surfaces of equal, integral phase <j> = 7, tj> = 8, <j> = 9.This 


Metric coefficients 


Scalar products computed 
from components of vectors 


The 1-form illustrated by de 
Broglie waves 




Box 2.2 WORKED EXERCISES USING THE METRIC 


Exercise: Show that the squared length of a test particle’s 4-velocity u is -1. 
Solution: In any Lorentz frame, using the components (2.2). one calculates as follows 

U 2 = g(u,u) = U a li^7l a p = -(u 0 ) 2 + (w 1 ) 2 + (W 2 ) 2 + (w 3 ) 2 



Exercise: Show that the rest mass of a particle is related to its energy and momen¬ 
tum by the famous equation 

(me 2 ) 2 = E 2 — (pc) 2 
or, equivalently (geometrized units!), 

m 2 = E 2 — p 2 . 

First Solution: The 4-momentum is defined by p = mu , where i/ is the 4-velocity 
and m is the rest mass. Consequently, its squared length is 

p 2 = m 2 u 2 = —m 2 

= ~( m « 0 ) 2 + m 2 « 2 - - 

1 - v 2 1 - v 2 

t t 

E 2 p 2 

Second Solution: In the frame of the observer, where £ and p are measured, the 
4-momentum splits into time and space parts as 

p° = £, p^ -f p 2 e 2 + p 3 e 3 = p; 
hence, its squared length is 

p 2 = —£ 2 + p 2 . 

But in the particle’s rest frame, p splits as 

p°~m > p l = p 2 = p 3 — 0; 

hence, its squared length isp 2 = —m 2 . But the squared length is a geometric object 
defined independently of any coordinate system; so it must be the same by whatever 
means one calculates it: 


— p 2 — m 2 — E 2 — p 2 . 
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Figure 2.4. 

The vector separation v — — *P 0 between two neighboring events 

& 0 and a 1-form a; and the piercing of a by v to give the number 

v ) = (number of surfaces pierced) = 4.4 

(4.4 “bongs of bell”). When o is made of surfaces of constant phase, 
4> = 17, <j> = 18, <> == 19,... of the de Broglie wave for an electron, 
then (a, v ) is the phase difference between the events t? 0 and Note 
that o is not fully specified by its surfaces; an orientation is also 
necessary. Which direction from surface to surface is “positive”; i.e., 
in which direction does </> increase? 



pattern of surfaces, given a name provides the simplest illustration one can 
easily find for a 1-form, 

The pattern of surfaces in spacetime made by such a 1-form: what is it good for? 

Take two nearby points in spacetime, 9 and 9 0 . Run an arrow v = 9 — 9 0 from 

9 0 to 9. It will pierce a certain number of the de Broglie wave’s surfaces of integral Vector pierces 1-form 

phase, with a bong of an imaginary bell at each piercing. The number of surfaces 

pierced (number of “bongs of bell”) is denoted 


1-form pierced 


<*•?>: 

_J L 


vector that pierces 


in this example it equals the phase difference between tail (9 0 ) and tip (9) of u, 

(k, v > = - <f>(^ 0 ). 

See Figure 2.4. 

Normally neither 9 0 nor 9 will lie at a point of integral phase. Therefore one 

can and will imagine, as uniformly interpolated between the surfaces of integral 

phase, an infinitude of surfaces with all the intermediate phase values. With their 

aid, the precise value of ( k , v ) = §(9) — <t>(9 0 ) can be determined. 

To make the mathematics simple, regard k not as the global pattern of de Broglie- 

wave surfaces, but as a local pattern near a specific point in spacetime. Just as the The 1-form viewed as family 

vector u — d9/dr represents the local behavior of a particle’s world line (linear of f ,at ' equa (| y spaced 

surfaces 

approximation to curved line in general), so the 1-form k represents the local form 
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Figure 2.5. 

This is a dual-purpose figure, (a) It illustrates the de Broglie wave 1-form k at an event & 0 (family 
of equally spaced, flat surfaces, or “hyperplanes” approximating the surfaces of constant phase), (b) 
It illustrates the gradient d$ of the function 4> (concept defined in §2.6), which is the same oriented 
family of flat surfaces 

k = df>. 

At different events, k = cfy is different—different orientation of surfaces and different spacing. The 
change in <t> between the tail and tip of the very short vector v is equal to the number of surfaces of 
d<t> pierced by v, (</<£, v); it equals —0.5 in this figure. 


of the de Broglie wave’s surfaces (linear approximation; surfaces flat and equally 
spaced; see Figure 2.5). 

Regard the 1 -form k as a machine into which vectors are inserted, and from which 
numbers emerge. Insertion of v produces as output (k, v ). Since the surfaces of 
k are flat and equally spaced, the output is a linear function of the input: 

<£, au + bv) = a(k, u > + b(k , v). (2.12a) 

The 1-form viewed as linear This, in fact, is the mathematical definition of a 1-form: a 1-form is a linear, real- 

valued function of vectors; i.e., a linear machine that takes in a vector and puts out 
a number. Given the machine k, it is straightforward to draw the corresponding 
surfaces in spacetime. Pick a point 9 0 at which the machine is to reside. The surface 
of k that passes through contains points 9 for which (k, 9 — 9 0 ) = 0 (no bongs 
of bell). The other surfaces contain points with < k , 9 — 9 0 ) = ±1, ±2, ±3,_ 




Figure 2.6. 

The addition of two 1-forms, a and /?, to produce the 1-form a . Required is a pictorial construction 
that starts from the surfaces of a and /?, e.g., <a, 9 — 9^) = • • • — 1, 0, 1, 2,..., and constructs those 
of a = a + p. Such a construction, based on linearity (2.12b) of the addition process, is as follows. 

(1) Pick several vectors u, v y . .. that lie parallel to the surfaces of p (no piercing!), but pierce precisely 
3 surfaces of a; each of these must then pierce precisely 3 surfaces of a: 

(a, u) = <a + P, u) = <a, u) = 3. 

(2) Pick several other vectors w ,... that lie parallel to the surfaces of a but pierce precisely 3 surfaces 
of p\ these will also pierce precisely 3 surfaces of a . (3) Construct that unique family of equally spaced 
surfaces in which u, v,. . ., w ,... all have their tails on one surface and their tips on the third succeeding 
surface. 


Sometimes 1-forms are denoted by boldface, sans-serif Latin letters with tildes 
over them, e.g., k ; but more often by boldface Greek letters, e.g., a, 0, a . The output 
of a 1-form a, when a vector u is inserted, is called “the value of a on u” or “the 
contraction of a with u” 

Also, l*forms, like any other kind of function, can be added. The 1-form aa + bp Addition of 1 -forms 
is that machine (family of surfaces) which puts out the following number when a 
vector u is put in: 


(aa -b bp , u > = a(a , u) + b(P , i/>. (2.12b) 

Figure 2.6 depicts this addition in terms of surfaces. 

One can verify that the set of all 1-forms at a given event is a “vector space” 
in the abstract, algebraic sense of the term. 

Return to a particle and its de Broglie wave. Just as the arrow p = md^/dr 
represents the best linear approximation to the particle’s actual world line near 
so the flat surfaces of the 1-form k provide the best linear approximation to the 
curved surfaces of the particle’s de Broglie wave, and k itself is the linear function 
that best approximates the de Broglie phase <j> near ^ 0 : 

<f>(^) = <K& 0 ) + ^ - ^ 0 > 

+ terms of higher order in (<? — ^ 0 ). 


(2.13) 







of E of D 

Figure 2.7. 

Several vectors, A, fl, C, D, £, and corresponding 1-forms A , B , 0, £. The process of drawing U 

corresponding to a given vector U is quite simple. (1) Orient the surfaces of U orthogonal to the vector 
U. (Why? Because any vector V that is perpendicular to U must pierce no surfaces of U 
(0 = U ‘ V — <(/, V)) and must therefore lie in a surface of U.) (2) Space the surfaces of U so that 
the number of surfaces pierced by some arbitrary vector Y (e.g., Y = U) is equal to Y U. 

Note that in the figure the surfaces of B are, indeed, orthogonal to B; those of C are, indeed, orthogonal 
to C, etc. If they do not look so, that is because the reader is attributing Euclidean geometry, not Lorentz 
geometry, to the spacetime diagram. He should recall, for example, that because C is a null vector, it 
is orthogonal to itself (C' C = 0), so it must itself lie in a surface of the 1-form C. Confused readers 
may review spacetime diagrams in a more elementary text, e.g., Taylor and Wheeler (1966). 


Physical correspondence Actually, the de Broglie 1-form k and the momentum vector p contain precisely 

between 1 -forms and vectors t h e same information, both physically (via quantum theory) and mathematically. 

To see their relationship, relabel the surfaces of k by fi x phase, thereby obtaining 
the “momentum 1 -form”p. Pierce this 1-form with any vector v, and find the result 
(exercise 2.1) that 

pv = (p,v). (2.14) 

In words: the projection of v on the 4-momentum vector p equals the number of 
surfaces it pierces in the 4-momentum 1-form p. Examples: Vectors v lying in a 
surface of p (no piercing) are perpendicular to p (no projection); p itself pierces 
p 2 = — m 2 surfaces of p. 

Mathematical correspondence Corresponding to any vector p there exists a unique 1-form (linear function of 
between 1 -forms and vectors vectors) p defined by equation (2.14). And corresponding to any 1-form p, there 

exists a unique vector p defined by its projections on all other vectors, by equation 

(2.14). Figure 2.7 shows several vectors and their corresponding 1-forms. 



§2.6. GRADIENTS AND DIRECTIONAL DERIVATIVES 
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A single physical quantity can be described equally well by a vector p or by the 
corresponding 1-form p. Sometimes the vector description is the simplest and most 
natural; sometimes the 1-form description is nicer. Example: Consider a 1-form 
representing the march of Lorentz coordinate time toward the future—surfaces 

x° = ..., 7, 8, 9,-The corresponding vector points toward the past [see Figure 

2.7 or equation (2.14)]; its description of the forward march of time is not so nice! 

One often omits the tilde from the 1-form p corresponding to a vector p, and 
uses the same symbol p for both. Such practice is justified by the unique correspond¬ 
ence (both mathematical and physical) between p and p. 


Exercise 2.1. EXERCISE 

Show that equation (2.14) is in accord with the quantum-mechanical properties of a de Broglie 
wave, 

i[/ — e i(f> = exp [/(£ * .r — <*>/)]. 


§2.6. GRADIENTS AND DIRECTIONAL DERIVATIVES 

There is no simpler 1-form than the gradient, “df” of a function/. Gradient a 1-form? Gradient of a function as a 

How so? Hasn’t one always known the gradient as a vector? Yes, indeed, but only 1 “ form 

because one was not familiar with the more appropriate 1-form concept. The more 

familiar gradient is the vector corresponding, via equation (2.14), to the 1-form 

gradient. The hyperplanes representing df at a point are just the level surfaces 

of/itself, except for flattening and adjustment to equal spacing (Figure 2.5; identify 

/here with <f> there). More precisely, they are the level surfaces of the linear function 

that approximates / in an infinitesimal neighborhood of f? 0 . 

Why the name ‘“gradient”? Because df describes the first order changes in / in 
the neighborhood of c f 0 : 

/0?) = f(f 0 ) -I- (df — *P 0 > -|- (nonlinear terms). (2.15) 

[Compare the fundamental idea of “derivative” of something as “best linear ap¬ 
proximation to that something at a point”—an idea that works even for functions 
whose values and arguments are infinite dimensional vectors! See, e.g., Dieudonne 
(I960).] 

Take any vector v\ construct the curve l f(X) defined by ?P(X) — ?P 0 = Xv ; and 
differentiate the function/ along this curve: 

d v f= (d/dX) x = 0 Jl<nX)] = (df/dX)^ (2.16a) 

The “differential operator,” 


d v = (d/dX), 


! at X = 0, along curve — :? 0 = Xv? 


( 2 . 16 b) 
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Directional derivative 
operator defined 


Basis 1-forms 


which does this differentiating, is called the “directional derivative operator along 
the vector v.” The directional derivative S„/ and the gradient df are intimately 
related, as one sees by applying 3„ to equation (2.15) and evaluating the result at 
the point c f 0 : _ 

9 >,/= (df dtf/dX) = < df, v >. (2.17) 

This result, expressed in words, is: df is a linear machine for computing the rate 
of change of/along any desired vector v. Insert v into df, the output (“number 
of surfaces pierced; number of bongs of bell”) is 3„/—which, for sufficiently small 
v, is simply the difference in /between tip and tail of v. 


§2.7. COORDINATE REPRESENTATION OF 
GEOMETRIC OBJECTS 

In flat spacetime, special attention focuses on Lorentz frames. The coordinates x°(s?), 
x\°P), x 2 fP), x 3 (&) of a Lorentz frame are functions; so their gradients can be 
calculated. Each of the resulting “basis 1-forms,” 


«/“ = dx a , (2.18) 

has as its hyperplanes the coordinate surfaces x a = const; see Figure 2.8. Conse¬ 
quently the basis vector e a pierces precisely one surface of the basis 1-form w a , 


t 



Figure 2.8. 

The basis vectors and 1-forms of a particular Lorentz 
coordinate frame. The basis 1-forms are so laid 
out that 


<«/“, e^> = 
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while the other three basis vectors he parallel to the surfaces of uS* and thus pierce 
none: 

<«/“, e 0 ) = 8 a fi . (2.19) 

(One says that the set of basis 1-forms {cv a } and the set of basis vectors {eg} are 
the “duals” of each other if they have this property.) 

Just as arbitrary vectors can be expanded in terms of the basis e a , v = v a e a , so 
arbitrary 1-forms can be expanded in terms of cv 

a = opivfi. ( 2 . 20 ) 

The expansion coefficients a^ are called “the components of a on the basis 
These definitions produce an elegant computational formalism, thus: Calculate 
how many surfaces of a are pierced by the basis vector e a ; equations (2.19) and 
(2.20) give the answer: 

<<7, e a ) = {o p ujP, e a ) = e a ]> = 

i.e., 

(a ,e a )=o a . (2.21a) 

Similarly, calculate ( w a , v) for any vector v = the result is 

( uf a , v) = v a . (2.21b) 

Multiply equation (2.21a) by v a and sum, or multiply (2.21b) by o a and sum; the 
result in either case is 

<cr, v) = o a u a . (2.22) 

This provides a way, using components, to calculate the coordinate-independent 
value of (a, v). 

Each Lorentz frame gives a coordinate-dependent representation of any geometric 
object or relation: v is represented by its components v a \ a , by its components o a ; 
a point 9, by its coordinates x a ; the relation <a, v) = 17.3 by o a u a — 17.3. 

To find the coordinate representation of the directional derivative operator d v , 
rewrite equation (2.16b) using elementary calculus 


the result is 


_d_\ _ / dx a \ / a 

^ ^ ' at along ^(X>- & 0 = \v ' 

u a ; see equation (2.3) 
d v = v a d/dx a . 


(2.23) 


In particular, the directional derivative along a basis vector e a (components 
= S\) is 

0 a = d K = d/dx°. 

This should also be obvious from Figure 2.8. 


Expansion of 1-form in terms 
of basis 


Calculation and manipulation 
of vector and 1-form 
components 


Directional derivative in terms 
of coordinates 


(2.24) 
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Components of gradient 


EXERCISES 


The components of the gradient 1-form df which are denoted f a 

= (2.25a) 

are calculated easily using the above formulas: 

f, a — (df e a > [standard way to calculate components; equation (2.21a)] 

= ^af [by relation (2.17) between directional derivative and gradient] 

= df/cx a [by equation (2.24)]. 

Thus, in agreement with the elementary calculus idea of gradient, the components 
of df are just the partial derivatives along the coordinate axes: 

f, a = cf/cx a : i.e., df — (cf/dx a ) dx a . (2.25b) 

(Recall: w a = dx a .) The formula df = (df/dx a ) dx a suggests, correctly, that df is 
a rigorous version of the “differential” of elementary calculus; see Box 2.3. 

Other important coordinate representations for geometric relations are explored 
in the following exercises. 


Derive the following computationally useful formulas: 

Exercise 2.2. LOWERING INDEX TO GET THE 1-FORM 
CORRESPONDING TO A VECTOR 

The components u a of the 1-form u that corresponds to a vector u can be obtained by 
lowering an index’ with the metric coefficients 

u a = VaffU 0 ', > e - Uq = -U°, u k = u k . (2.26a) 

Exercise 2.3. RAISING INDEX TO RECOVER THE VECTOR 

One can return to the components of u by raising indices. 

u a = 

the matrix llij a ^ll is defined as the inverse of Ibj^ll, and happens to equal llrj aj8 ll 

= S a y \ = i) a/! for all a, ft. 

Exercise 2.4. VARIED ROUTES TO THE SCALAR PRODUCT 

The scalar product of u with v can be calculated in any of the following ways: 

u v = g(u, v) = u a v^q afj = u a v a = t(2.28) 


(2.26b) 

(2.27) 
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^ Box 2.3 DIFFERENTIALS 




The “exterior derivative” or “gradient” df of a 
function / is a more rigorous version of the ele¬ 
mentary concept of “differential.” 

In elementary textbooks, one is presented with 
the differential df as representing “an infinitesimal 
change in the function /(&)” associated with some 
infinitesimal displacement of the point but one 
will recall that the displacement of 9 is left arbi¬ 
trary, albeit infinitesimal. Thus df represents a 
change in / in some unspecified direction. 

But this is precisely what the exterior derivative 
df represents. Choose a particular, infinitesimally 
long displacement v of the point ff. Let the dis- 

___ 


placement vector v pierce df to give the number 
< dfv ) = d v f That number is the change of f in 
going from the tail of v to its tip. Thus df before 
it has been pierced to give a number, represents 
the change of f in an unspecified direction. The 
act of piercing df with v is the act of making 
explicit the direction in which the change is to be 
measured. The only failing of the textbook presen¬ 
tation, then, was its suggestion that df was a scalar 
or a number; the explicit recognition of the need 
for specifying a direction v to reduce df to a num¬ 
ber {dfv) shows that in fact df is a 1-form, the 
gradient of f 


§2.8. THE CENTRIFUGE AND THE PHOTON 

Vectors, metric, 1-forms, functions, gradients, directional derivatives: all these geo¬ 
metric objects and more are used in flat spacetime to represent physical quantities; 
and all the laws of physics must be expressible in terms of such geometric objects. 

As an example, consider a high-precision redshift experiment that uses the Moss- Geometric objects in action: 

bauer effect (Figure 2.9). The emitter and the absorber of photons are attached to example of centrifuge and 

photon 



Figure 2.9. 

The centrifuge and the photon. 


the rim of a centrifuge at points separated by an angle a, as measured in the inertial 
laboratory. The emitter and absorber are at radius r as measured in the laboratory, 
and the centrifuge rotates with angular velocity to. Problem: What is the redshift 
measured, 

(^absorbed ^emitted)/^emitted'’ 


in terms of t*\ r, and a? 
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Solution: Let u e be the 4-velocity of the emitter at the event of emission of a 
given photon; let u a be the 4-velocity of the absorber at the event of absorption; 
and let p be the 4-momentum of the photon. All three quantities are vectors defined 
without reference to coordinates. Equally coordinate-free are the photon, energies 
E e and E a measured by emitter and absorber. No coordinates are needed to describe 
the fact that a specific emitter emitting a specific photon attributes to it the energy 
E e \ and no coordinates are required in the geometric formula 

E e = ~P • "e (2.29) 

for E e . [That this formula works can be readily verified by recalling that, in the 
emitter’s frame, u e ° = 1 and uj = 0; so 

E e = = -Po = +p° 

in accordance with the identification “(time component of 4-momentum) = (en¬ 
ergy-’ J Analogous to equation (2.29) is the purely geometric formula 

E a = ~P ' «o 

for the absorbed energy. 

The ratio of absorbed wavelength to emitted wavelength is the inverse of the 
energy ratio (since E = hv = hc/X ): 

_ E e _ -P’ Ue 
X e E a ~P • "a 

This ratio is most readily calculated in the inertial laboratory frame 

A a _ P\° ~ /> V — P\° ~P'U e n ~ n 

X e P°Ua° “ /> V P°“a° ~ P * « a 

(Here and throughout we use boldface Latin letters for three-dimensional vectors 
in a given Lorentz frame; and we use the usual notation and formalism of three- 
dimensional, Euclidean vector analysis to manipulate them.) Because the magnitude 
of the ordinary velocity of the rim of the centrifuge, v = tor, is unchanging in time, 
u e ° and u a ° are equal, and the magnitudes—but not the directions—of u e and u a 
are equal: 

u e = “ a ° = (1 - v 2 )~ 1/2 , \u e \ = K| = v/(l - v 2 ) 1/2 . 

From the geometry of Figure 2.9, one sees that u e makes the same angle with p 
as does « a . Consequently, p-u e =p-u a , and A absorbed /A emitted = 1. There is no 
redshift! 

Notice that this solution made no reference whatsoever to Lorentz transforma¬ 
tions they have not even been discussed yet in this book! The power of the geomet¬ 
ric, coordinate-free viewpoint is evident! 
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One must have a variety of coordinate-free contacts between theory and experiment in order EXERCISES 
to use the geometric viewpoint. One such contact is the equation E — — p • u for the energy 
of a photon with 4-momentum p, as measured by an observer with 4-velocity u . Verify the 
following other points of contact. 

Exercise 2.5. ENERGY AND VELOCITY FROM 4-MOMENTUM 

A particle of rest mass m and 4-momentum p is examined by an observer with 4-velocity 
u . Show that just as (a) the energy he measures is 

E = —p * u; (2.31) 

so (b) the rest mass he attributes to the particle is 

= -p 2 ; (2.32) 

(c) the momentum he measures has magnitude 

\p\ = t(P • u ) 2 + (p • p)] 1/2 ; (2.33) 

(d) the ordinary velocity v he measures has magnitude 



(234) 


where |p| and E are as given above; and (e) the 4-vector v, whose components in the 
observer’s Lorentz frame are 


is given by 


v° = 0, v> = (dx’/dt) forparticie = ordinary velocity, 


p + (p • t/)t/ 
-pu 


(2.35) 


Exercise 2.6. TEMPERATURE GRADIENT 

To each event <2 inside the sun one attributes a temperature T(i2), the temperature measured 
by a thermometer at rest in the hot gas there. Then T{<2) is a function; no coordinates are 
required for its definition and discussion. A cosmic ray from outer space flies through the 
sun with 4-velocity u. Show that, as measured by the cosmic ray’s clock, the time derivative 
of temperature in its vicinity is 


dT/dj = d u T = < dT i u >. 


(2.36) 


In a local Lorentz frame inside the sun, this equation can be written 

dT = 1ja jnL = 1 dT v> dT 

dr dx a Vl — v 2 dt — v 2 


(2.37) 


Why is this result reasonable? 
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§2.9. LORENTZ TRANSFORMATIONS 

To simplify computations, one often works with the components of vectors and 
1-forms, rather than with coordinate-free language. Such component manipulations 
sometimes involve transformations from one Lorentz frame to another. The reader 
is already familiar with such Lorentz transformations; but the short review in Box 
2.4 will refresh his memory and acquaint him with the notation used in this book. 
Lorentz transformations: of The key entities in the Lorentz transformation are the matrices \\A a ' p \\ and llvl^ll; 

' ' 3 es the first transforms coordinates from an unprimed frame to a primed frame, while 

the second goes from primed to unprimed 

X°‘ = A a ' p x», x* = A^ a .x a . (2.38) 

Since they go in opposite directions, each of the two matrices must be the inverse 
of the other: 



A^pAPy = d a ‘ Y ; 

A^A a ' y = 8P y . 

(2.39) 


From the coordinate-independent nature of 4-velocity, u = (dx a /dr)e a , 
derives the expressions 

one readily 

Of basis vectors 

e a' “ 

e/3 = e a'A a 

(2.40) 


for the basis vectors of one frame in terms of those of the other; and from other 
geometric equations, such as 


v = e a v a 

(a, v} = o a v a 

a = o a iv‘ 

= Bp.vP\ 

= Op,V0\ 

' = 



one derives transformation laws 



Of basis 1 -forms 

w a ' = A a ' pU)&, 

u/0 = A0 a ,w a '\ 

(2.41) 

Of components 

if' = A^pv 1 *, 

v 13 = AP a ,if'\ 

(2.42) 


<V = OpA^,, 

Op = o a ,A a ’p. 

(2.43) 


One need never memorize the index positions in these transformation laws. One 
need only line the indices up so that (1) free indices on each side of the equation 
are in the same position; and (2) summed indices appear once up and once 
down. Then all will be correct! (Note; the indices on A always run “northwest to 
southeast.”) 



Box 2.4 LORENTZ TRANSFORMATIONS 

Rotation of Frame of Reference by Angle 0 in x-y Plane 


Slope s = tan 0; sin 6 = — * : 

(i + s l y /l 


cos 8 — 


(1 + s 2 ) 1/2 


t = T 

x — x cos 8 — y sin 8 
y = x sin 8 -f J7cos0 



9 x 


T= t 

x = jc cos 8 -f y sin 8 
y = sin 8 + y cos 8 
z = z 


All signs follow from sign of this term. Positive by inspection of point 9. 

Combination of Two Such Rotations 


S -I “b Sn 

s = --— or 8 = 8, + 0 2 

1 — 1 2 


Boost of Frame of Reference by Velocity Parameter a in z-t Plane 

Velocity ft = tanh a; sinh a = - _^ 2)1/2 ; cosh a = -- - _ = V 



0 tan 8 = velocity ft 

= tanh a 


t = t cosh a + 2 sinh a 
* = jc 

7= J 

2 = fsinh a + 2 cosh a 


/ = t cosh a — z sinh a 

.X = JC 

J = y 

z — —t sinh a + z cosh a 


All signs follow from sign of this term. Positive because object at rest at z = 0 
in rocket frame moves in direction of increasing 2 in lab frame. 


Matrix notation: x^ = A*-x v , x " = <» 


\\Av-U = 


cosh a 

0 

0 

sinh a 

cosh a 

0 

0 

— sinh a 

0 

1 

0 

0 

0 

1 

0 

0 

0 

0 

1 

o , pgi = 

0 

0 

1 

0 

sinh a 

0 

0 

cosh a 

— sinh a 

0 

0 

cosh a 



Box 2.4 (continued) 


Energy-momentum 4-vector Charge density-current 4-vector 
E = £ cosh a + p z sinh a P = P cosh a + j z sinh a 


P x =P X 

p v =p v 


jv =jy 


p z = E sinh a + p z cosh a j z — P sinh a -f j z cosh a 


Aberration, incoming photon: 


sin 0 = 


cos 0 = 


-p ± _ (1 - P 2 ) 1/2 sin 0 
E 1 — ft cos 0 

—p z cos 0 — ft 


E 1 — ft cos 0 
tan (0/2) = e a tan (0/2) 


. a -Px (1 — /? 2 ) 1/2 sin 0 
E 1 + ft cos 0 

F -p 2 cos 0 -f /? 

E 1 + ft cos 0 
tan (0/2) — e~ a tan (0/2) 


Combination of Two Boosts in Same Direction 


fti + $2 

1 + 


or a = a, + a 9 . 


General Combinations of Boosts and Rotations 


Spinor formalism of Chapter 41 

Poincare Transformation 

X* = A\,X a ' + a*- 

Condition on the Lorentz part of this transformation: 

ds' 2 ~ ri a ,p dx a ' dx F = ds 2 = r\ tlv A tL a ,A v ^ dx a> dxP 

or A t t]A = t] (matrix equation, with T indicating “transposed,” or rows and columns 
interchanged). 

Effect of transformation on other quantities: 

u » = A» a ,u a ’' (4-velocity) u a , = 

P M = Atl a'P a (4-momentum) p a . - p^A^; 

FtiV ~ (electromagnetic field) F a .p = F^A^A^,; 

e a' ~ e n Ati a' (basis vectors); 

w a — A a ^uj* 1 (basis 1-forms); 

u = e a >u a = — u (the 4-velocity vector). 
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Exercise 2.7. BOOST IN AN ARBITRARY DIRECTION 

An especially useful Lorentz transformation has the matrix components 


A 0 ) - A’ 0 = -fiyn\ 

A r k = A k ) = (y - I )n*n k + 5* (2.44) 

A*,,' = (same as A*\ but with fi replaced by -fi), 

where fi, n l , n 2 , and n 3 are parameters, and n 2 = (n 1 ) 2 + ( n 2 ) 2 + ( n 3 ) 2 = 1. Show (a) that 
this does satisfy the condition A T y\A = r\ required of a Lorentz transformation (see Box 2.4); 

(b) that the primed frame moves with ordinary velocity fin as seen in the unprimed frame; 

(c) that the unprimed frame moves with ordinary velocity - fin (i.e., v v = — fin \ v 2 ' = —fin 2 , 
v*' = - fa*) as seen in the primed frame; and (d) that for motion in the z direction, the 
transformation matrices reduce to the familiar form 


y 

0 

0 

-fiy 


y 

0 

0 

py 

0 

i 

0 

0 


0 

l 

0 

0 

0 

0 

1 

0 

, WA\.\\ = 

0 

0 

1 

0 

-h 

0 

0 

y 


h 

0 

0 

y 


(2.45) 


§2.10. COLLISIONS 

Whatever the physical entity, whether it is an individual mass in motion, or a torrent 
of fluid, or a field of force, or the geometry of space itself, it is described in classical 
general relativity as a geometric object of its own characteristic kind. Each such object 
is built directly or by abstraction from identifiable points, and needs no coordinates 
for its representation. It has been seen how this coordinate-free description translates 
into, and how it can be translated out of, the language of coordinates and compo¬ 
nents, and how components in a local Lorentz frame transform under a Lorentz 
transformation. Turn now to two elementary applications of this mathematical 
machinery to a mass in motion. One has to do with short-range forces (collisions, 
this section); the other, with the long-range electromagnetic force (Lorentz force law, 
next chapter). 

In a collision, all the change in momentum is concentrated in a time that is short 
compared to the time of observation. Moreover, the target is typically so small, and 
quantum mechanics so dominating, that a probabilistic description is the right one. 
A quantity 



gives the cross section (cm 2 ) for scattering into the element of solid angle dtt at 
the deflection angle 6 ; a more complicated expression gives the probability that the 


EXERCISE 


Scattering of particles 
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2. FOUNDATIONS OF SPECIAL RELATIVITY 



Conservation of 
energy-momentum in a 
collision 


original particle will enter the aperture df2 at a given polar angle 0 and azimuth 
<f> and with energy E to E -f dE , while simultaneously products of reaction also 
emerge into specified energy intervals and into specified angular apertures. It would 
be out of place here to enter into the calculation of such cross sections, though it 
is a fascinating branch of atomic physics. It is enough to note that the cross section 
is an area oriented perpendicular to the line of travel of the incident particle. 
Therefore it is unaffected by any boost of the observer in that direction, provided 
of course that energies and angles of emergence of the particles are transformed 
in accordance with the magnitude of that boost (“same events seen in an altered 
reference system”). 

Over and above any such detailed account of the encounter as follows from the 
local dynamic analysis, there stands the law of conservation of energy-momentum: 

2 *>= 2 • 

original final (2.47) 

particles, J particles, K 


Out of this relation, one wins without further analysis such simple results as the 
following. (1) A photon traveling as a plane wave through empty space cannot split 
(not true for a focused photon!). (2) When a high-energy electron strikes an electron 
at rest in an elastic encounter, and the two happen to come off sharing the energy 
equally, then the angle between their directions of travel is less than the Newtonian 
value of 90°, and the deficit gives a simple measure of the energy of the primary. 
(3) When an electron makes a head-on elastic encounter with a proton, the formula 
for the fraction of kinetic energy transferred has three rather different limiting forms, 
according to whether the energy of the electron is nonrelativistic, relativistic, or 
extreme-relativistic. (4) The threshold for the production of an (<? + , e~) pair by a 
photon in the field of force of a massive nucleus is 2 m e . (5) The threshold for the 
production of an (e + , e~) pair by a photon in an encounter with an electron at rest * 
is 4 m e (or 4w e — e when account is taken of the binding of the e + e~e~ system in 
a very light “molecule”). All these results (topics for independent projects!) and more 
can be read out of the law of conservation of energy-momentum. For more on this 
topic, see Blaton (1950), Hagedorn(1964), and Chapter 4 and the last part of Chapter 
5 of Sard (1970). 



CHAPTER 


THE ELECTROMAGNETIC 

FIELD 


The rotating armatures of every generator and every motor in this 
age of electricity are steadily proclaiming the truth of the 
relativity theory to all who have ears to hear. 

LEIGH PAGE (1941) 


§3.1. THE LORENTZ FORCE AND 

THE ELECTROMAGNETIC FIELD TENSOR 

At the opposite extreme from an impulsive change of momentum in a collision (the 
last topic of Chapter 2) is the gradual change in the momentum of a charged particle 
under the action of electric and magnetic forces (the topic treated here). 

Let electric and magnetic fields act on a system of charged particles. The accelera- Lorentz force as definer of 
tions of the particles reveal the electric and magnetic field strengths. In other words, f> e[ ds and predicter of 
the Lorentz force law, plus measurements on the components of acceleration of test motlons 
particles, can be viewed as defining the components of the electric and magnetic 
fields. Once the field components are known from the accelerations of a few test 
particles, they can be used to predict the accelerations of other test particles (Box 
3.1). Thus the Lorentz force law does double service (1) as definer of fields and (2) 
as predicter of motions. 

Here and elsewhere in science , as stressed not least by Henri Poincare, that view 
is out of date which used to say , “Define your terms before you proceed.” All the laws 
and theories of physics, including the Lorentz force law, have this deep and subtle 
character, that they both define the concepts they use {here B and E) and make 
statements about these concepts. Contrariwise, the absence of some body of theory, 
law, and principle deprives one of the means properly to define or even to use concepts. 

Any forward step in human knowledge is truly creative in this sense: that theory, 
concept, law, and method of measurement—forever inseparable—are born into the 
world in union. 
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3. THE ELECTROMAGNETIC FIELD 


A 


Box 3.1 LORENTZ FORCE LAW AS BOTH DEFINER OF FIELDS AND 
PREDICTER OF MOTIONS 


How one goes about determining the components 
of the field from measurements of accelerations is 
not different in principle for electromagnetism and 
for gravitation. Compare the equations in the two 
cases: 

= JL fa u /3 i n a Lorentz frame, (1) 
dr* m p 

and 

D 2 ^ a 

0 = — R a p y8 ufli; y u s in any coordinate system. 

( 2 ) 

To make explicit the simpler procedure for elec¬ 
tromagnetism will indicate in broad outline how 
one similarly determines all the components of 
R a p yS for gravity. Begin by asking how many test 
particles one needs to determine the three compo¬ 
nents of B and the three components of E in the 
neighborhood under study. For one particle, three 
components of acceleration are measurable; for a 
second particle, three more. Enough? No! The 
information from the one duplicates in part the 
information from the other. The proof? Whatever 
the state of motion of the first test particle, pick 
one’s Lorentz frame to be moving the same way. 
Having zero velocity in this frame, the particle has 
a zero response to any magnetic field. The electric 
field alone acts on the particle. The three compo¬ 
nents of its acceleration give directly the three 
components E x , £ y , E z of the electric field. The 
second test particle cannot be at rest if it is to do 
more than duplicate the information provided by 
the first test particle. Orient the x-axis of the frame 


of reference parallel to the direction of motion of 
this second particle, which will then respond to 
and measure the components B y and B z of the 
magnetic field. Not so B x \ The acceleration in the 
x-direction merely remeasures the already once 
measured E x . To evaluate B x , a third test particle 
is required, but it then gives duplicate information 
about the other field components. The alternative? 
Use all N particles simultaneously and on the same 
democratic footing, both in the evaluation of the 
six Fol{3 and in the testing of the Lorentz force, by 
applying the method of least squares. Thus, write 
the discrepancy between predicted and observed 
acceleration of the Kth particle in the form 

u/-j-F a$ u^ = Sa/. (3) 

Take the squared magnitude of this discrepancy 
and sum over all the particles 

S = 2 T 1*80 *80*,. (4) 

k 

In this expression, everything is regarded as known 
except the six F a0- Minimize with respect to these 
six unknowns. In this way, arrive at six equations 
for the components of B and E. These equations 
once solved, one goes back to (3) to test the Lor¬ 
entz force law. 

The 6x6 determinant of the coefficients in the 
equation for the Fad automatically vanishes when 
there are only two test particles. The same fine of 
reasoning permits one to determine the minimum 
number of test particles required to determine all 
the components of the Riemann curvature tensor. 


§3.1. LORENTZ FORCE AND ELECTROMAGNETIC FIELD TENSOR 
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The Lorentz force law, written in familiar three-dimensional notation, with 
E = electric field, B — magnetic field, v = ordinary velocity of particle, p — 
momentum of particle, e — charge of particle, reads 


The three-dimensional version 
of the Lorentz force law 


(i dp/dt) = e(E -j - vx B). 


(3.1) 


Useful though this version of the equation may be, it is far from the geometric spirit 
of Einstein. A fully geometric equation will involve the test particle’s energy-mo¬ 
mentum 4-vector, p, not just the spatial part p as measured in a specific Lorentz 
frame; and it will ask for the rate of change of momentum not as measured by 
a specific Lorentz observer (d/dt), but as measured by the only clock present a priori 
in the problem: the test particle’s own clock {d/dr). Thus, the lefthand side of a 
fully geometric equation will read 


dp/dr = . 

The righthand side, the Lorentz 4-force, must also be a frame-independent object. 
It will be linear in the particle’s 4-velocity u, since the frame-dependent expression 

— = * = —-- (E+ vxB) = e(u°E+ u*B) (3.2a) 

dj Vl - V’ 2 dt vl - v 2 


is linear in the components of u. Consequently, there must be a linear machine 
named Faraday , or F, or “electromagnetic field tensor,” with a slot into which one 
inserts the 4-velocity of a test particle. The output of this machine, multiplied by 
the particle’s charge, must be the electromagnetic 4-force that it feels: 

dp/dr = eF(u). (3.3) 


Electromagnetic field tensor 
defined 


Geometrical version of 
Lorentz force law * 


By comparing this geometric equation with the original Lorentz force law (equa¬ 
tion 3.2a), and with the companion energy-change law 


dp 0 
dr 


1 dE 
Vl — v 2 dt 


— _ eE • v = eE • «, 

Vl — V 2 


(3.2b) 


one can read olf the components of F in a specific Lorentz frame. The components 
of dp/dr are dp a /dr , and the components of eF(u) can be written (definition of F a pl) 
eF a puP . Consequently 

dp a /dr = eF a p uP (3.4) 

must reduce to equations (3.2a,b). Indeed it does if one makes the identification 


p 

= 0 

P - 1 

/* = 2 

fi = 3 

a = 0 

0 

E x 

Ey 

E; 

a = 1 

£* 

0 

B z 

-By 

a = 2 

Ey 

- B , 

0 

B z 

a = 3 

E* 

By 

~B X 

0 


Components of 
electromagnetic field tensor 


(3.5) 
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EXERCISE 


Examples of tensors 


More often seen in the literature are the “covariant components,” obtained by 
lowering an index with the metric components: 

= V a y FV„; (3.6) 


= 



(3.7) 


This matrix equation demonstrates the unity of the electric and magnetic fields. 
Neither one by itself, E or B, is a frame-independent, geometric entity. But merged 
together into a single entity, F = Faraday, they acquire a meaning and significance 
that transcends coordinates and reference frames. 


Exercise 3.1. 

Derive equations (3.5) and (3.7) for the components of Faraday by comparing (3.4) with 
(3.2a,b), and by using definition (3.6). 


§3.2. TENSORS IN ALL GENERALITY 

A digression is in order. Now on the scene are several different tensors: the metric 
tensor g (§2.4), the Riemann curvature tensor Riemann (§1.6), the electromagnetic 
field tensor Faraday (§3.1). Each has been defined as a linear machine with input 
slots for vectors, and with an output that is either a real number, e.g., g(u , v ), or 
a vector, e.g., Riemann (i u , v , iv) and Faraday (u). 

Should one make a distinction between tensors whose outputs are scalars, and 
tensors whose outputs are vectors? No! A tensor whose output is a vector can be 
reinterpreted trivially as one whose output is a scalar. Take, for example, Fara¬ 
day = F ' Add a new slot for the insertion of an arbitrary 1-form a, and gears and 
wheels that guarantee the output 

F(a, u) — (a, F(u)} = real number. (3.8) 

Then permit the user to choose whether he inserts only a vector, and gets out the 
vector F (..., i#) = F{u\ or whether he inserts a form and a vector, and gets out 
the number F(a , u). The same machine will do both jobs. Moreover, in terms of 
components in a given Lorentz frame, both jobs are achieved very simply: 

F (..., u) is a vector with components F a ^u^\ 

F(a , u) is the number < a , F(. .., i/)> = o a F a 


(3.9) 


§3.2. TENSORS 
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By analogy, one defines the most general tensor H and its rank as follows: 
H is a linear machine with n input slots for n 1-forms, and m input slots for m vectors; 
given the requested input, it puts out a real number denoted 

H( a , A,. .., 0 , u, v, ..., w). (3.10) 

n 1-forms m vectors 

For most tensors the output changes when two input vectors are interchanged, 

Riemann(a , u, v , iv) ^ Riemann(a f v y u, iv), (3.11) 

or when two input 1-forms are interchanged. 

Choose a specific tensor S, of rank (*) for explicitness. Into the slots of S, insert 
the basis vectors and 1-forms of a specific Lorentz coordinate frame. The output 
is a “component of S in that frame”: 

S**y = S(w a 9 UJ^ e y ). (3.12) 

This defines components. Knowing the components in a specific frame, one can easily 
calculate the output produced from any input forms and vectors: 

S(a, p , v) = S(o a oj a , ppUJ 1 *, v y e y ) = o a p $ v y S(w a 9 a;^, e y ) (3.13) 

= s«e y o a ppvy. 

And knowing the components of S in one Lorentz frame (unprimed), plus the 
Lorentz transformation matrices IIA^II and HA^II which link that frame with 
another (primed), one can calculate the components in the new (primed) frame. As 
shown in exercise 3.2, one need only apply a matrix to each index of S, lining up 
the matrix indices in the logical manner 

( 3 ‘ 14 ) 

A slight change of the internal gears and wheels inside the tensor enables one 
of its 1-form slots to accept a vector. All that is necessary is a mechanism to convert 
an input vector n into its corresponding 1-form n and then to put that 1-form into 
the old machinery. Thus, denoting the modified tensor by the same symbol S as 
was used for the original tensor, one demands 

S(cr, n , v) ■= S(cr, n, v); (3.15) 

or, in component notation 

sa /}y°a nl3 v y = S ap y a a n $ vy. (3.15') 

This is achieved if one raises and lowers the indices of S using the components of 
the metric: 

S a » y = r p S a Py . (3.16) 

(See exercise 3.3 below.) By using the same symbol S for the original tensor and 


Definition of tensor as linear 
machine that converts 
vectors and 1-forms into 
numbers 


Components of a tensor 


Tensor's machine action 
expressed in terms of 
components 


Lorentz transformation of 
components 


Modifying a tensor to accept 
either a vector or a 1-form 
into each slot 


Raising and lowering indices 
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3. THE ELECTROMAGNETIC FIELD 


the modified tensor, one allows each slot to accept either a 1-form or a vector, so 
one loses sight of whether S is a @ tensor, or a (*) tensor, or a Q) tensor, or a (%) 
tensor; one only distinguishes its total rank, 3. Terminology: an “upstairs index” is 
called “contravariant”; a “downstairs” index is called “covariant.” Thus in S a ^ y , 
“a” is a contravariant index, while “/?” and “y” are covariant indices. 

Because tensors are nothing but functions, they can be added (if they have the 
same rank!) and multiplied by numbers in the usual way: the output of the rank-3 
tensor aS + bQ , when vectors u, v, iv are put in, is 

(aS + bQ)(u , v y iv) = aS(u, v , tv) + bQ(u , v , iv). (3.17) 

Several other important operations on tensors are explored in the following exercises. 
They and the results of the exercises will be used freely in the material that follows. 


Exercise 3.2. 


TRANSFORMATION LAW FOR COMPONENTS OF A TENSOR 


From the transformation laws for components of vectors and 1-forms, derive the transforma¬ 
tion law (3.14). 


Exercise 3.3. RAISING AND LOWERING INDICES 

Derive equations (3.16) from equation (3.15') plus the law n a = rj afi n^ for getting the 
components of the 1-form n from the components of its corresponding vector n. 

Exercise 3.4. TENSOR PRODUCT 

Given any two vectors u and v , one defines the second-rank tensor u ® v (“tensor product 
of u with i/-”) to be a machine, with two input slots, whose output is the number 

(u ® v)(a , A) - <a, u)(A, V) (3.18) 

when 1-forms o and A are inserted. Show that the components of T = u <8> v are the products 
of the components of u and v: 

T«e = u a v p t T j _ u y t Ta/} = UaVj3 ' ( 3 . 19 ) 

Extend the definition to several vectors and forms, 

(u ® v <8> 0 ® w)(a, A, n, <") = {a, u){A , v)(P, n){(, w), (3.20) 

and show that the product rule for components still holds: 

S = i/<8>v<8>/?<8>w has components 

Exercise 3.5. BASIS TENSORS 

Show that a tensor M with components M af3 y s in a given Lorentz frame can be reconstructed 
from its components and from the basis 1-forms and vectors of that frame as follows: 

M = M afi y S e a ® ® e 5 . (3.22) 

(For a special case of this, see Box 3.2.) 




Box 3.2 THE METRIC IN DIFFERENT LANGUAGES 


A. Geometric Language 

g is a linear, symmetric machine with two slots for insertion of vectors. When vectors 
u and v are inserted, the output of g is their scalar product: 

g(u, v) = u • v. 


B. Component Language 

7 ] ftv are the metric components. They are used to calculate the scalar product of two 
vectors from components in a specific Lorentz frame: 


u- v = q^uW. 


C. Coordinate-Based Geometric Language 

The metric g can be written, in terms of basis 1-forms of a specific Lorentz frame, 
as 

g — q^w* 0 w v = q^ dx* 0 dx v 
[see equations (2.18) and (3.22)]. 

D. Connection to the Elementary Concept of Line Element 

Box 2.3 demonstrated the correspondence between the gradient df of a function, 
and the elementary concept df of a differential change of / in some unspecified 
direction. There is a similar correspondence between the metric, written as q dx* 
0 dx v , and the elementary concept of “line element,” written as ds 2 = q^ dx* dx v . 
This elementary line element, as expounded in many special relativity texts, repre¬ 
sents the squared length of the displacement “ dx*” in an unspecified direction. The 
metric q v dx* 0 dx v does the same. Pick a specific infinitesimal displacement vector 
f, and insert it into the slots of q^ v dx* 0 dx v . The output will be f 2 = 
the squared length of the displacement. Before f is inserted, q^ v dx* 0 dx v has the 
potential to tell the squared length of any vector; the insertion of f converts potenti¬ 
ality into actuality: the numerical value of ( 2 . 

Because the metric q^ v dx* 0 dx v and the line element ds 2 = q^ v dx* dx v perform 
this same function of representing the squared length of an unspecified infinitesimal 
displacement, there is no conceptual distinction between them. One sometimes uses 
the symbols ds 2 to denote the metric; one sometimes gets pressed and writes it as 
ds 2 = q^ v dx* dx v , omitting the “0”; and one sometimes even gets so pressed as 
to use nonbold characters, so that no notational distinction remains at all between 
metric and elementary line element: 

g = ds 2 — ds 2 = q^ dx* dx 1 ’. 
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The power of the geometric 
view of physics 


Example of electromagnetism 


Transformation law for 
electric and magnetic fields 


Exercise 3.6. Faraday MACHINERY AT WORK 

An observer with 4-velocity u picks out three directions in spacetime that are orthogonal 
and purely spatial (no time part) as seen in his frame. Let e\, e.y, be unit vectors in those 
directions and let them be oriented in a righthanded way (ej • e? X = 4 -1 in three-di¬ 
mensional language). Why do the following relations hold? 

e f u = 0 , a r et = S Jk . 

What vectors are to be inserted in the two slots of the electromagnetic field tensor Faraday 
if one wants to get out the electric field along e- as measured by this observer? What vectors 
must be inserted to get the magnetic field he measures along ef! 


§3.3. THREE-PLUS-ONE VIEW VERSUS GEOMETRIC VIEW 

Great computational and conceptual power resides in Einstein’s geometric view of 
physics. Ideas that seem complex when viewed in the everyday “space-plus-time” 
or 3+1” manner become elegant and simple when viewed as relations between 
geometric objects in four-dimensional spacetime. Derivations that are difficult in 
3 + 1 language simplify in geometric language. 

The electromagnetic field is a good example. In geometric language, it is described 
by a second-rank, antisymmetric tensor (“ 2 -form”) F y which requires no coordinates 
for its definition. This tensor produces a 4-force on any charged particle given by 

dp/dr = ef(u). 

It is all so simple! 

By contrast, consider the “3 + 1 ” viewpoint. In a given Lorentz frame, there is 
an electric field E and a magnetic field B. They push on a particle in accordance 
with 

dp/dt — e(E 4 - v x B). 

But the values of/?, E , v, and B all change when one passes from the given Lorentz 
frame to a new one. For example, the electric and magnetic fields viewed from a 
rocket ship (“barred” frame) are related to those viewed in the laboratory (“un¬ 
barred” frame) by 

^11 = ^ii’ ^-L = ^ (E± + PX B ± \ 

(3.23) 

B " = K B± = 7f=^f - 0 X E ± ). 


(Here “II” means component along direction of rocket’s motion; “_L” means perpen¬ 
dicular component; and ft — dx* rockei /dt is the rocket’s ordinary velocity.) The 
analogous transformation laws for the particle’s momentum p and ordinary velocity 


§3.4. MAXWELL'S EQUATIONS 
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v, and for the coordinate time t, all conspire—as if by magic, it seems, from the 
3 + 1 viewpoint—to maintain the validity of the Lorentz force law in all frames. 

Not only is the geometric view far simpler than the 3 + 1 view, it can even derive 
the 3 + 1 equations with greater ease than can the 3 + 1 view itself. Consider, for 
example, the transformation law (3.23) for the electric and magnetic fields. The 
geometric view derives it as follows: (1) Orient the axes of the two frames so their 
relative motion is in the z-direction. (2) Perform a simple Lorentz transformation 
(equation 2.45) on the components of the electromagnetic field tensor: 

E n = E z = F m = A%A^F afi = y 2 F 30 + /? 2 y 2 F 03 
= (1 - /?V^30 = F 30 = E z = £„, 

E X = F W = A'jAP s F afl = y F 10 + fiy. F 13 = y(E t - pB y ), 
etc. 

By contrast, the 3+1 view shows much more work. A standard approach is based 
on the Lorentz force law and energy-change law (3.2a,b), written in the slightly 
modified form 


d 2 x 


m - 


e ( E r •— + 0 + B r —■ — B, dz 


dt_ 

dr 


dx 

dr 


)■ 


dr 2 ~ dr ' ~ dr ' ~ z dr y dr 
... (three additional equations) 

It proceeds as follows (details omitted because of their great length!): 


(3.25) 


(1) Substitute for the d 2 x/dr 2 , etc., the expression for these quantities in terms 
of the d 2 x/dr 2 ,... (Lorentz transformation). 

(2) Substitute for the d 2 x/dr 2 ,... the expression for these accelerations in terms 
of the laboratory E and B (Lorentz force law). 

(3) In these expressions, wherever the components dx/dj of the 4-velocity in the 
laboratory frame appear, substitute expressions in terms of the 4-velocities 
in the rocket frame (inverse Lorentz transformation). 

(4) In (3.25) as thus transformed, demand equality of left and right sides for all 
values of the dx/dj , etc. (validity for all test particles). 

(5) In this way arrive at the expressions (3.23) for the E and B in terms of the 
E and B. 


The contrast in difficulty is obvious! 


§3.4. MAXWELL'S EQUATIONS 

Turn now from the action of the field on a charge, and ask about the action of a 
charge on the field, or, more generally, ask about the dynamics of the electromagnetic 
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Magnetodynamics derived 
from magnetostatics 


field, charge or no charge. Begin with the simplest of Maxwell’s equations in a specific 
Lorentz frame, the one that says there are no free magnetic poles: 


V • B = div B 


j^x . + ?El 

dx 3 y dz 


= 0 . 


(3.26) 


This statement has to be true in all Lorentz frames. It is therefore true in the rocket 
frame: 


dx dy 




(3.27) 


For an infinitesimal Lorentz transformation in the x-direction (nonrelativistic ve¬ 
locity /?), one has (see Box 2.4 and equations 3.23) 


B x = B x% B y = B v + fiE z , B Z = B Z - fiE y \ 

dx dx dt 9 dy dy ’ dz dz * 


(3.28) 

(3.29) 


Substitute into the condition of zero divergence in the rocket frame. Recover the 
original condition of zero divergence in the laboratory frame, plus the following 
additional information (requirement for the vanishing of the coefficient of the 
arbitrary small velocity /?): 



0 ^, 04 _ 0 ^. 

dt dy dz 


(3.30) 


Had the velocity of transformation been directed in the/- or z-directions, a similar 
equation would have been obtained for dB y /dt or dBJdt . In the language of three- 
dimensional vectors, these three equations reduce to the one equation 


— + V X E = — + curl £ = 0. 
3 1 3 1 


(3-31) 


Magnetodynamics and 
magnetostatics unified in one 
geometric law 


How beautiful that (1) the principle of covariance (laws of physics are the same 
in every Lorentz reference system, which is equivalent to the geometric view of 
physics) plus (2) the principle that magnetic tubes of force never end, gives 
(3) Maxwell’s dynamic law for the time-rate of change of the magnetic field! 
This suggests that the magnetostatic law V • B = 0 and the magnetodynamic law 
dB/dt + V x E — 0 must be wrapped up together in a single frame-independent, 
geometric law. In terms of components of the field tensor F, that geometric law 
must read 

B a p,y *F B/3y,a “b ^ya,/3 = (3.32) 


since this reduces to V * B = 0 when one takes a — 1, /? = 2, y = 3; and it reduces 
to dB/dt + V x E = 0 when one sets any index, e.g., a, equal to zero (see exercise 
3.7 below). In frame-independent geometric language, this law is written (see §3.5, 
exercise 3.14, and Chapter 4 for notation) 
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dF = 0, or, equivalently, V • *F = 0; (3.33) 

and it says, “Take the electromagnetic 2-form F (a geometric object defined even 
in absence of coordinates); from it construct a new geometric object dF (called the 
“exterior derivative of F ”); dF must vanish. The details of this coordinate-free 
process will be spelled out in exercise 3.15 and in §4.5 (track 2). 

Two of Maxwell’s equations remain: the electrostatic equation 


V • E = 47rp, 


and the electrodynamic equation 

dE/dt - V x B = -4 ttJ. 


(3.34) 

(3.35) 


They, like the magnetostatic and magnetodynamic equations, are actually two Electrodynamics and 
different parts of a single geometric law. Written in terms of field components, that electrostatics unified in one 
law says geometric law 

= 477-7", (3.36) 

where the components of the “4-current” J are 
J° = p = charge density, 

(J\ J 2 , J 3 ) = components of current density. (3.37) 

Written in coordinate-free, geometric language, this electrodynamic law says 

d*F = 477 *J or, equivalently, V • F = 4i rJ. (3.38) 


(For full discussion, see exercise 3.15 and §4.5, which is on Track 2.) 


Exercise 3.7. MAXWELL'S EQUATIONS EXERCISE 

Show, by explicit examination of components, that the geometric laws 

E a $ t y + Fpy f a + Eya ,0 = 0 * E afi ,£ = 4 ? 7 J a , 

do reduce to Maxwell’s equations (3.26), (3.31), (3.34), (3.35), as claimed above. 


§3.5 WORKING WITH TENSORS 

Another mathematical digression is needed. Given an arbitrary tensor field, S, of 

arbitrary rank (choose rank = 3 for concreteness), one can construct new tensor Ways to produce new tensors 
fields by a variety of operations. from old: 

One operation is the gradient V. (The symbol d is reserved for gradients of scalars, Gradient 
in which case ▼/= df and for “exterior derivatives of differential forms;” a Track-2 
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Contraction 


Divergence 


concept, on which see §4.5.) Like S, VS is a machine. It has four slots, whereas 
S has three. It describes how S changes from point to point. Specifically, if one 
desires to know how the number S(t/, v , iv) for fixed u , v , iv changes under a 
displacement f, one inserts i/, v , iv, f into the four slots of VS: 

VS(u, w , f) = 0 f S(u, y, iv) with i/, y, iv fixed 

~ + [value of S(i/, v , iv) at tip of f] (3.39) 

— [value of S(i/, y, iv) at tail of f ]. 

In component notation in a Lorentz frame, this says 

VS(u, IV, f) = 

= S a py tt u a v^w'V. 

Thus, the Lorentz-frame components of VS are nothing but the partial derivatives 
of the components of S. Notice that the gradient raises the rank of a tensor by 1 
(from 3 to 4 for S). 

Contraction is another process that produces a new tensor from an old one. It 
seals off (“contracts”) two of the old tensor’s slots, thereby reducing the rank by 
two. Specifically, if R is a fourth-rank tensor and M is obtained by contracting the 
first and third slots of /?, then the output of M is given by (definition!) 

3 

M{u, v) = Yj *(e«, u, w a , v). (3.40) 

a=0 


Here e a and w a are the basis vectors and 1-forms of a specific but arbitrary Lorentz 
coordinate frame. It makes no difference which frame is chosen; the result will always 
be the same (exercise 3.8 below). In terms of components in any Lorentz frame, 
equation (3.40) says (exercise 3.8) 

M(u, v) = M fiV u fl v v = R a fi v u^v\ 

so that 

= W (3.41) 

Thus, in terms of components, contraction amounts to putting one index up and 
the other down, and then summing on them. 

Divergence is a third process for creating new tensors from old. It is accomplished 
by taking the gradient, then contracting the gradient’s slot with one of the original 
slots: 


(divergence of S on first slot) = V * S is a machine such that 

V • S(i/, v) = VS« i/, v, e a ) = S«p yta ufivy; (3.42) 


i.e. V * S has components S a p y a . 
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Transpose is a fourth, rather trivial process for creating new tensors. It merely 
interchanges two slots: 

N obtained by transposing second and third slots of S => 

N(u, v ; iv) = S{u, iv, v). (3.43) 

Symmetrization and antisymmetrization are fifth and sixth processes for producing 
new tensors from old. A tensor is completely symmetric if its output is unaffected 
by an interchange of two input vectors or 1-forms: 

S(i/, v , w) = S(v , u , w) — S(v, iv, u) ~ * • •. (3.44a) 

It is completely antisymmetric if it reverses sign on each interchange of input 

S(i/, v, iv) = — S(v, i/, iv) = +S(v, iv, i/) = • • •. (3.44b) 

Any tensor can be symmetrized or antisymmetrized by constructing an appropriate 
linear combination of it and its transposes; see exercise 3.12. 

Wedge product is a seventh process for producing new tensors from old. It is merely 
an antisymmetrized tensor product: given two vectors u and v , their wedge product, 
the “ bivector ” u A v, is defined by 

u A v = u ® v — v ® i/; (3.45a) 

similarly, the “2-form” a A constructed from two 1-forms is 

a A p =a ® p - p 0 a. (3.45b) 

From three vectors i/, v, w one constructs the “trivector” 

uAvAw = (uAv)Aw = uA(vAw) 

= u 0 v ® iv + terms that guarantee complete antisymmetry 
=:|/®v , ®lv-fv'®lv®t/-flv®l/^8 | v , (3.45c) 

— V®I/®W— l/®W® V— W® V®!/. 

From 1-forms a, /?, y one similarly constructs the “3-forms” a A /3 A y. The wedge 
product gives a simple way to test for coplanarity (linear dependence) of vectors: 
if u and v are collinear, so u — av , then 

uAv = avAv — Q (by antisymmetry of “A”). 

If iv is coplanar with u and v so w = au + bv (“collapsed box”), then 

wAuAv = auAuAv + bvAuAv = 0. 

The symbol “A” is called a “hat” or “wedge” or “exterior product sign.” Its proper¬ 
ties are investigated in Chapter 4. 

Taking the dual is an eighth process for constructing new tensors. It plays a 
fundamental role in Track 2 of this book, but since it is not needed for Track 1, 
its definition and properties are treated only in the exercises (3.14 and 3.15). 


Transpose 

Symmetrization and 
antisymmetrization 


Wedge product 
Bivector 

2-form 

Trivector 


Dual 
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Index gymnastics 


EXERCISES 


Because the frame-independent geometric notation is somewhat ambiguous (which 
slots are being contracted? on which slot is the divergence taken? which slots are 
being transposed?), one often uses component notation to express coordinate-inde¬ 
pendent, geometric relations between geometric objects. For example, 

T — 

means “J is a tensor obtained by taking the divergence on the first slot of the tensor 
SAlso, 


» y = (F„F^y = (F„Fi*) 9/ tf*y 

means “v is a vector obtained by (1) constructing the tensor product F <g> F of F 
with itself, (2) contracting F 0 F on its first and third slots, and also on its second 
and fourth, (3) taking the gradient of the resultant scalar function, (4) converting 
that gradient, which is a 1-form, into the corresponding vector.” 

Index gymnastics,” the technique of extracting the content from geometric 
equations by working in component notation and rearranging indices as required, 
must be mastered if one wishes to do difficult calculations in relativity, special or 
general. Box 3.3 expounds some of the short cuts in index gymnastics, and exercises 
3.8-3.18 offer practice. 


Exercise 3.8. CONTRACTION IS FRAME-INDEPENDENT 

Show that contraction, as defined in equation (3.40), does not depend on which Lorentz 
frame e a and w a are taken from. Also show that equation (3.40) implies 

M{u , v) = R a « v u fi v\ 

Exercise 3.9. DIFFERENTIATION 

(a) Justify the formula 


diu^/di = (duj dr)v v + u fl {dv v /dr). 

by considering the special case /a = 0, v = l. 

(b) Explain why 


Exercise 3.10. MORE DIFFERENTIATION 

(a) Justify the formula, 


d^u^/dj ~ luj^du^/dr), 

by writing out the summation explicitly. 

(b) Let 8 indicate a variation or small change, and justify the formula 

*( V*) = 2 F afi 8F°P. 


(c) Compute (F a pF aP ) u = ? 
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^ Box 3.3 TECHNIQUES OF INDEX GYMNASTICS 


Equation 


Name and Discussion 


S a py — S(w a , Bp, Gy) 

S afi y = S(W a , IV* 1 , By) 


S = S a g y e a ® wP <8> 0/1 
S = S afiy B a ® e B ® e y 


S‘% = n^s^y 

S \y = \^»y 

K = sv 

= S*\M, 

A°- = A°A a 




W = SJ 
'-'a 


Ca _ \!a 

* py — ;v /3,y 


Computing components. 

Computing other components. 

Reconstructing the rank-(J) version of S. 

Reconstructing the rank-® version of S. [Recall: one does 
not usually distinguish between the various versions; see equa¬ 
tion (3.15) and associated discussion.] 

Raising an index. 

Lowering an index. 

Contraction of S to form a new tensor M. 

Tensor product of S with M to form a new tensor T. 

Squared length of vector A produced by forming tensor product 
A ® A and then contracting, which is the same as forming the 
corresponding l-form>t and then piercing: .4 2 = (A, A) = A Q A a . 

The matrix formed from the metric’s “covariant components,” 
lirj a/J ll, is the inverse of that formed from its “contravariant 
components,” Equivalently, raising one index of the 

metric i] a/3 produces the Kronecker delta. 

Gradient of N to form a new tensor 5. 


R p — 


Divergence of N to form a new tensor R. 


N a p, y = = Taking gradients and raising or lowering indices are operations 

that commute. 


N a ^y = 


fi./i 


Contravariant index on a gradient is obtained by raising covari¬ 
ant index. 


( R a M p),y = lx a,Y IV */3 


= R a yM a + RM. 




Gradient of a tensor product; says ▼(/? ® M) = 
Transpose (V/7 <8> M) + R <8> V/W. 


Gap — F[ a p] = i( R ap — F 8a) 

Hap — F{ a p) EE 1/3 + F 8«) 

* / _ THo 

up V — J l ttaPy 


*F aa = iF^e 


tivap 


*D _ 1 

D a — a D E \(iva 


Antisymmetrizing a tensor F to produce a new tensor G. 

Symmetrizing a tensor F to produce a new tensor H. 

Forming the rank-3 tensor that is dual to a vector (exercise 
3.14). 

Forming the antisymmetric rank-2 tensor that is dual to a given 
antisymmetric rank-2 tensor (exercise 3.14). 

Forming the 1-form that is dual to an antisymmetric rank-3 
tensor (exercise 3.14). 
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Exercise 3.11. SYMMETRIES 

Let A m , be an antisymmetric tensor so that A M „ = -A„ u ; and let S*' be a symmetric tensor 
so that S** = S’*. 

(a) Justify the equations A /1I ,S , “' — 0 in two ways: first, by writing out the sum explicitly 
(all sixteen terms) and showing how the terms in the sum cancel in pairs; second, by giving 
an argument to justify each equals sign in the following string: 

A U ,.S“- = -A„ U S^ = —A^S"* = -A ae S“t = -A^ = 0 . 

(b) Establish the following two identities for any arbitrary tensor V^: 

yi “' A u , = J ( V*" ~ = y ( V»' + 


Exercise 3.12. SYMMETRIZATION AND ANTISYMMETRIZATION 

To “symmetrize” a tensor, one averages it with all of its transposes. The components of the 
new, symmetrized tensor are distinguished by round brackets: 


y (ltrt =l(y iiP+ f,„); 

Kuv\) = 37 (+ Kx* + + K u x + V.X, + V Xvii ). 


(3.46) 


One “antisymmetrizes” a tensor (square brackets) similarly: 

V f 1 = — (K — V Y 

1 ju t'j — 2 v i tv v u*' > 

^[fu >M — T* T V\/j.p — 


(3.47) 


(a) Show that such symmetrized and antisymmetrized tensors are, indeed, symmetric and 
antisymmetric under interchange of the vectors inserted into their slots: 

V (a0y) ua v 0 w y = 4 - V( a0 y)V a ufiwy = • • 

= - V[a 0 y]V a uW= .... 

(b) Show that a second-rank tensor can be reconstructed from its symmetric and antisym¬ 
metric parts, 


Vfiv — V(n») + (3.48) 

but that a third-rank tensor cannot; V {a0y) and V [a0y} contain together “less information” 
than V a0y . “Young diagrams” (see, e.g., Messiah [1961], appendix D) describe other symme¬ 
tries, more subtle than these two, which contain the missing information. 

(c) Show that the electromagnetic field tensor satisfies 

F W = 0, F a0 = F [a0] . (3.49a) 

(d) Show that Maxwell’s “magnetic” equations 

F afi,y 4 F 0 y a + F ya 0 = 0 

can be rewritten in the form 


Wl — 


(3.49b) 
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Exercise 3.13. LEV1-C1V1TA TENSOR 

The “Levi-Civita tensor” c in spacetime is a fourth-rank, completely antisymmetric tensor: 

c(n, u, v , w) changes sign when any two of the ^ 

vectors are interchanged. 

Choose an arbitrary but specific Lorentz frame, with e 0 pointing toward the future and with 
e v e 2 , ©3 a righthanded set of spatial basis vectors. The covariant components of e in this 
frame are 


*0123 = c (e 0 . e i> e 2> e 3 ) = + 1 - (3.50b) 

[Note: In an ^-dimensional space, e is the analogous completely antisymmetric rank-« tensor. 
Its components are 


f 12...n — ’ ®n) “ *b 1, 

when computed on a “positively oriented,” orthonormal basis e v ..., e n .] 

(a) Use the antisymmetry to show that 

e aj 8 ys = 0 unless a, /?, y, 5 are all different, 

f+1 for even permutations of 0, 1, 2, 3, and 
^o-rr 1^2773 — j_ j f or permutations. 

(b) Show that 

p’ill r 0 1 2 ^ 3 e 

C 7ro^l l 7T2 , ^3 * 


(3.50c) 

(3.50d) 

(3.50e) 

(3.50f) 


(c) By means of a Lorentz transformation show that e a $ yS and e^yE have these same values 
in any other Lorentz frame with eg pointing toward the future and with e^, e$, e 3 a 
righthanded set. Hint: show that 

= -det|yl\|; (3.50g) 

from yl^yl = 17 , show that det|/U„|= ± 1 ; and verify that the determinant is + I for trans¬ 
formations between frames with e 0 and eg future-pointing, and with e v e 2 , e 3 and e^, e 2 , 
e 3 righthanded. 

(d) What are the components of e in a Lorentz frame with past-pointing eg? with 
lefthanded ej, e 2 , e 3 ? 

(e) From the Levi-Civita tensor, one can construct several “permutation tensors.” In index 
notation: 


Show that: 


u (iv\ — 

5«/8 . —- v ’ 

u (iv — y u fiy\ 9 pf'Xp’ 


Xa =_Lx^j8 — JL Rotf3\ —_L c 

0 H = ° e e iil3\p- 


= 


+ 1 if a/ty is an even permutation of ju.v\, 
— 1 if aft y is an odd permutation of fu>\, 
0 otherwise; 


(3.50h) 

(3.50i) 

(3.50j) 

(3.50k) 
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1 +1 if «/? is an even permutation of pv, 

- 1 if aft is an odd permutation of pv, (3.501) 

0 otherwise; 

Ka _ f + 1 if « = 

M ~ 1 0 otherwise. (3.50m) 

Exercise 3.14. DUALS 

From any vector*/, any second-rank antisymmetric tensor F(F afi = F [a$] ), and any third-rank 
antisymmetric tensor B(B afiy = B [afiy] ), one can construct new tensors defined by 

V aj8y = ^ Wy , •F a „ = lF‘“’ w *B a = ±B^t XllPa . (3.51) 

One calls *J the “dual” of */, *F the dual of F, and *B the dual of B. [A previous and 
entirely distinct use of the word “dual” (§2.7) called a set of basis one-forms {«/“} dual 
to a set of basis vectors (e a ) if {u> a , = 8 a Fortunately there are no grounds for 

confusion between the two types of duality. One relates sets of vectors to sets of 1-forms. 
The other relates antisymmetric tensors of rank p to antisymmetric tensors of rank 4 — p.] 
(a) Show that 

”J = * "B = S. (3.52) 

so (aside from sign) one can recover any completely antisymmetric tensor H from its dual 
*H by taking the dual once again, ••##. This shows that H and *H contain precisely the 
same information. 

(b) Make explicit this fact of same-information-content by writing out the components 
m A afiy in terms of A a , also *F a ' 3 in terms of F a &, also *B a in terms of B a ^ y . 

Exercise 3.15. GEOMETRIC VERSIONS OF MAXWELL EQUATIONS 

Show that, if F is the electromagnetic field tensor, then ▼ • *F — 0 is a geometric frame-in¬ 
dependent version of the Maxwell equations 

F a0,y + F $y,a + Fya,$ = 

Similarly show that ▼ • F = 4t tJ (divergence on second slot of F) is a geometric version 
of F a * tfi = 4t tJ«. 

Exercise 3.16. CHARGE CONSERVATION 

From Maxwell’s equations F a & ^ = 47r/ a , derive the “equation of charge conservation” 

J a , a = 0. (3.53) 

Show that this equation does, indeed, correspond to conservation of charge. It will be studied 
further in Chapter 5. 

Exercise 3.17. VECTOR POTENTIAL 

The vector potential A of electromagnetic theory generates the electromagnetic field tensor 
via the geometric equation 


F = —(antisymmetric part of VA), 


(3.54) 


i.e., 


F — A —A 

* (IP ^P,H 


(3.54') 
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(a) Show that the electric and magnetic fields in a specific Lorentz frame are given by 

B = VxA , E = —dA/dt — VA°. (3.55) 

(b) Show that F will satisfy Maxwell’s equations if and only if A satisfies 

A a ’» f/i - A»y a = -4 7rJ a . (3.56) 

(c) Show that “gauge transformations” 

^new = ^OLD + <t> — arbitrary function, (3.57) 

leave F unaffected. 

(d) Show that one can adjust the gauge so that 

▼ •>1=0 (“Lorentz gauge”), (3.58a) 

QA = -4t tJ. (3.58b) 

Here □ is the wave operator (“d’Alembertian”): 

UA = A a * M e a . (3.59) 

Exercise 3.18. DIVERGENCE OF ELECTROMAGNETIC 
STRESS-ENERGY TENSOR 


From an electromagnetic field tensor F, one constructs a second-rank, symmetric tensor T 
(“stress-energy tensor,” to be studied in Chapter 5) as follows: 

T" = 'h{ F '‘ aF ’ a ~ J r ’ F «e Fal1 ) (3.60) 

As an exercise in index gymnastics: 

(a) Show that ▼ • T has components 

+ F>iaF \’ -yV^]' (3.61) 

(b) Manipulate this expression into the form 

Vs = ^ [- VV - j Fa0 ( F «^ + F ,a, P + />,.„)](3.62) 

note that the first term of (3.62) arises directly from the second term of (3.61). 

(c) Use Maxwell’s equations to conclude that 

T*’, = — F* a J a . 


(3.63) 



CHAPTER 


ELECTROMAGNETISM AND 
DIFFERENTIAL FORMS 


The ether trembled at his agitations 
In a manner so familiar that / only need to say. 
In accordance with Clerk Maxwell's six equations 
It tickled peoples' op tic s far away. 
You can feel the way it's done. 

You may trace them as they run — 
dy by dy less dp by dz is equal KdX/dt . . . 

While the curl of (X, Y, Z) is the 
minus d/dt of the vector (a, b, c): 

From The Revolution of the Corpuscle, 
written by A. A. Robb 
(to the tune of The Interfering Parrott) 
for a dinner of the research students 
of the Cavendish Laboratory 
in the days of the old mathematics. 



§4.1. EXTERIOR CALCULUS 

Stacks of surfaces, individually or intersecting to make “honeycombs,” “egg crates,” 
and other such structures (“differential forms”), give unique insight into the geometry 
of electromagnetism and gravitation. However, such insight comes at some cost in 
time. Therefore, most readers should skip this chapter and later material that depends 
on it during a first reading of this book. 

Analytically speaking, differential forms are completely antisymmetric tensors; 
pictorially speaking, they are intersecting stacks of surfaces. The mathematical 
formalism for manipulating differential forms with ease, called “exterior calculus,” 
is summarized concisely in Box 4.1; its basic features are illustrated in the rest of 
this chapter by rewriting electromagnetic theory in its language. An effective way 
to tackle this chapter might be to (1) scan Box 4.1 to get the flavor of the formalism; 
(2) read the rest of the chapter in detail; (3) restudy Box 4.1 carefully; (4) get practice 
in manipulating the formalism by working the exercises.* 

_ (continued on page 99) 

* Exterior calculus is. treated in greater detail than here by: £. Cartan (1945); de Rham (1955); 
Nickerson, Spencer, and Steenrod (1959); Hauser (1970); Israel (1970); especially Flanders (1963, 
relatively easy, with many applications); Spivak (1965, sophomore or junior level, but fully in tune with 
modem mathematics); H. Cartan (1970); and Choquet-Bruhat (1968a). 
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/-N 

Box 4.1 DIFFERENTIAL FORMS AND 

EXTERIOR CALCULUS IN BRIEF 

The fundamental definitions and formulas of exterior calculus are summarized here 
for ready reference. Each item consists of a general statement (at left of page) plus 
a leading application (at right of page). This formalism is applicable not only to 
spacetime, but also to more general geometrical systems (see heading of each section). 

No attempt is made here to demonstrate the internal consistency of the formalism, 
nor to derive it from any set of definitions and axioms. For a systematic treatment 
that does so, see, e.g., Spivak (1965), or Misner and Wheeler (1957). 


A. Algebra I (applicable to any vector space) 

1. Basis 1-forms. 

a. Coordinate basis = dx> 

(j tells which 1-form, not which component). 

b. General basis w j = V v dx k \ 

An application 

Simple basis 1-forms for analyzing Schwarzschild ge¬ 
ometry around static spherically symmetric center of 
attraction: 

W° = (1 — 2 m/r) l/2 dt; 

W 1 = (1 — 2 m/r)~ 1/2 dr; 

at 2 = r dd; 

at 3 = r sin 9 d<b. 

2. General pform (or p -vector) is a completely anti¬ 
symmetric tensor of rank (£) [or (g)]. It can be 
expanded in terms of wedge products (see §3.5 and 
exercise 4.12): 

a — A cu i2 A • • • A cu ip 

= a \iii2..A P \ wil A cu i2 A ■ • • A w ip . 

(Note: Vertical bars around the indices mean sum¬ 
mation extends only over i l < i 2 < • • • < i p .) 

Two applications 

Energy-momentum 1-form is of type a — a i a/ i or 

p = —£<#/+ p x dx + p y dy + p z dz . 

Faraday is a 2-form of type ~ A w v or in 

flat spacetime 

F = —E x dt A dx - E y dt A dy - E z dt A dz 
+ B x dy A dz + B y dz A dx + B z dx A dy 
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Box 4.1 (continued) 


3. Wedge product . 

All familiar rules of addition and multiplication 
hold, such as 


(aa + bp) A y = aa A y 4- bp A y, 

(a A /?) A / = a A (/? A y) = a A P A y, 

except for a modified commutation law between 
a /7-form a and a ^-form p : 

a A p = {-\) vq P A a. 

P v V P 

Applications to 1-forms a,p: 

a A p = ~p A a, a A a = 0; 
a A P — (cLjUji) A (fi k uj k ) — a^u)* A cu k 

= J ( a jPk - Ppkpu* A "*• 


4. Contraction of p-form on p-vector. 

(a,A) 

' p p' 

= a |(i A ■ • • A a/ 4 *, A 


A *i„> 




[= Sj^jp (see exercises 3.13 and 4.12)] 


Fowr applications 

a. Contraction of a particle’s energy-momentum 1-form 
p = p a uj a with 4-velocity u = u a e a of observer (a 
1-vector) ; 

— <P, u) = ~p a u a = energy of particle. 

b. Contraction of Faraday 2-form F with bivector 
89 A A9 [where 89 = (d9/d\ l )A\ 1 and A9 ~ 
(d9/d\ 2 )A\ 2 are two infinitesimal vectors in a 2-sur¬ 
face 9(X V A 2 ), and the bivector represents the surface 
element they span] is the magnetic flux 0 = < F,\ 89 
A A9) through that surface element. 

More generally, a /7-dimensional parallelepiped with 


c. 


vectors a v a 2 ,..., a p for legs has an oriented volume 
described by the ^/w/?/e”/7-vector a 1 A a 2 A • ■ • a p 
(oriented because interchange of two legs changes its 
sign). An egg-crate type of structure with walls made 
from the hyperplanes of p different 1-forms a 1 , 
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cr 2 , ...,a p is described by the “simple” /?-form cr 1 
A cr 2 A * * • A cr p . The number of cells of cr 1 A 
a 2 A • * ■ A a p sliced through by the infinitesimal 
/7-volume a x A a 2 A • • ■ A a p is 

(cr 1 A a 2 A • • ♦ A a p , A a 2 A • • • A s p >. 

d. The Jacobian determinant of a set of p functions 
/ fc (x\ ..., x n ) with respect to p of their arguments 
is 

(fA f A...A e fA|A...A|) 

= d \\(ML\\\ = d SJhIh • ■ ’/ p > 

ll\ 0X J * /II ^(x 1 , X 2 , ... , x p ) * 

5. Simple forms . 

a. A simple /7-form is one that can be written as 
a wedge product of p 1-forms: 

a = a A p A • • • Ay. 

p factors 

b. A simple /?-form a A p A • • • A y is repre¬ 

sented by the intersecting families of surfaces 
of a, y (egg-crate structure) plus a sense 

of circulation (orientation). 

Applications: 

a. In four dimensions (e.g., spacetime) all 0-forms, 1- 
forms, 3-forms, and 4-forms are simple. A 2-form F 
is generally a sum of two simple forms, e.g., F = 
— edt A dx + h dy A dz\ it is simple if and only if 
F A F = 0. 

b. A set of 1-forms a , p, ..., y is linearly dependent 
(one a linear combination of the others) if and 
only if 

a A p A • • ♦ Ay = 0 (egg crate collapsed). 



B. Exterior Derivative (applicable to any "differentiable manifold/' 
with or without metric) 


1. d produces a (p + l)-form da from a /?-form a. 

2. Effect of d is defined by induction using the 
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Box 4.1 (continued) 

(Chapter 2) definition of df and / a function (0- 
form), plus 

d(a A %) = da A p + (- l)*<ar A d/?, 
d 2 = dd = 0. 

7h<9 applications 

d(a A <//?) — da A dp. 

For the /?-form <p, with 

<P = dx* 1 A • • • A dx 4 *>, 
one has (alternative and equivalent definition of d<p) 

d<P = A dx* 1 A • •. A dx*>. 

C. Integration (applicable to any "differentiable manifold," with or 
without metric) 

1. Pictorial interpretation. 

Text and pictures of Chapter 4 interpret fa (inte¬ 
gral of specified 1-form a along specified curve 
from specified starting point to specified end point) 
as “number of a-surfaces pierced on that route”; 
similarly, they interpret fcp (integral of specified 
2-form cp over specified bit of surface on which 
there is an assigned sense of circulation or “orien¬ 
tation”) as “number of cells of the honeycomb-like 
structure cp cut through by that surface”; similarly 
for the egg-crate-like structures that represent 3- 
forms; etc. 

2. Computational rules for integration . 

To evaluate fa , the integral of a /7-form 

a = • • •, x n ) dx' 1 A • • • A dx*’, 

over a /?-dimensional surface, proceed in two steps, 
a. Substitute a parameterization of the surface, 

into a , and collect terms in the form 

a = a(\>) dX 1 A ... A d\* 

(this is a viewed as a /7-form in the /7-dimen¬ 
sional surface); 
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b. Integrate 

J a = J a(X’) d \ 1 dX 2 ... d\ p 
using elementary definition of integration. 

Example: See equations (4.12) to (4.14). 

3. The differential geometry of integration . 

Calculate fa for a /?-form a as follows. 

a. Choose the /?-dimensional surface S over which 
to integrate. 

b. Represent S by a parametrization giving the 
generic point of the surface as a function of the 
parameters, ^(A 1 , A 2 ,... A p ). This fixes the ori¬ 
entation. The same function with A 1 ^ A 2 , 

^(A 2 , A 1 ,..., A p ), describes a different (i.e., op¬ 
positely oriented) surface, — S. 

c. The infinitesimal parallelepiped 



is tangent to the surface. The number of cells 
of a it slices is 


(»>§ a - a &) 

This number changes sign if two of the vectors 
d*P/dX k are interchanged, as for an oppositely 
oriented surface. 

d. The above provides an interpretation motivat¬ 
ing the definition 

1 > 

dX 1 dX 2 ... dX p . 


This definition is identified with the computa¬ 
tional rule of the preceding section (C.2) in 
exercise 4.9. 

An application 

Integrate a gradient df along a curve, A) from ??(0) 
to sP(l): 


f df= f 1 (df dd>/dX) dX= f 1 (df/dX) dX 
•' •'o •'o 

=/mi)] -nmi 


e. Three different uses for symbol “rf”: First , light- 
face d in explicit derivative expressions such as 
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Box 4.1 (continued) 

d/da , or df/da , or dtf/da^ neither numerator nor 
denominator alone has any meaning, but only 
the full string of symbols. Second ' lightface d 
inside an integral sign; e.g., J f da. This is an 
instruction to perform integration, and has no 
meaning whatsoever without an integral sign; 

lives as an indivisible unit. Third, 
sans-serif d\ e.g., d alone, or df\ or da. This is 
an exterior derivative, which converts a p- form 
into a (p + l)-form. Sometimes lightface d is 
used for the same purpose. Hence, d alone, or 
df,\ or dx, is always an exterior derivative unless 
coupled to an f sign (. second use), or coupled 
to a / sign (first use). 

4. The generalized Stokes theorem (see Box 4.6). 

a. Let 3T* be the closed /7-dimensional boundary 
of a (p + l)-dimensional surface C V. Let a be 
a /?-form defined throughout L V. 

Then 


f da = 

f ° 

J r 

r 


[integral of /7-form a over boundary d L Y equals 
integral of (p + l)-form da over interior T]. 

b. For the sign to come out right, orientations of 
T* and 3T* must agree in this sense: choose 
coordinates y°, y 1 ,... ,y p on a portion of T, 
with y° specialized soj/° < 0 in T, and y° = 0 
at the boundary 3V; then the orientation 


••• 

dy° ay ay 


for T’ demands the orientation 


|^A ... A^- 

for 3TT. 

c. Note: For a nonorientable surface, such as a 
Mobius strip, where a consistent and continuous 
choice of orientation is impossible, more intri¬ 
cate mathematics is required to give a definition 
of “3” for which the Stokes theorem holds. 

Applications: Includes as special cases all integral theo¬ 
rems for surfaces of arbitrary dimension in spaces of 
arbitrary dimension, with or without metric, generaliz- 


j 
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ing all versions of theorems of Stokes and Gauss. Exam- 
pies: 

a. T a curve, 0T its endpoints, a — f a 0-form (func¬ 
tion): 

f df= f\df/dX)dX= [ f — f{\) —/(0). 

J r J o dr 

b. T a 2-surface in 3-space, 9T its closed-curve bound¬ 
ary, v a 1-form; translated into Euclidean vector 
notation, the two integrals are 

f dv= f (Vxr)-c/S;f v=f vdl 

c. Other applications in §§5.8, 20.2, 20.3, 20.5, and 
exercises 4.10, 4.11, 5.2, and below. 

D. Algebra II (applicable to any vector space with metric) 

1. Norm of a p-form. 

Hall 2 = a lil ...ip l a(il *'' <p - 

Two applications: Norm of a 1-form equals its squared 
length, Hall 2 — a • a. Norm of electromagnetic 2-form 
or Faraday: IIFII 2 — B 2 — E 2 . 

2. Dual of a pform. 

a. In an ^-dimensional space, the dual of a /7-form 
a is the (n — /7)-form *a, with components 

(* a )fci.Jc„_p “ 1 *" P * C <1 ..Ap fci ...kn-p' 

b. Properties of duals: 

**a = (—l) p_1 a in spacetime; 
a A *a = \\a\\ 2 c in general. 

c. Note: the definition of e (exercise 3.13) entails 
choosing an orientation of the space, i.e., decid¬ 
ing which orthonormal bases (1) are “right- 
handed” and thus (2) have e(e v ..., e n ) = +1. 

Applications 

a. For / a 0-form, */ = fc, and jy#(volume) = f *f 

b. Dual of charge-current 1-form J is charge-current 
3-form *J. The total charge Q in a 3-dimensional 
hypersurface region S is 

Q(S) = / *J 
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Conservation of charge is stated locally by d*J = 0. 
Stokes' Theorem goes from this differential conserva¬ 
tion law to the integral conservation law, 

0 = f d*J = f *J. 

•V ‘W 

This law is of most interest when 0T = S 2 — S x con¬ 
sists of the future S 2 and past S 1 boundaries of a 
spacetime region, in which case it states Q(S 2 ) = 

C(<S 1 ); see exercise 5.2. 

c. Dual of electromagnetic field tensor F = Faraday is 
*F — Maxwell. From the d*F — 4tt *J Maxwell 
equation, find 4 *Q = 4 vf s *J = f s d*F = J as *F. 

3. Simple forms revisited. 

a. The dual of a simple form is simple. 

b. Egg crate of *o is perpendicular to egg crate 
of a — a A 0 A • • * A jj in this sense: 

(1) pick any vector V lying in intersection of 
surfaces of a 

«a, 10 = </*,IO= =<//, ^>=0); 

(2) pick any vector W lying in intersection of 
surfaces of *a; 

(3) then V and W are necessarily perpendicu¬ 
lar: V- w — 0. 


C Box 4.1 (continued) 


Example: a — 3 dt is a simple 1-form in spacetime. 

a. *o — — 3 dx A dy A dz is a simple 3-form. 

b. General vector in surfaces of a is 

V = V x e x + V v e y + V z e z . 

c. General vector in intersection of surfaces of *a is 

W = W t e v 



j 


d. W- V = 0. 
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§4.2. ELECTROMAGNETIC 2-FORM AND LORENTZ FORCE 

The electromagnetic field tensor, Faraday — F ' is an antisymmetric second-rank 
tensor (i.e., 2-form). Instead of expanding it in terms of the tensor products of basis 
1-forms, 

F = F a(i dx a <8> dx&. 


the exterior calculus prefers to expand in terms of antisymmetrized tensor products 
(“exterior products/ 9 exercise 4.1): 


F = \ F afi dx a A dxfi, (4.1) 

dx a A dx p = dx a ® dx 13 - dx p <8> dx a . (4.2) 

Any 2-form (antisymmetric, second-rank tensor) can be so expanded. The symbol 
“A” is variously called a “wedge,” a “hat,” or an “exterior product sign”; and 
dx a A dx 13 are the “basis 2-forms” of a given Lorentz frame (see §3.5, exercise 3.12, 
and Box 4.1). 

There is no simpler way to illustrate this 2-form representation of the electromag¬ 
netic field than to consider a magnetic field in the x-direction: 


F = —F = B 

yz x zy "jr* 

F ~ B x dy A dz. 


(4.3) 


The 1-form dy = grad y is the set of surfaces (actually hypersurfaces) y = 18 (all 
U x, z),y =19 (all t, x, z),y = 20 (all /, x, z), etc.; and surfaces uniformly interpolated 
between them. Similarly for the 1-form dz . The intersection between these two sets 
of surfaces produces a honeycomb-like structure. That structure becomes a "2-form” 
when it is supplemented by instructions (see arrows in Figure 4.1) that give a "sense 
of circulation” to each tube of the honeycomb (order of factors in the “wedge 
product’ of equation 4.2; dy A dz = —dz A dy). The 2-form F in the example 
differs from this "basis 2-form” dy A dz only in this respect, that where dy A dz 
had one tube, the field 2-form has B x tubes. 

When one considers a tubular structure that twists and turns on its way through 
spacetime, one must have more components to describe it. The 2-form for the general 
electromagnetic field can be written as 


F = E x dx A dt H- E y dy A dt + E z dz A dt + B x dy A dz 

4- B y dz A dx + B z dx A dy (4.4) 

(6 components, 6 basis 2-forms). 

A 1-form is a machine to produce a number out of a vector (bongs of a bell as 
the vector pierces successive surfaces). A 2-form is a machine to produce a number 
out of an oriented surface (surface with a sense of circulation indicated on it: Figure 
4.1, lower right). The meaning is as clear here as it is in elementary magnetism: 


Electromagnetic 2-form 
expressed in terms of exterior 
products 


A 2-form as a honeycomb of 
tubes with a sense of 
circulation 


A 2-form as a machine to 
produce a number out of an 
oriented surface 
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Figure 4.1. 

Construction of the 2-form for the electromagnetic field F — B s dy A dz out of the 1-forms dy and 
dz by “wedge multiplication” (formation of honeycomb-like structure with sense of circulation indicated 
by arrows). A 2-form is a “machine to construct a number out of an oriented surface” (illustrated by 
sample surface enclosed by arrows at lower right; number of tubes intersected by this surface is 

/ *= 18 ; 

(this surface) 

Faraday’s concept of “magnetic flux”). This idea of 2-form machinery can be connected to the “tensor- 
as-machine” idea of Chapter 3 as follows. The shape of the oriented surface over which one integrates 
F does not matter, for small surfaces. All that affects JF is the area of the surface, and its orientation. 
Choose two vectors, u and v, that lie in the surface. They form two legs of a parallelogram, whose 
orientation (i# followed by i*) and area are embodied in the exterior product u A v. Adjust the lengths 
of u and v so their parallelogram, u A v, has the same area as the surface of integration. Then 




u A v 


machinery ideal 
of this chapterj 


[jmachinery idea 
L °f Chapter 3 


Exercise: derive this result, for an infinitesimal surface u A v and for general F y using the formalism 
of Box 4.1. 
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the number of Faraday tubes cut by that surface. The electromagnetic 2-form F 
or Faraday described by such a “tubular structure” (suitably abstracted; Box 4.2) 
has a reality and a location in space that is independent of all coordinate systems 
and all artificial distinctions between “electric” and “magnetic” fields. Moreover, 
those tubes provide the most direct geometric representation that anyone has ever 
been able to give for the machinery by which the electromagnetic field acts on a 
charged particle. Take a particle of charge e and 4-velocity 



(4.5) 


Let this particle go through a region where the electromagnetic field is described 
by the 2-form 


F = B t dy A dz (4.6) 

of Figure 4.1. Then the force exerted on the particle (regarded as a 1-form) is the 
contraction of this 2-form with the 4-velocity (and the charge); 

P = dp/dr = eF(u) = e(F, u), (4.7) 

as one sees by direct evaluation, letting the two factors in the 2-form act in turn 
on the tangent vector u : 

p = eB x (dy A dz, u) 

= eB x {dy{dz, u > - dz{dy, u>} 

= eB x {dy{dz, u z e z ) - dz(dy, u v e y )} 

or 


Pa dx “ = eB x uZ d y - eB x U v dz. (4.8) 

Comparing coefficients of the separate basis 1-forms on the two sides of this equa¬ 
tion, one sees reproduced all the detail of the Lorentz force exerted by the magnetic 
field By. ' " 


Pv 



dz 

IP' 


Pz 


d Pz 

dr 



(4.9) 


By simple extension of this line of reasoning to the general electromagnetic field, 
one concludes that the time-rate of change of momentum (1-form ) is equal to the charge 
multiplied by the contraction of the Faraday with the 4-velocity. Figure 4.2 illustrates 
pictorially how the 2-form, F, serves as a machine to produce the 1-form, p, out 
of the tangent vector, eu. 


Lorentz force as contraction 
of electromagnetic 2-form 
with particle's 4-velocity 


(continued on page 105) 
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Box 4.2 ABSTRACTING A 2-FORM FROM THE CONCEPT OF "HONEYCOMB¬ 
LIKE STRUCTURE," IN 3-SPACE AND IN SPACETIME 


Open up a cardboard carton containing a dozen 
bottles, and observe the honeycomb structure of 
intersecting north-south and east-west cardboard 
separators between the bottles. That honeycomb 
structure of “tubes” (“channels for bottles”) is a 
fairly apt illustration of a 2-form in the context 
of everyday 3-space. It yields a number (number 
of tubes cut) for each choice of smooth element 
of 2-surface slicing through the three-dimensional 
structure. However, the intersecting cardboard 
separators are rather too specific. All that a true 
2-form can ever give is the number of tubes sliced 
through, not the “shape” of the tubes. Slew the 
carton around on the floor by 45°. Then half the 
separators run NW-SE and the other half run 
NE-SW, but through a given bit of 2-surface fixed 
in 3-space the count of tubes is unchanged. There¬ 
fore, one should be careful to make the concept 
of tubes in the mind’s eye abstract enough that 
one envisages direction of tubes (vertical in the 
example) and density of tubes, but not any specific 
location or orientation for the tube walls. Thus all 
the following representations give one and the 
same 2-form, a: 

a = B dx A dy\ 

a = B(2 dx) A (y dy} 

(NS cardboards spaced twice as close as before; 

EW cardboards spaced twice as wide as before); 



(cardboards rotated through 45°); 


_ a dx + /? dy ydxj- Sdy_ 
(aS — (aS — fiy ) l/2 

(both orientation and spacing of “cardboards” 
changing from point to point, with all four 


functions, «, /?, y, and 5, depending on 
position). 


What has physical reality, and constitutes the real 
geometric object, is not any one of the 1-forms just 
encountered individually, but only the 2-form a 
itself. This circumstance helps to explain why in 
the physical literature one sometimes refers to 
“tubes of force” and sometimes to “lines of force.” 
The two terms for the same structure have this in 
common, that each yields a number when sliced 
by a bit of surface. The line-of-force picture has 
the advantage of not imposing on the mind any 
specific structure of “sheets of cardboard”; that is, 
any specific decomposition of the 2-form into the 
product of 1-forms. However, that very feature is 
also a disadvantage, for in a calculation one often 
finds it useful to have a well-defined representa¬ 
tion of the 2-form as the wedge product of 1-forms. 
Moreover, the tube picture, abstract though it 
must be if it is to be truthful, also has this advan¬ 
tage, that the orientation of the elementary tubes 
(sense of circulation as indicated by arrows in 
Figures 4.1 and 4.5, for example) lends itself to 
ready visualization. Let the “walls” of the tubes 
therefore remain in all pictures drawn in this book 
as a reminder that 2-forms can be built out of 
1-forms; but let it be understood here and here¬ 
after how manyfold are the options for the indi¬ 
vidual 1-forms! 

Turn now from three dimensions to four, and 
find that the concept of “honeycomb-like struc¬ 
ture” must be made still more abstract. In three 
dimensions the arbitrariness of the decomposition 
of the 2-form into 1-forms showed in the slant and 
packing of the “cardboards,” but had no effect on 
the vertically of the “channels for the bottles” 
(“direction of Faraday lines of force or tubes of 


V 
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force”); not so in four dimensions, or at least not 
in the generic case in four dimensions. 

In special cases, the story is almost as simple 
in four dimensions as in three. An example of a 
special case is once again the 2-form a — B dx 
A dy, with all the options for decomposition into 
1-forms that have already been mentioned, but 
with every option giving the same “direction” for 
the tubes. If the word “direction” now rises in 
status from “tube walls unpierced by motion in 
the direction of increasing z” to “tube walls un¬ 
pierced either by motion in the direction of in¬ 
creasing z, or by motion in the direction of in¬ 
creasing t , or by any linear combination of such 
motions,” that is a natural enough consequence of 
adding the new dimension. Moreover, the same 
simplicity prevails for an electromagnetic plane 
wave. For example, let the wave be advancing in 
the z-direction, and let the electric polarization 
point in the x-direction; then for a monochromatic 
wave, one has 

E x = B v = E 0 cos w(z - t) - -F 01 = E 3v 

and all components distinct from these equal zero. 

Faraday is 

F = F 0l dt A dx + F 31 dz A dx 
= E 0 cos c o(z — t) d(z — t) A dx , 

which is again representable as a single wedge 
product of two 1-forms. 

Not so in general! The general 2-form in four 
dimensions consists of six distinct wedge products, 

F = F 01 dt A dx + F 02 dt A dy 4- 

+ F 23 dy A dz. 

It is too much to hope that this expression will 
reduce in the generic case to a single wedge prod¬ 
uct of two 1-forms (“simple” 2-form). It is not even 


true that it will. It is only remarkable that it can 
be reduced from six exterior products to two (de¬ 
tails in exercise 4.1); thus, 

F = /7 1 A f 1 + n 2 A f 2 . 

Each product n l A ( 1 individually can be visual¬ 
ized as a honeycomb-like structure like those de¬ 
picted in Figures 4.1, 4.2, 4.4, and 4.5. Each such 
structure individually can be pictured as built out 
of intersecting sheets (1-forms), but with such de¬ 
tails as the tilt and packing of these 1-forms ab¬ 
stracted away. Each such structure individually 
gives a number when sliced by an element of 
surface. What counts for the 2-form F, however, 
is neither the number of tubes of n 1 A f 1 cut by 
the surface, nor the number of tubes of n 2 A f 2 
cut by the surface, but only the sum of the two. 
This sum is what is referred to in the text as the 
“number of tubes of F” cut by the surface. The 
contribution of either wedge product individually 
is not well-defined, for a simple reason: the de¬ 
composition of a six-wedge-product object into 
two wedge products, miraculous though it seems, 
is actually far from unique (details in exercise 4.2). 

In keeping with the need to have two products 
of 1-forms to represent the general 2-form note 
that the vanishing of dF (“no magnetic charges”) 
does not automatically imply that d(n l A f 1 ) or 
d(n 2 A f 2 ) separately vanish. Note also that any 
spacelike slice through the general 2-form F (re¬ 
duction from four dimensions to three) can always 
be represented in terms of a honeycomb-like 
structure (“simple” 2-form in three dimensions; 
Faraday’s picture of magnetic tubes of force). 

Despite the abstraction that has gone on in see¬ 
ing in all generality what a 2-form is, there is no 
bar to continuing to use the term “honeycomb-like 
structure” in a broadened sense to describe this 
object; and that is the practice here and hereafter. 



Figure 4,2. 

The Faraday or 2-form F of the electromagnetic field is a machine to produce a 1-form (the time-rate 
of change of momentum j> of a charged particle) out of a tangent vector (product of charge e of the 
particle and its 4-velocity u ). In spacetime the general 2-form is the “superposition” (see Box 4.2) of 
two structures like that illustrated at the top of this diagram, the tubes of the first being tilted and packed 
as indicated, the tubes of the second being tilted in another direction and having a different pack¬ 
ing density. 
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§4.3. FORMS ILLUMINATE ELECTROMAGNETISM, AND 
ELECTROMAGNETISM ILLUMINATES FORMS 

All electromagnetism allows itself to be summarized in the language of 2-forms, 
honeycomb-like “structures” (again in the abstract sense of “structure” of Box 4.2) 
of tubes filling all spacetime, as well when spacetime is curved as when it is flat. 
In brief, there are two such structures, one Faraday - F, the other Maxwell — *F , 
each dual (“perpendicular,” the only place where metric need enter the discussion) 
to the other, each satisfying an elementary equation: 


dF = 0 


(4.10) 


(“no tubes of Faraday ever end”) and 


d*F = 4ct *J 


(4.11) 


(“the number of tubes of Maxwell that end in an elementary volume is equal to 
the amount of electric charge in that volume”). To see in more detail how this 
machinery shows up in action, look in turn at: (1) the definition of a 2-form; (2) 
the appearance of a given electromagnetic field as Faraday and as Maxwell ; (3) 
the Maxwell structure for a point-charge at rest; (4) the same for a point-charge 
in motion; (5) the nature of the field of a charge that moves uniformly except during 
a brief instant of acceleration; (6) the Faraday structure for the field of an oscillating 
dipole; (7) the concept of exterior derivative; (8) Maxwell’s equations in the language 
of forms; and (9) the solution of Maxwell’s equations in flat spacetime, using a 1-form 
A from which the Li6nard-Wiechert 2-form F can be calculated via F = dA. 

A 2-form, as illustrated in Figure 4.1, is a machine to construct a number (“net 
number of tubes cut”) out of any “oriented 2-surface” (2-surface with “sense of 
circulation” marked on it): 


Preview of key points in 
electromagnetism 


A 2-form as machine for 
number of tubes cut 



(4.12) 


For example, let the 2-form be the one illustrated in Figure 4.1 

F = B x dy A dz, 


Number of tubes cut 
calculated in one example 


and let the surface of integration be the portion of the surface of the 2-sphere 
x 2 +/ + : 2 = a 2 , t = constant, bounded between 9 - 70° and 9 = 110° and 
between <p = 0° and <p = 90° (“Atlantic region of the tropics”). Write 


y = a sin 9 sin <p, 
z — a cos 9, 

dy = a (cos 9 sin <p tld + sin 9 cos <p d<p), 
dz = — a sin 9 d9 , 
dy A dz — a 2 sin 2 # cos <p d9 A dcp. 


(4.13) 
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Figure 4.3. 

Spacelike slices through Faraday , the electromagnetic 2-form, a geometric object, a honeycomb of tubes 
that pervades all spacetime (“honeycomb” in the abstract sense spelled out more precisely in Box 4.2). 
The surfaces in the drawing do not look like a 2-form (honeycomb), because the second family of surfaces 
making up the honeycomb extends in the spatial direction that is suppressed from the drawing. Diagram 
A shows one spacelike slice through the 2-form (time increases upwards in the diagram). In diagram 
B, a projection of the 2-form on this spacelike hypersurface gives the Faraday tubes of magnetic force 
in this three-dimensional geometry (if the suppressed dimension were restored, the tubes would be tubes, 
not channels between lines). Diagram C shows another spacelike slice (hypersurface of simultaneity for 
an observer in a different Lorentz frame). Diagram D shows the very different pattern of magnetic tubes 
in this reference system. The demand that magnetic tubes of force shall not end (V • B = 0), repeated 
over and over for every spacelike slice through Faraday , gives everywhere the result dB/dt = — V x E. 
Thus (magnetostatics) + (covariance) —^ (magnetodynamics). Similarly—see Chapters 17 and 21— 
(geometrostatics) + (covariance) —*- (geometrodynamics). 


The structure dd A dd looks like a “collapsed egg-crate” (Figure 1.4, upper right) 
and has zero content, a fact formally evident from the vanishing of a A p = 
—(3 A a when a and p are identical. The result of the integration, assuming constant 


B x , is 


n -HO* «90° 

J F — q 2 B x f sin 2 0 dd I cos <p dcp 

surface 70° 


(4.14) 


It is not so easy to visualize a pure electric field by means of its 2-form F (Figure 
4.4, left) as it is to visualize a pure magnetic field by means of its 2-form F (Figures 
4.1, 4.2, 4.3). Is there not some way to treat the two fields on more nearly the same 
footing? Yes, construct the 2-form *F (Figure 4.4, right) that is dual (“perpendicular”; 
Box 4.3; exercise 3.14) to F. 
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Figure 4.4. 

The Faraday structure 

f = A dx v =±F 01 dt A dx + j F w dx A dt = E x dx A dt 

associated with an electric field in the x-direction, and the dual (“perpendicular”) Maxwell honeycomb¬ 
like 2-form 

F ~~ ~2 Fpv dx^ A dx = *F 23 dx 2 A dx 3 = F 01 dx 2 A dx 3 = F 10 dx 2 A dx 3 = dy A dz. 


Represent in geometric form the field of a point-charge of strength e at rest at 
the origin. Operate in flat space with spherical polar coordinates: 

ds 2 — —dr 2 = g^ v dx* dx v 

= — dt 2 -f dr 2 + r 2 dd 2 + r 2 sin 2 0 dy 2 . (4.15) 

The electric field in the r-direction being E r = e/r 2 , it follows that the 2-form F 

or Faraday is 

F = y dx 11 A dx v - - E T dt A dr = - ~dt A dr. (4.16) 

Its dual, according to the prescription in exercise 3.14, is Maxwell. 

Maxwell = *F — e sm6 d9 A d<p, (4.17) 

as illustrated in Figure 4.5. 

Take a tour in the positive sense around a region of the surface of the sphere 
illustrated in Figure 4.5. The number of tubes of *F encompassed in the route will 
be precisely 

( number\ _ / solid \ 

of tubes/ \angle/* 


Pattern of tubes in dual 
structure M ex well for 
point-charge at rest 


The whole number of tubes of *F emergent over the entire sphere will be 47re, in 
conformity with Faraday’s picture of tubes of force. 
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Box 4.3 DUALITY OF 2-FORMS IN SPACETIME 

Given a general 2-form (containing six exterior or wedge products) 

F = E x dx A dt + E y dy A dt + * * * + B z dx A dy , 
one gets to its dual (“perpendicular”) by the prescription 

*F = — B x dx A dt — * • • + E y dz A dx + E z dx A dy. 

Duality Rotations 

Note that the dual of the dual is the negative of the original 2-form; thus 

**F = - E x dx A dt - - —B z dx A dy = — F. 

In this sense * has the same property as the imaginary number i: ** = ii = — 1. 

Thus one can write 


e *a _ cos a + * s [ n a 

This operation, applied to F, carries attention from the generic 2-form in its simplest- 
representation (see exercise 4.1) 

F = E x dx A dt -f B x dy A dz 

to another “duality rotated electromagnetic field” 

e* a F = (E x cos a — B x sin a) dx A dt + ( B x cos a + E x sin a) dy A dz. 

If the original field satisfied Maxwell’s empty-space field equations, so does the new 
field. With suitable choice of the “complexion” a, one can annul one of the two 
wedge products at any chosen point in spacetime and have for the other 

(. B x 2 + E x 2 ) 1/2 dy A dz. 

___ J 


Field of a point-charge in How can one determine the structure of tubes associated with a charged particle 

mot,on moving at a uniform velocity? First express *F in rectangular coordinates moving 

with the particle (barred coordinates in this comoving “rocket” frame of reference; 
unbarred coordinates will be used later for a laboratory frame of reference). The 
relevant steps can be listed: 

(a) 


*F = e sin 6 dd A d<p — —e(d cos 0) A d<p; 
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Figure 4.5. 

The field of 2-forms Maxwell - *F = e sin 6 dO A that 
describes the electromagnetic field of a charge e at rest at the 
origin. This picture is actually the intersection of *F with a 
3-surface of constant time t; i.e., the time direction is sup¬ 
pressed from the picture. 



(b) 


(c) 


— arctan — ; 
x 


dy = 


x dy - y dx 
x 2 + y 2 ’ 


cos e = —; -t/(cos 6) = + J-(xdx+ y dy + zdz ); 


(d) combine to find 


*F = (e/r 3 )(x dy A dz + ytfz A dx + zdx A dy) (4.18) 

(electromagnetic field of point charge in a comoving Cartesian system; spherically 
symmetric). Now transform to laboratory coordinates: 


(a) 

(b) 


velocity parameter a 


velocity f) = tanh a 


1 

— p= = cosh a, 




= sinha 


t = t cosh « — x sinh a, 
x — —t sinh a + x cosh a, 

/ = y 2 = z; 


r = [(* cosh a - t sinh a) 2 + y 2 + z 2 ] 1/2 ; 


(c) *F = ( e/r 3 )[(x cosh « - t sinh a) dy A dz + y dz A 

(cosh ctdx - sinh « dt) + z(cosh a dx - sinh a dt) A dy]; (4.19) 
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(d) compare with the general dual 2-form, 



•F = E x dy A dz + E y dz A dx + E z dx A dy 
+ B x dt A dx + B y dt A dy + B t dt A dz . 


and get the desired individual field components 


(e) 

E x = (e/F 3 )(x cosh a — t sinh a), B x = 0, 

Ey = (e/r 3 )_ycosha, B y = — (e/7 3 )z sinh a, 

E z = (e/r 3 )z cosh a, B z = (e/r 3 )y sinh a. 

(4.20) 

One can 

verify that the invariants 



B 2 -E 2 = \F ap F^, 

(4.21) 



(4.22) 


have the same value in the laboratory frame as in the rocket frame, as required. 
Note that the honeycomb structure of the differential form is not changed when 
one goes from the rocket frame to the laboratory frame. What changes is only the 
mathematical formula that describes it. 


§4.4. RADIATION FIELDS 


How an acceleration causes 
radiation 


The Maxwell structure of tubes associated with a charge in uniform motion is more 
remarkable than it may seem at first sight, and not only because of the Lorentz 
contraction of the tubes in the direction of motion. The tubes arbitrarily far away 
move on in military step with the charge on which they center, despite the fact that 
there is no time for information “emitted” from the charge “right now” to get to 
the faraway tube “right now.” The structure of the faraway tubes “right now” must 
therefore derive from the charge at an earlier moment on its uniform-motion, 
straight-line trajectory. This circumstance shows up nowhere more clearly than in 
what happens to the field in consequence of a sudden change, in a short time At, 
from one uniform velocity to another uniform velocity (Figure 4.6). The tubes have 
the standard patterns for the two states of motion, one pattern within a sphere of 
radius r, the other outside that sphere, where r is equal to the lapse of time (“cm 
of light-travel time”) since the acceleration took place. The necessity for the two 
patterns to fit together in the intervening zone, of thickness Ar = At, forces the field 
there to be multiplied up by a “stretching factor,” proportional to r. This factor is 
responsible for the well-known fact that radiative forces fall off inversely only as 
the first power of the distance (Figure 4.6). 

When the charge continuously changes its state of motion, the structure of the 
electromagnetic field, though based on the same simple principles as those illustrated 
in Figure 4.6, nevertheless looks more complex. The following is the Faraday 2-form 
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Figure 4.6. 

Mechanism of radiation. J. J. Thomson’s way to understand why the strength of an electromagnetic wave 
falls only as the inverse first power of distance r and why the amplitude of the wave varies (for low 
velocities) as sin# (maximum in the plane perpendicular to the line of acceleration). The charge was 
moving to the left at uniform velocity. Far away from it, the lines of force continue to move as if this 
uniform velocity were going to continue forever (Coulomb field of point-charge in slow motion). However, 
closer up the field is that of a point-change moving to the right with uniform velocity (1/r 2 dependence 
of strength upon distance). The change from the one field pattern to another is confined to a shell of 
thickness At located at a distance r from the point of acceleration (amplification of field by “stretching 
factor” r sin# A$/At\ see text). We thank C. Teitelboim for the construction of this diagram. 



for the field of an electric dipole of magnitude p x oscillating up and down parallel Field of an oscillating dipole 
to the 2 -axis: 

F = E x dx A dt + • * • + B x dy A dz 4- • ■ * = real part of {/> 1 e iwT_iw * 

[2 co ^(tt - 7 ?) dl Adt + sin ^ (73 ~ 7? ~ ~jjr ) r de A dt 

gives £„ 

drArdd]} (4.23) 


gives E r 

, • a I — /to co 2 \ 


gives 




3 > 
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and the dual 2-form Maxwell = *F is 

*F = —B x dx A dt — • • • + E x dy A </- + • • • = real part of 
[sin 9 ^^ dt A r sin 9 d$ 

gives B 0 

4- 2 cos 0 d9 A r sin 9 d<j> 

gives E r 

+ sin 9 (-j — — ~\ r sin^<f<J) A cfr]}. (4.24) 

gives E e 


§4.5. MAXWELL'S EQUATIONS 


Taking exterior derivative 


The general 2-form F is written as a superposition of wedge products with a 
factor 

F ^jF^dx* A dx\ (4.25) 


because the typical term appears twice, once as F xy dx A dy and the second time 
as Fy X c/y A c(x, with — — ^xy t/y /\ dx ” — dx /\ dy 

If differentiation (“taking the gradient”; the operator d) produced out of a scalar 
a 1-form, it is also true that differentiation (again the operator d , but now generally 
known under Cartan’s name of “exterior differentiation”) produces a 2-form out 
of the general 1-form; and applied to a 2-form produces a 3-form; and applied to 
a 3-form produces a 4-form, the form of the highest order that spacetime will 
accommodate. Write the general/-form as 


& yj ^0:10:2...<* 


dx ai A dx a2 A 


A dx a r 


(4.26) 


where the coefficient _ afJ like the wedge product that follows it, is antisym¬ 
metric under interchange of any two indices. Then the exterior derivative of 0 is 


d<p = 


1 ^ ( &aia2...a f 

J\ dx^° 


dx ao A dx ai A dx a2 A • • • A dx a K 


(4.27) 


Take the exterior derivative of Faraday according to this rule and find that it 
vanishes, not only for the special case of the dipole oscillator, but also for a general 
electromagnetic field. Thus, in the coordinates appropriate for a local Lorentz frame, 
one has 
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dF = d(E x dx A dt + • • • + B x dy A dz + • • •) 

,/* + $*) 

+ • • • (5 more such sets of 4 terms each)_ 


-t 


dE x ZE ZE X 
-rfdt + —±dx + 1 

ot dx 


A dx A dt 


(4.28) 


Note that such a term as dy A dy A dz is automatically zero (“collapse of egg-crate 
cell when stamped on”). Collect the terms that do not vanish and find 


dF = 


(ZB. ZB ZB, \ 

= {-zf + if + ^f) dxAdyAdz 


+ 


X 

k dt 

+ 

1 

V 

dz i 

<ZB V 

1 

ZE X 

SEy 

, dt 

I 

dz 

dx > 

'ZB X 

+ 


ZE X ' 


dt A dz A dx 


(4.29) 


Each term in this expression is familiar from Maxwell’s equations 

div B = V • B = 0 

and 

curl E — V X E ~ —B. 

Each vanishes, and with their vanishing Faraday itself is seen to have zero exterior 
derivative: 

dF = 0. (4.30) 

In other words, “ Faraday is a closed 2-form”; “the tubes of F nowhere come to 
an end.” 

A similar calculation gives for the exterior derivative of the dual 2-form Maxwell 
the result 

d*F = d{ — B x dx A dt - • • • + E x dy A dz + • • •) 

Id E dE v dE \ 

= \— + -zf + —) dxAdyAdz 

'dE, ZB, ZB„ 


+ 

+ • 


(dE. dB dB u \ 

(—---h -r-^) dt A dy A dz 

\ dt dy dz 1 


— 4-77(p dx A dy A dz 
— J x dt A dy A dz 
— J y dt A dz A dx 
— J z dt A dx A dy) = 4 tt *J; 


d*F = 4 tt *J. 


(4.31) 


Faraday structure: tubes 
nowhere end 


Maxwell structure: density 
of tube endings given by 
charge-current 3-form 
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Duality: the only place in 
electromagnetism where 
metric must enter 


Closed 2-form contrasted 
with general 2-form 


In empty space this exterior derivative, too, vanishes; there Maxwell is a closed 
2-form; the tubes of *F 9 like the tubes of F, nowhere come to an end. 

In a region where charge is present, the situation changes. Tubes of Maxwell 
take their origin in such a region. The density of endings is described by the 3-form 
*J - charge , a "collection of eggcrate cells” collected along bundles of world lines. 
The two equations 


dF — 0 


and 


d*F = 477 *J 

summarize the entire content of Maxwell’s equations in geometric language. The 
forms F = Faraday , and *F = Maxwell , can be described in any coordinates one 
pleases—or in a language (honeycomb and egg-crate structures) free of any reference 
whatsoever to coordinates. Remarkably, neither equation makes any reference 
whatsoever to metric . As Hermann Weyl was one of the most emphatic in stressing 
(see also Chapters 8 and 9), the concepts of form and exterior derivative are metric- 
free. Metric made an appearance only in one place, in the concept of duality 
(“perpendicularity”) that carried attention from F to the dual structure *F. 


§4.6. EXTERIOR DERIVATIVE AND CLOSED FORMS 

The words "honeycomb” and "egg crate” may have given some feeling for the 
geometry that goes with electrodynamics. Now to spell out these concepts more 
clearly and illustrate in geometric terms, with electrodynamics as subject matter, 
what it means to speak of "exterior differentiation.” Marching around a boundary, 
yes; but how and why and with what consequences? It is helpful to return to functions 
and 1-forms, and see them and the 2-forms Faraday and Maxwell and the 3-form 
charge as part of an ordered progression (see Box 4.4). Two-forms are seen in this 
box to be of two kinds: (1) a special 2-form, known as a "closed” 2-form, which 
has the property that as many tubes enter a closed 2-surface as emerge from it 
(exterior derivative of 2-form zero; no 3-form derivable from it other than the trivial 
zero 3-form!); and (2) a general 2-form, which sends across a closed 2-surface a 
non-zero net number of tubes, and therefore permits one to define a nontrivial 3-form 
("exterior derivative of the 2-form”), which has precisely as many egg-crate cells 
in any closed 2-surface as the net number of tubes of the 2-form emerging from 
that same closed 2-surface (generalization of Faraday’s concept of tubes of force 
to the world of spacetime, curved as well as flat). 


(continued on page 120) 
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Box 4.4 THE PROGRESSION OF FORMS AND EXTERIOR DERIVATIVES 




0-Form or Scalar,/ 

An example in the context of 3-space and Newto¬ 
nian physics is temperature, T(x,y,z), and in the 
context of spacetime, a scalar potential, <f>(/, x,y, z). 



From Scalar to 7-Form 

Take the gradient or “exterior derivative” of a 
scalar / to obtain a special 1-form, y = df Com¬ 
ments: (a) Any additive constant included in / is 
erased in the process of differentiation; the quan¬ 
tity n in the diagram at the left is unknown and 
irrelevant, (b) The 1-form y is special in the sense 
that surfaces in one region “mesh” with surfaces 
in a neighboring region (“closed 1-form”), (c) Line 
integral df is independent of path for any class 
of paths equivalent to one another under continu¬ 
ous deformation, (d) The 1-form is a machine to 
produce a number (“bongs of bell” as each succes¬ 
sive integral surface is crossed) out of a displace¬ 
ment (approximation to concept of a tangent 
vector). 


General 7-Form (3 = fi a dx a 


This is a pattern of surfaces, as illustrated in the 
diagram at the right; i.e., a machine to produce 
a number (“bongs of bell”; </ 3 , i/)) out of a vector. 
A 1-form has a reality and position in space inde¬ 
pendent of all choice of coordinates. Surfaces do 
not ordinarily mesh. Integral f(3 around indicated 
dosed loop does not give zero (“more bongs than 
antibongs”). 





From 7-Form to 2-Form £ = dfi = 


Ms. 

dx fl 


dx^ A dx a 


f is a pattern of honeycomb-like cells, with a di¬ 
rection of circulation marked on each, so stationed 
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^ Box 4.4 { continued ) 


"A 


that the number of cells encompassed in the dotted 
closed path is identical to the net contribution 
(excess of bongs over antibongs) for the same path 
in the diagram of /? above. The “exterior deriva¬ 
tive” is defined so this shall be so; the generalized 
Stokes theorem codifies it. The word “exterior” 
comes from the fact that the path goes around the 
periphery of the region under analysis. Thus the 
2-form is a machine to get a number (number of 
tubes, , u A v)) out of a bit of surface (u A v) 
that has a sense of circulation indicated upon 
it. The 2-form thus defined is special in this sense: 
a rubber sheet “supported around its edges” by 
the dotted curve or any other closed curve is 
crossed by the same number of tubes when: (a) 
it bulges up in the middle; (b) it is pushed down 
in the middle; (c) it experiences any other continu¬ 
ous deformation. The Faraday or 2-form F of 
electromagnetism, always expressible as F — dA 
(A ■= 4-potential, a 1-form), also has always this 
special property (“conservation of tubes”). 



0-Form to /-Form to 2-Form? No! 

Go from scalar / to 1-form y — df. The next step 
to a 2-form a is vacuous. The net contribution of 
the line integral fy around the dotted closed path 
is automatically zero. To reproduce that zero result 
requires a zero 2-form. Thus a = dy = ddf has 
to be the zero 2-form. This result is a special in¬ 
stance of the general result dd — 0. 


General 2-Form a = -~o a ^ dx a A dx&, with o ae = — o^ a 

Again, this is a honeycomb-like structure, and 
again a machine to get a number (number of 
tubes, <cr, u A v)) out of a surface (u A v) that 
has a sense of circulation indicated on it. It is 
general in the sense that the honeycomb structures 
in one region do not ordinarily mesh with those 
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in a neighboring region. In consequence, a closed 
2-surface, such as the box-like surface indicated 
by dotted lines at the right, is ordinarily crossed 
by a non-zero net number of tubes. The net num¬ 
ber of tubes emerging from such a closed surface 
is, however, exactly zero when the 2-form is the 
exterior derivative of a 1-form. 



00 

From 2-Form to 3-Form fj — do = — |Q!/31 dx y A dx a A dx&, 

dx y 

where dx y A dx a A dx& = 3! dx [y ® dx a ® dx@ ] 


This egg-crate type of structure is a machine to 
get a number (number of cells <//, u A v A w>) 
from a volume (volume u A v A w within which 
one counts the cells). A more complete diagram 
would provide each cell and the volume of inte¬ 
gration itself with an indicator of orientation 
(analogous to the arrow of circulation shown for 
cells of the 2-form). The contribution of a given 
cell to the count of cells is -f-1 or — 1, according 
as the orientation indicators have same sense or 
opposite sense. The number of egg-crate cells of 
^ — do in any given volume (such as the volume 
indicated by the dotted lines) is tailored to give 
precisely the same number as the net number of 
tubes of the 2-form a (diagram above) that emerge 
from that volume (generalized Stokes theorem). 
For electromagnetism, the exterior derivative of 
Faraday or 2-form F gives a null 3-form, but the 
exterior derivative of Maxwell or 2-form *F gives 
477 times the 3-form *J of charge: 



J = p dx A dy A dz — J x dt A dy A dz 

— J y dt A dz A dx — J z dt A dx A dy. 
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Box 4.4 (continued) 



From /-Form to 2-Form to 3-Form? No! 

Starting with a 1-form (electromagnetic 4-potential), construct its exterior deriva¬ 
tive. the 2-form F = dA (Faraday). The tubes in this honeycomb-like structure never 
end. So the number of tube endings in any elementary volume, and with it the 3-form 
dF — ddA. is automatically zero. This is another example of the general result that 
dd = 0. 


From 2-Form to 3-Form to 4-Form? No! 

Starting with 2-form *F (Maxwell), construct its exterior derivative, the 3-form 
4?r *J. The cells in this egg-crate type of structure extend in a fourth dimension 
(“hypertube”). The number of these hypertubes that end in any elementary 4-vol¬ 
ume, and with it the 4-form 

d( 4t7 *J) = dd*F, 

is automatically zero, still another example of the general result that dd = 0. This 
result says that 

,, , (dp dJ x dJ u dj r \ 

d*J = I — + —£■ + — 1 + —^ )dt A dx A dy A dz = 0 
\ ct dx dy dz / 

(“law of conservation of charge”). Note: 

dx a A dx@ A dx y A dx* = 4! dx la <8> dxP <8 dx y ® dx s] . 

This implies dt A dx A dy A dz = c. 


From 3-Form to 4-Form t = dv 


yl 

dx* 


dx* A dx a A dxP A dx y 


This four-dimensional “super-egg-crate” type structure is a machine to get a number 
(number of cells, <r,#i A u A v A w» from a 4-volume n A u A v A w. 
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From 4-Form to 5-Form? No! 

Spacetime, being four-dimensional, cannot accommodate five-dimensional egg-crate 
structures. At least two of the dx * 1 ’s in 

dx a A dx& A dx y A dx s A dx* 

must be the same; so, by antisymmetry of “A,” this “basis 5-form” must vanish. 


Results of Exterior Differentiation, Summarized 


0-form 

/ 



1-form 

df 

A 


2-form 

o 

III 

1- 

F=dA 

*F 

3-form 

dF = ddA = 0 

4t7 *J - d*F v 

4-form ; 



d(4ir *J) = dd*F = 0 t = dv fj 

5-form? 

No! 


O 

III 

£ 

o 

III 

* 


New Forms from Old by Taking Dual (see exercise 3.14) 

Dual of scalar f is 4-form: */ = / dx° A dx 1 A dx 2 A dx 3 = fc. 

Dual of 1-form J is 3-form: *J = J° dx 1 A dx 2 A dx 3 — J 1 dx 2 A dx 3 A dx° 

-f J 2 dx 3 A dx° A dx 1 — J 3 dx° A dx 1 A dx 2 . 

Dual of 2-form F is 2-form: *F = dx* A dx% where 

F a * = v aX V fi8 J\s- 

Dual of 3-form K is 1-form: *K = K 012 dx 3 — K 123 dx° -{- K 230 dx 1 — K 301 dx 2 , 

where K a & y = tj atL t\^ v y\ yX K fivX . 

Dual of 4-form I is a scalar: L = L 0123 dx° A dx 1 A dx 2 A dx 3 ; 

*1 — 1 0123 — t 
L — — —^ 0123 - 

Note 1: This concept of duality between one form and another is to be distinguished 
from the concept of duality between the vector basis e a and the 1-form basis w a 
of a given frame. The two types of duality have nothing whatsoever to do with each 
other! 

v_ 
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Box 4.4 (continued) 


Note 2: In spacetime, the operation of taking the dual, applied twice, leads back 
to the original form for forms of odd order, and to the negative thereof for forms 
of even order. In Euclidean 3-space the operation reproduces the original form, 
regardless of its order. 


Duality Plus Exterior Differentiation 

Start with scalar <f>. Its gradient d<f> is a 1-form. Take its dual, to get the 3-form 
Take its exterior derivative, to get the 4-form </*</<£. Take its dual, to get the scalar 
0(f>= —*d*d<t>. Verify by index manipulations that □ as defined here is the 
wave operator; i.e., in any Lorentz frame, □<£ = <j> a ’ a = —(d 2 <p/dt 2 ) + V 2 <J>. 

Start with 1-form A. Get 2-form F = dA. Take its dual *F = *dA, also a 2-form. 

Take its exterior derivative, obtaining the 3-form d*F (has value 477 *J in electro¬ 
magnetism). Take its dual, obtaining the 1-form *d*F = *d*dA = 4ttJ (“Wave 
equation for electromagnetic 4-potential”). Reduce in index notation to 

V' = A *y’ - A ,y” = 

[More in Flanders (1963) or Misner and Wheeler (1957); see also exercise 3.17.] 

_____ J 


§4.7. DISTANT ACTION FROM LOCAL LAW 

Differential forms are a powerful tool in electromagnetic theory, but full power 
requires mastery of other tools as well. Action-at-a-distance techniques (“Green’s 
functions,” “propagators”) are of special importance. Moreover, the passage from 
Maxwell field equations to electromagnetic action at a distance provides a preview 
of how Einstein’s local equations will reproduce (approximately) Newton’s 1/r 2 law. 

In flat spacetime and in a Lorentz coordinate system, express the coordinates of 
particle A as a function of its proper time a, thus: 

a" = a"(a), = d^a), = a»(a). (4.32) 

da da z 

Dirac found it helpful to express the distribution of charge and current for a particle 
of charge e following such a motion as a superposition of charges that momentarily 
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flash into existence and then flash out of existence. Any such flash has a localization 
in space and time that can be written as the product of four Dirac delta functions 
[see, for example, Schwartz (1950-1951), Lighthill (1958)]: 


S 4 (*a - a") = S[x° - a°(a)] 5[x J - a 1 ^)] S[x 2 - a 2 (a)] S[* 3 - ^(a)]. (4.33) 


Here any single Dirac function S(x) (“symbolic function”; “distribution”; “limit of 
a Gauss error function” as width is made indefinitely narrow and peak indefinitely 
high, with integrated value always unity) both (1) vanishes for x 0, and (2) has 
the integral /i" S(x) </x = 1. Described in these terms, the density-current vector 
for the particle has the value (“superposition of flashes”) 

J» = ej S 4 [x" - a’XoOld'Xa) da. (4.34) 


The density-current (4.34) drives the electromagnetic field, F. Write F = dA to 
satisfy automatically half of Maxwell’s equations ( dF = ddA = 0): 


F = 

" a dx* 


dx a ' 


(4.35) 


In flat space, the remainder of Maxwell’s equations ( d*F — 477 *J ) become 


or 


dF " 

—= 4ttJ 
dx* M 


0 dA" d 2 A u . , 

-7)"“- = 4irJ a . 

0x" dx" dx v dx a M 


(4.36) 


Make use of the freedom that exists in the choice of 4-potentials A r to demand 


dA^ 

dx" 


= 0 


(4.37) 


(Lorentz gauge condition; see exercise 3.17). Thus get 

CL4 a = —477 J^. (4.38) 

The density-current being the superposition of “flashes,” the effect ( A) of this 
density-current can be expressed as the superposition of the effects E of elementary 
flashes; thus 

A^ix) = j E[x — ^(aljd^a) da, (4.39) 

where the “elementary effect” E (“kernel”; “Green’s function”) satisfies the equation 

□£(x) = -477 5 4 (x). (4.40) 


World line of charge 
regarded as succession of 
flash-on, flash-off charges 


The electromagnetic wave 
equation 


The solution of the wave 
equation 


One solution is the “half-advanced-plus-half-retarded potential,” 
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It vanishes everywhere except on the backward and forward light cones, where it 
has equal strength. Normally more useful is the retarded solution, 

which is obtained by doubling (4.41) in the region of the forward light cone and 
nullifying it in the region of the backward light cone. Ail electrodynamics (Coulomb 
forces, Ampere’s law, electromagnetic induction, radiation) follows from the simple 
expression (4.39) for the vector potential [see, e.g., Wheeler and Feynman (1945) 
and (1949), also Rohrlich (1965)]. 


ICISES 


Exercise 4.1. 


GENERIC LOCAL ELECTROMAGNETIC FIELD 
EXPRESSED IN SIMPLEST FORM 


In the laboratory Lorentz frame, the electric field is E , the magnetic field B. Special cases 
are: (1) pure electric field (B = 0); (2) pure magnetic field (E = 0); and (3) “radiation field” 
or “null field” (E and B equal in magnitude and perpendicular in direction). All cases other 
than (1), (2), and (3) are “generic.” In the generic case, calculate the Poynting density of 
flow of energy E X B/Ak and the density of energy ( E 2 + B 2 )/8rc\ Define the direction of 
a unit vector n and the magnitude of a velocity parameter a by the ratio of energy flow 
to energy density: 


n tanh 2a 


2 Ex B 
E 2 + B 2 


View the same electromagnetic field in a rocket frame moving in the direction of n with 
the velocity parameter a (not 2a; factor 2 comes in because energy flow and energy density 
are components, not of a vector, but of a tensor). By employing the formulas for a Lorentz 
transformation (equation 3.23), or otherwise, show that the energy flux vanishes in the rocket 
frame, with the consequence that E and B are parallel. No one can prevent the F-axis from 
being put in the direction common to E and B. Show that with this choice of direction, 
Faraday becomes 


F = E z dz A dt + B z dx A dy 

(only two wedge products needed to represent the generic local field; “canonical representa¬ 
tion”; valid in one frame, valid in any frame). 

Exercise 4.2. FREEDOM OF CHOICE OF 1-FORMS IN CANONICAL 
REPRESENTATION OF GENERIC LOCAL FIELD 

Deal with a region so small that the variation of the field from place to place can be neglected. 
Write Faraday in canonical representation in the form 

F = dp l A dq 1 + dp u A dq 11 , 

where p A (A = I or II) and q A are scalar functions of position in spacetime. Define a 
“canonical transformation” to new scalar functions of position p A and q A by way of the 
“equation of transformation” 


Pa d P A = dS + Pa * A 
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Figure 4.7. 

Some simple types of 1-forms compared and contrasted. 


where the “generating function” S of the transformation is an arbitrary function of the q A 
and the q A : 

dS = (dS/dq A ) dq A + (dS/dq*) dq A . 

(a) Derive expressions for the two p A ' s and the two p/s in terms of S by equating 
coefficients of dq l , dq 11 , dq T . dq fi individually on the two sides of the equation of trans¬ 
formation. 

(b) Use these expressions for the pf s and p/s to show that F — dp A A dq A and F = 
dp A A dq A , ostensibly different, are actually expressions for one and the same 2-form in 
terms of alternative sets of 1-forms. 

Exercise 4.3, A CLOSED OR CURL-FREE 1-FORM IS A GRADIENT 

Given a 1-form a such that da = 0, show that a can be expressed in the form a — df 
where/is some scalar. The 1-form a is said to be “curl-free,” a narrower category of 1-form 
than the “rotation-free” 1-form of the next exercise (expressible as a ~ h df ), and it in turn 
is narrower (see Figure 4.7) than the category of “l-forms with rotation” (not expressible 
in the form a = h df). When the 1-form a is expressed in terms of basis 1-forms dx a , 
multiplied by corresponding components o a , show that “curl-free” implies o [a ^ = 0. 

Exercise 4.4. CANONICAL EXPRESSION FOR A ROTATION-FREE 1-FORM 

In three dimensions a rigid body turning with angular velocity w about the z-axis has 
components of velocity v y — wjc, and v x = —coy. The quantity curl v = V x v has z-com- 
ponent equal to 2<o, and all other components equal to zero. Thus the scalar product of v 
and curl v vanishes: 

= °- 

The same concept generalizes to four dimensions, 

l \a.P v y) ~ 

and lends itself to expression in coordinate-free language, as the requirement that a certain 
3-form must vanish: 


dv A v — 0. 
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Any 1-form v satisfying this condition is said to be “rotation-free.” Show that a 1-form is 
rotation-free if and only if it can be written in the form 

v — h df 

where h and/ are scalar functions of position (the “Frobenius theorem”). 

Exercise 4.5. FORMS ENDOWED WITH POLAR SINGULARITIES 

List the principal results on how such forms are representable, such as 

® l =-^ A Vx +«!, 

and the conditions under which each applies [for the meaning and answer to this exercise, 
see Lascoux (1968)]. 

Exercise 4.6. THE FIELD OF THE OSCILLATING DIPOLE 

Verify that the expressions given for the electromagnetic field of an oscillating dipole in 
equations (4.23) and (4.24) satisfy dF — 0 everywhere and d*F = 0 everywhere except at 
the origin. 

Exercise 4.7. THE 2-FORM MACHINERY TRANSLATED 
INTO TENSOR MACHINERY 

This exercise is stated at the end of the legend caption of Figure 4.1. 

Exercise 4.8. PANCAKING THE COULOMB FIELD 

Figure 4.5 shows a spacelike slice, t — const, through the Maxwell of a point-charge at rest. 
By the following pictorial steps, verify that the electric-field lines get compressed into the 
transverse direction when viewed from a moving Lorentz frame: (1) Draw a picture of an 
equatorial slice (8 = tt/ 2; /, r, <£> variable) through Maxwell = *F. (2) Draw various space¬ 
like slices, corresponding to constant time in various Lorentz frames, through the result¬ 
ant geometric structure. (3) Interpret the intersection of Maxwell - *F with each Lorentz 
slice in the manner of Figure 4.3. 

Exercise 4.9. COMPUTATION OF SURFACE INTEGRALS 

In Box 4.1 the definition 

/*-/•- A &) M ‘ ■■■*’ 

is given for the integral of a p-form a over a p-surface ^(X 1 ,..., \ p ) in ^-dimensional space. 
From this show that the following computational rule (also given in Box 4.1) works: (1) 
substitute the equation for the surface, 

x k = x k (\\ 

into a and collect terms in the form 

a - a(X\... 9 X*>) dX 1 A ... A dX*>; 

(2) integrate 

/«■;.-/ a(A». \r)d\ 1 ...d\P 

using the elementary definition of integration. 
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Exercise 4.10. WHITAKER'S CALUMOID, OR, THE LIFE OF A LOOP 

Take a closed loop, bounding a 2-dimensional surface 5. It entraps a certain flux of Faraday 
<P F — S&F (“niagnetic tubes”) and a certain flux of Maxwell <P M — f s *F (“electric tubes”). 

(a) Show that the fluxes & F and & M depend only on the choice of loop, and not on the 
choice of the surface 5 bounded by the loop, if and only if dF = d*F = 0 (no magnetic 
charge; no electric charge). Hint : use generalized Stokes theorem, Boxes 4.1 and 4.6. 

(b) Move the loop in space and time so that it continues to entrap the same two fluxes. 
Move it forward a little more here, a little less there, so that it continues to do so. In this 
way trace out a 2-dimensional surface (“calumoid”; see E. T. Whitaker 1904) $ = ^{a, b)\ 
x M = x^a, b). Show that the elementary bivector in this surface, I — d^/da A d&/db 
satisfies <F, X) = 0 and < *F , Z> = 0, 

(c) Show that these differential equations for x^a, b) can possess a solution, with given 
initial condition x* = x^a, 0) for the initial location of the loop, if dF = 0 and d*F = 0 
(no magnetic charge, no electric charge). 

(d) Consider a static, uniform electric field F — —E x dt A dx. Solve the equations, 
(F,Z) = 0 and (*F,Z) = 0 to find the equation 9(a,b) for the most general calumoid. 
[Answer: y = y(a), z — z(a ), x = x(b ), t = /(6).] Exhibit two special cases: (i) a calumoid 
that lies entirely in a hypersurface of constant time [loop moves at infinite velocity; analogous 
to super-light velocity of point of crossing for two blades of a pair of scissors]; (ii) a calumoid 
whose loop remains forever at rest in the /, x, y , z Lorentz frame. 


Exercise 4.11. DIFFERENTIAL FORMS AND HAMILTONIAN MECHANICS 

Consider a dynamic system endowed with two degrees of freedom. For the definition of 
this system as a Hamiltonian system (special case: here the Hamiltonian is independent of 
time), one needs (1) a definition of canonical variables (see Box 4.5) and (2) a knowledge 
of the Hamiltonian H as a function of the coordinates q\ q 2 and the canonically conjugate 
momenta p v p 2 . To derive the laws of mechanics, consider the five-dimensional space of 
Pv and and a curve in this s P ace leadin S from starting values of the five 

coordinates (subscript A) to final values (subscript B), and the value 

pB pB 

I = J p^dq 1 + p 2 dq 2 - H(p, q) dt ss J w 

A A 

of the integral I taken along this path. The difference of the integral for two “neigh¬ 
boring” paths enclosing a two-dimensional region <$, according to the theorem of Stokes 
(Boxes 4.1 and 4.6), is 



The principle of least action (principle of “extremal history”) states that the representative 
point of the system must travel along a route in the five-dimensional manifold (route with 
tangent vector citf/dt) such that the variation vanishes for this path; i.e., 

dw(. .., <&/dt) = 0 

(2-form dcu with a single vector argument supplied, and other slot left unfilled, gives the 
1-form in 5-space that must vanish). This fixes only the direction of d&/dr, its magnitude 
can be normalized by requiring {dt, dtf/dt) = 1. 

(a) Evaluate dw from the expression w = pjdq J — H dt. 

(b) Set dtf/dt = giidP/dq’) 4- p^/dp,) + l(d<?/dt\ and expand dw(. . . y d^/dt) = 0 in 
terms of the basis {dp } , dq k , dt). 
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Box 4.5 METRIC STRUCTURE AND HAMILTONIAN OR "SYMPLECTIC STRUCTURE" 
COMPARED AND CONTRASTED 


Metric structure 


1. Physical application 

2. Canonical structure 


3. Nature of “metric” 

4. Name for given coordinate 
system and any other set of 
four coordinates in which 
metric has same form 

5. Field equation for this metric 


6. The four-dimensional manifold 

7. Coordinate-free description of 
the structure of this manifold 

8. Canonical coordinates 
distinguished from other 
coordinates (allowable but 
less simple) 


Symplectic 

structure 

Hamiltonian mechanics 


Geometry of spacetime Hamiltonian mechanics 

(•••’■••> = "d* 2 " = -dr ® dt Q — dp 1 A dq 1 + dp 2 A dq 2 
4 dx & dx 4- dy & dv 
4- dz & dz 

Symmetric Antisymmetric 


Lorentz coordinate system 


= 0 (zero Riemann 
curvature; flat spacetime) 


Spacetime 

Riemann — 0 


Antisymmetric 

System of “canonically” (or 
“dynamically”) conjugate 
coordinates 


dO — 0 (“closed 2-form”; 
condition automatically 
satisfied by expression above). 

Phase space 

dO = 0 


Make metric take above form Make metric take above form 


(item 2) 


(item 2) 


(c) Show that this five-dimensional equation can be written in the 4-dimensional phase 
space of [q\ p k ) as 

0(. .., d#/dt) = dH , 

where 0 is the 2-form defined in Box 4.5. 

(d) Show that the components of'0(..., d&/di) = dH in the {q\p k } coordinate system 
are the familiar Hamilton equations. Note that this conclusion depends only on the form 
assumed for 0, so that one also obtains the standard Hamilton equations in any other 
phase-space coordinates {q’,p k } (“canonical variables”) for which 

0 = dp t A dq 1 4* dp 2 A dq 2 . 


Exercise 4.12. SYMMETRY OPERATIONS AS TENSORS 

We define the meaning of square and round brackets enclosing a set of indices as follows: 


v = ~ y v 

( fl l-“p) ~ p\ a ^- a Tp’ 
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Box 4.6 BIRTH OF STOKES' THEOREM 

Central to the mathematical formulation of electromagnetism are the theorems of 
Gauss (taken up in Chapter 5) and Stokes. Both today appear together as one unity 
when expressed in the language of forms. In earlier times the unity was not evident. 
Everitt (1970) recalls the history of Stokes’ theorem: “The Smith’s Prize paper set 
by [G. C.] Stokes [Lucasian Professor of Mathematics] and taken by Maxwell in 
[February] 1854 ... 

5. Given the centre and two points of an ellipse, and the length of the major axis, 
find its direction by a geometrical construction. 

6. Integrate the differential equation 

(< a 2 — x 2 ) dy 2 4- Ixydydx + ( a 2 — y 2 ) dx 2 = 0. 

Has it a singular solution? 

7. In a double system of curves of double curvature, a tangent is always drawn at 
the variable point P; shew that, as P moves away from an arbitrary fixed point Q, 
it must begin to move along a generating line of an elliptic cone having Q for vertex 
in order that consecutive tangents may ultimately intersect, but that the conditions 
of the problem may be impossible. 

8. If X, 7, Z be functions of the rectangular co-ordinates x, y, z , dS an element 
of any limited surface, /, m, n the cosines of the inclinations of the normal at dS to 
the axes, ds an element of the bounding line, shew that 



the differential coefficients of X , Y, Z being partial, and the single integral being taken 
all round the perimeter of the surface 

marks the first appearance in print of the formula connecting line and surface 
integrals now known as Stokes’ theorem. This was of great importance to Maxwell’s 
development of electromagnetic theory. The earliest explicit proof of the theorem 
appears to be that given in a letter from Thomson to Stokes dated July 2, 1850.” 
[Quoted in Campbell and Garnett (1882), pp. 186-187.] 

_ J 
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Here the sum is taken over all permutations n of the numbers 1, 2,...,/?, and (— 1)* is 
+1 or — 1 depending on whether the permutation is even or odd. The quantity V may have 
other indices, not shown here, besides the set of p indices a v a 2 ,..., a p , but only this set 
of indices is affected by the operations described here. The numbers w 2 » • ■ • * w p are the 
numbers 1,2,...,/? rearranged according to the permutation tt. (Cases p = 2, 3 were treated 
in exercise 3.12.) We therefore have machinery to convert any rank-/? tensor with components 
V ai Ap into a new tensor with components 

Since this machinery Alt is linear, it can be viewed as a tensor which, given suitable argu¬ 
ments u, v, ...»iv, or, /?,...» y produces a number 

w ■ ■ ■ w> ' a i/ I P„ ■ ■ ■ Ym- 

(a) Show that the components of this tensor are 

= o»o _I 


(Note: indices of 8 are 
almost never raised or 
lowered, so this notation 
leads to no confusion.) 


where 


1 if (a x ,..., a p ) is an even permutation of (fi v ..., /? p ), 
— 1 if (a 1# ..., a p ) is an odd permutation of ..., /? p ), 

0 if (i) any two of the a’s are the same, 

0 if (ii) any two of the /Ts are the same, 

. 0 if (iii) the a’s and /?’s are different sets of integers. 


Note that the demonstration, and therefore these component values, are correct in any 
frame. 

(b) Show for any “alternating” (i.e., “completely antisymmetric ”) tensor A ai ap = A [ai ap] 
that 



U Yi...Y p Y p+1 ...Y p+fl 


= S 

a l <a 2 <...<a p 


°Yi. y 


= a l 


or 

Yi.Y p+(? 


The final line here introduces the convention that a summation over indices enclosed between 
vertical bars includes only terms with those indices in increasing order. Show, consequently 
or similarly, that 


8 a V a p01~Pq 
^ 1 —.Yp+ 9 




Yi*.-Yp 


(c) Define the exterior (“wedge”) product of any two alternating tensors by 


<« A 


8* i-^p V\...v q n 

Xi...XpXj,+ i...Xp+g "|ri...p 9 |> 


(U A V)K^ 


gx 1 ...x p x p+1 ...x p+9 


L/lM,-Mpl pi 


and similarly 
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Show that this implies equation (3.45b). Establish the associative law for this product rule 
by showing that 


[(a A 0) A Yl^ 

= l fl A (P a y Wr ; 


and show that this reduces to the 3-form version of Equation (3.45c) when a , /?, and y are 
all 1-forms. 

(d) Derive the following formula for the components of the exterior product of p vectors 


(«#! A i/ 2 A ... A u p r^ = BpfUuJ* ... (u p Y 

= plu 1 lai u 2 a2 ... ufd 

= 5 T2..T detf (“'‘) X ]' 
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CHAPTER 

STRESS-ENERGY TENSOR 
AND CONSERVATION LAWS 


V 


§5.1. TRACK-1 OVERVIEW 



- ^ 

The rest of this chapter is 

Track 2. 

It depends on no preceding 
Track-2 material. 

It is needed as preparation 
for Chapter 20 (conservation 
laws for mass and angular 
momentum). 

It will be extremely helpful in 
all applications of gravitation 
theory (Chapters 18-40). 

_ > 


“Geometry tells matter how to move, and matter tells geometry how to curve.” 
However, it will do no good to look into curvature (Part III) and Einstein’s law 
for the production of curvature by mass-energy (Part IV) until a tool can be found 
to determine how much mass-energy there is in a unit volume. That tool is the 
stress-energy tensor. It is the focus of attention in this chapter. 

The essential features of the stress-energy tensor are summarized in Box 5.1 for 
the benefit of readers who want to rush on into gravitation physics as quickly as 
possible. Such readers can proceed directly from Box 5.1 into Chapter 6—though 
by doing so, they close the door on several later portions of track two, which lean 
heavily on material treated in this chapter. 


§5.2. THREE-DIMENSIONAL VOLUMES AND DEFINITION 
OF THE STRESS-ENERGY TENSOR 

Spacetime contains a flowing “river” of 4-momentum. Each particle carries its 
4-momentum vector with itself along its world line. Many particles, on many world 
lines, viewed in a smeared-out manner (continuum approximation), produce a 
continuum flow—a river of 4-momentum. Electromagnetic fields, neutrino fields, 
me$on fields: they too contribute to the river. 

How can the flow of the river be quantified? By means of a linear machine: the 
stress-energy tensor T. 

Choose a small, three-dimensional parallelepiped in spacetime with vectors A, 
B, C for edges (Figure 5.1). Ask how much 4-momentum crosses that volume in 
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Box 5.1 CHAPTER 5 SUMMARIZED 

A. STRESS-ENERGY TENSOR AS A MACHINE 


At each event in spacetime, there exists a stress-energy tensor. It is a machine that 
contains a knowledge of the energy density, momentum density, and stress as 
measured by any and all observers at that event. Included are energy, momentum, 
and stress associated with all forms of matter and all nongravitational fields. 

The stress-energy tensor is a linear, symmetric machine with two slots for the 
insertion of two vectors: T(. ..,...). Its output, for given input, can be summarized 
as follows. 

(1) Insert the 4-velocity u of an observer into one of the slots; leave the other slot 
empty. The output is 


r(u,...) = r(..., ii) = 


/density of 4-momentum, 
f “ dp/dV r ” i.e., 4-momentum 
per unit of three-dimensional volume, 

I as measured in observer’s 

l Lorentz frame at event where 

\T is chosen / 


i.e., T a pU$ = Tp a uP = —(dp a /dV) for observer with 4-velocity u a . 

(2) Insert 4-velocity of observer into one slot; insert an arbitrary unit vector n into 
the other slot. The output is 


component, “n • dp/dV”, of \ 

4-momentum density along the 
n direction, as measured in 
observer’s Lorentz frame / 

i.e., T a pU a nP = T afi n a u^ = —n^dp^/dV. 

(3) Insert 4-velocity of observer into both slots. The output is the density of mass- 
energy that he measures in his Lorentz frame: 


T(u , n ) = T(n, u) = 


Y^ u u ) — ( mass_ener gy P er unit volume as measured\ 

\ in frame with 4-velocity u ) 

(4) Pick an observer and choose two spacelike basis vectors, e ; - and e k , of his Lorentz 
frame. Insert e ; - and e k into the slots of T. The output is the j\k component of 
the stress as measured by that observer: 


Tjk = T(*j> e k) - T kj = r (e&’ «j) 

//-component of force acting \ 

_ from side x k — e to side x k -f £, 
across a unit surface area with 
\ perpendicular direction e k ) 


/A:-component of force acting \ 
from side x> — e to side x j + e, 
across a unit surface area with 
\ perpendicular direction e } / 
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Box 5.1 (continued) 


B. STRESS-ENERGY TENSOR FOR A PERFECT FLUID 

One type of matter studied extensively later in this book is a “perfect fluid. ” A perfect 
fluid is a fluid or gas that (1) moves through spacetime with a 4-velocity u which 
may vary from event to event, and (2) exhibits a density of mass-energy p and an 
isotropic pressure p in the rest frame of each fluid element. Shear stresses, anisotropic 
pressures, and viscosity must be absent, or the fluid is not perfect. The stress-energy 
tensor for a perfect fluid at a given event can be constructed from the metric tensor, 
g , the 4-velocity, u, and the rest-frame density and pressure, p and p: 

T = (p + p)u ® u + pg, or T ap = (p + p)u a u p + pg a/3 . 

In the fluid’s rest frame, the components of this stress-energy tensor have the 
expected form (insert into a slot of T ' as 4-velocity of observer, just the fluid’s 
4-velocity): 

T a p uP = [(p + p)u«u p + p8 a p)uP = -(p + p)u a + pu a = -p« a ; 
i.e., 

T°pitP = —p — -(mass-energy density) = — dp°/dV,\ 

~ 0 — —(momentum density) = — dp r /dV\ 

also 


T- k = T(e-,e h ) = p8 jk = stress-tensor components. 


C. CONSERVATION OF ENERGY-MOMENTUM 

In electrodynamics the conservation of charge can be expressed by the differential 
equation 


0(charge density)/9r + V • (current density) = 0; 

i.e., /°^ 0 + V • / = 0; i.e. J a a = 0; i.e., ▼ • J = 0. Similarly, conservation of 
energy-momentum can be expressed by the fundamental geometric law 

▼ •r = o. 

(Because 7 is symmetric, it does not matter on which slot the divergence is taken.) 

This law plays an important role in gravitation theory. 

___ J 
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Figure 5.1. 

The “river” of 4-momentum flowing through spacetime, and three different 3-volumes across which it 
flows. (One dimension is suppressed from the picture; so the 3-volumes look like 2-volumes.) The first 
3-volume is the interior of a cubical soap box momentarily at rest in the depicted Lorentz frame. Its 
edges are Le x , Le y , Ley, and its volume 1-form, with “positive” sense toward future (“standard 
orientation”), is I = L 3 dt = — Vu( V = L 3 = volume as measured in rest frame; u = -dt = 4-velocity 
of box). The second 3-volume is the “world sheet” swept out in time At by the top of a second cubical 
box. The box top’s edges are Le x and Ley and its volume 1-form, with “positive” sense away from 
the box’s interior, in direction of increasing y, is I = L 2 At dy = d Ato (G = L 2 = area of box top; 
° - d y - unit 1-form containing world tube). The third 3-volume is an arbitrary one, with edges A , 
B, C and volume 1-form 2* = e fia/3y A a B^C y . 


its positive sense (i.e., from its “negative side” toward its “positive side”). To calculate 
the answer: (1) Construct the “volume 1-form” 

= +Wr 4 “ 5/5C7 ; (5.1) 


Mathematical representation 
of 3-volumes 


the parallelepiped lies in one of the 1-form surfaces, and the positive sense across 
the parallelepiped is defined to be the positive sense of the 1-form I . (2) Insert 
this volume 1-form into the second slot of the stress-energy tensor T. The result is 


T(. ..,!)= p = 

empty j 
slot 


( momentum crossing from 
negative side toward positive side 


(5.2) 


Momentum crossing a 
3-volume calculated, using 
stress-energy tensor 


(3) To get the projection of the 4-momentum along a vector w or 1-form a, insert 
the volume 1-form I into the second slot and w or a into the first: 

T(w,Z)= vsr p, T(a, Z) = <a,p>. (5.3) 


This defines the stress-energy tensor. 
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The key features of 3-volumes and the stress-energy tensor are encapsulated by 
the above three-step procedure. But encapsulation is not sufficient; deep under¬ 
standing is also required. To gain it, one must study special cases, both of 3-volumes 
and of the operation of the stress-energy machinery. 


Interior of a soap box: 


Its volume 1-form 


Its 4-momentum content 


A Special Case 

A soap box moves through spacetime. A man at an event ;? 0 on the box’s world 
line peers inside it, and examines all the soap, air, and electromagnetic fields it 
contains. He adds up all their 4-momenta to get a grand total p boxat#v How much 
is this grand total? One can calculate it by noting that the 4-momentum inside the 
box at ty’o is precisely the 4-momentum crossing the box from past toward future 
there (Figure 5.1). Hence, the 4-momentum the man measures is 

Pbox at 9o (5.4) 

where Z is the box’s volume 1-form at ^ 0 . But for such a soap box, Z has a 
magnitude equal to the box’s volume V as measured by a man in its momentary 
rest frame, and the box itself lies in one of the hyperplanes of Z ; equivalently, 

Z = - Vu, (5.5) 

where u is the soap box’s 4-velocity at 9 0 (minus sign because u, regarded as a 
1-form, has positive sense toward the past, u 0 < 0); see Box 5.2. Hence, the total 
4-momentum inside the box is 

Aboxat^o Vu) — VT(. . . , I/), (^*6) 

or, in component notation, 


(p a ) boxat^o = - VT*u t ,. (5.6') 

The energy in the box, as measured in its rest frame, is minus the projection of 
the 4-momentum on the box’s 4-velocity: 

E = -u • Pboxat^o = + VT ali u a Up = VT(u, u); 


so 


Its energy density 


energy density as\ 

| measured in box’s! =.-^ = T(u, u). (5.7) 

\ rest frame / 


Another Special Case 

A man riding with the same soap box opens its top and pours out some soap. In 
a very small interval of time At, how much total 4-momentum flows out of the box? 



Box 5.2 THREE-DIMENSIONAL VOLUMES 


A. General Parallelepiped 

1. Edges of parallelepiped are three vectors A, 
B, C. One must order the edges; e.g., “A 
is followed by B is followed by C.” 



(One dimension, that orthogonal 
to the parallelepiped, is 
suppressed here.) 


2. Volume trivector is defined to be A A B A C. It enters into the sophisticated 
theory of volumes (Chapter 4), but is not used much in the elementary theory. 

3. Volume 1-form is defined by 1 ^ — e /iia(iy A a B^C y . (A,B, C must appear here 
in standard order as chosen in step 1.) Note that the vector “corresponding” 
to E and the volume trivector are related by E = — *(A ABA C). 

4. Orientation of the volume is defined to agree 
with the orientation of its 1-form E. More 
specifically: the edges A, B, C lie in a hyper¬ 
plane of r«r, A> = <r, b> = <r, c> = o ; 
no “bongs of bell”). Thus, the volume itself 
is one of E’s hyperplanes! The positive sense 
moving away from the volume is defined to 
be the positive sense of I. Note: reversing (Qne dimension> that along which c 
the order of A, B, C reverses the positive extends, is suppressed here.) 
sense! 

5. The “standard orientation ” for a spacelike 3-volume has the positive sense of 
the 1-form E toward the future, corresponding t oA,B,C forming a righthanded 
triad of vectors. 



B. 3-Volumes of Arbitrary Shape 

Can be analyzed by being broken up into union of parallelepipeds. 

C. Interior of a Soap Box (Example) 

1. Analysis in soap box's rest frame. Pick an event on the box’s world line. The 
box’s three edges there are three specific vectors A, B, C. In the box’s rest frame 
they are purely spatial: A° = B° = C° = 0. Hence, the volume 1-form has 
components 2^ = 0 and 




Box 5.2 ( continued) 


A 1 A 2 A 3 
= € 0ijk A i B>C k = det 5 1 B 2 £ 3 
C 1 c 2 c 3 

= A • (B x C), in the standard notation of 3-dimensional vector analysis; 

= + V (V =z volume of box) if (A, B, C) are righthand ordered (positive 
sense of I toward future; standard orientation); 

= —V (V — volume of box) if (A, B, C) are lefthand ordered (positive sense 
of r toward past). 

This result reexpressed in geometric language: Let u be the box’s 4-velocity 
and V be its volume, as measured in its rest frame. Then either 
r = - vu, in which case the “positive side” of the box’s 3-surface is the future side, 
and its edges are ordered in a righthanded manner—the standard orien¬ 
tation; 

or else 

E — + Vu, in which case the “positive side” is the past side, and the box’s edges 
are ordered in a lefthanded manner. 


D. 3-Volume Swept Out in Time At by Two-Dimensional Top 
of a Soap Box (Example) 


1. Analysis in box’s rest frame: Pick an event on t 

box’s world line. There the two edges of the ^ 

box top are vectors A and £. In the box’s \ J. 
rest frame, orient the space axes so that A u ^ r 

and B lie in the jy,z-plane. During the lapse ^ y 

of a proper time At, the box top sweeps out B 

a 3-volume whose third edge is u At (u — 

4-velocity of box). In the box’s rest-frame, 
with ordering “ A followed by B followed by 
u At ,” the volume 1-form has components 

I ^ — 1 2 — y ^ — 0, and 
2 1 = e iik<y A ’ Bk Atu° = -€ 0 ljk A>B k At 

= —d At (d = area of box top) if ( e x ,A, B) are righthand ordered 
= +d At (d = area of box top) if (e x ,A,B) are lefthand ordered. 

(Note: No standard orientation can be defined in this case, because I can be 
carried continuously into — I by purely spatial rotations.) 

2. This result reexpressed in geometric language: Let d be the area of the box 
top as measured in its rest frame; and let a be a unit 1-form, one of whose 
surfaces contains the box top and its 4-velocity (i.e., contains the box top’s 
“world sheet”). Orient the positive sense of a with the (arbitrarily chosen) 
positive sense of the box-top 3-volume. Then 


Z — d At a. 
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To answer this question, consider the three-dimensional volume swept out during 
At by the box’s opened two-dimensional top (“world sheet of top”). The 4-momen¬ 
tum asked for is the 4-momentum that crosses this world sheet in the positive sense 
(see Figure 5.1); hence, it is 


Pflowsout 

where I is the world sheet’s volume 1-form. Let d be the area of the box top, and 
o be the outward-oriented unit 1-form, whose surfaces contain the world sheet (i.e., 
contain the box top and its momentary 4-velocity vector). Then 

Z^dAro (5.9) 

(see Box 5.2); so the 4-momentum that flows out during At is 

Pflowsout = d ArT(...,o). (5.10) 

§5.3. COMPONENTS OF STRESS-ENERGY TENSOR 

Like all other tensors, the stress-energy tensor is a machine whose definition and 
significance transcend coordinate systems and reference frames. But any one ob¬ 
server, locked as he is into some one Lorentz frame, pays more attention to the 
components of T than to T itself. To each component he ascribes a specific physical 
significance. Of greatest interest, perhaps, is the “time-time” component. It is the 
total density of mass-energy as measured in the observer’s Lorentz frame: 

T 00 = —T 0 °= T 00 = 7*(e 0 , e 0 ) = density of mass-energy (5.11) 

(cf. equation 5.7, with the observer’s 4-velocity u replaced by the basis vector e 0 = u ). 

The “spacetime” components T j0 can be interpreted by considering the interior 
of a soap box at rest in the observer’s frame. If its volume is V 9 then its volume 
1-form is Z = — Vu = + Vdt\ and the jn-component of 4-momentum inside it is 

p* = (dx^p) = T(dx * I) = VT(dx», dt) = FT 7 * 0 . 

Thus, the 4-momentum per unit volume is 


p»/V= 7 7 * 0 , 


or, equivalently: 


T 00 = density of mass-energy 

(units: g/cm 3 , or erg/cm 3 , or cm” 2 ); 

T j0 — density of j-component of momentum 
(units: g (cm/sec) cm* 3 , or cm’ 2 ). 


(5.12a) 


(5.13a) 

(5.13b) 


The top of a soap box: 


Its volume 1 -form 


Its 4-momentum that flows 
across 


Physical interpretation of 
stress-energy tensor's 
components: 

T 00 : energy density 


P°: momentum density 


The components T 7 ** can be interpreted using a two-dimensional surface of area 
d, at rest in the observer’s frame with positive normal pointing in the ^-direction. 
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7" ofc : energy flux 
P k : stress 


Number-flux vector for 
swarm of particles defined 


During the lapse of time At, this 2-surface sweeps out a 3-volume with volume 1 -form 
Z — (f At dx k (see Box 5.2). The ja-component of 4-momentum that crosses the 
2-surface in time At is 

p* = T{dx*, I) - a At T(dx *, dx k ) - a At P* 


Thus, the flux of 4-momentum (4-momentum crossing a unit surface oriented 
perpendicular to e k , in unit time) is 


(/>Vtf40crossingsurfaceltoe, = P*. 

(5.12b) 

or, equivalently: 


T° k = A>component of energy flux 

(5.13c) 

(units: erg/cm 2 sec, or cm' 2 ); 

V k ~ j, k component of “stress” 


=E /c-component of flux of y-component of momentum 

(5.13d) 


=y-component of force produced by fields and matter at x k — € acting 
on fields and matter at x k + € across a unit surface, the perpendicular 
to which is e k 

(units: dynes/cm 2 , or cm -2 ). 

(Recall that “momentum transfer per second” is the same as “force.”) 

The stress-energy tensor is necessarily symmetric, T a & = T^ a ; but the proof of 
this will be delayed until several illustrations have been examined. 


§5.4. STRESS-ENERGY TENSOR FOR A SWARM 
OF PARTICLES 

Consider a swarm of particles. Choose some event 9 inside the swarm. Divide the 
particles near into categories, A = 1, 2,..., in such a way that all particles in 
the same category have the same properties: 

m U) , rest mass; 

u U) , 4-velocity; 

P(A) — m (A) u (A)’ 4-momentum. 


Let N U) be the number of category-v4 particles per unit volume, as measured in the 
particles’ own rest frame. Then the “number-flux vector” S U) , defined by 

*U) = *U)"U)> (5-14) 

has components with simple physical meanings. In a frame where category-^ parti¬ 
cles have ordinary velocity v U) , the meanings are: 
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5 u> = N u) u h) = N u) t 1 - v u) 2 ] _1/2 = number density; (5.15a) 

_i ' dT 

i i 

Number density in Lorentz contraction 

particles’ rest frame factor for volume 

S u) = N U) U U) = = fl nx of particles. (5.15b) 


Consequently, the 4-momentum density has components 

T& = Pfjfito “ m U) U tA)N{A) U U) 

= m OO N U) u ti) u U)’ 

and the flux of ju-component of momentum across a surface with perpendicular 
direction e- is 

— Pto^l i) = m oo u to^oo u L) 

= m Oi) N OO u to u L)- 


These equations are precisely the ja, 0 and / \i,j components of the geometric, frame- 
independent equation 

Too = m oo N oo u U) ® u (a) = Poo ® S u , (5.16) 


Stress-energy tensor for 
swarm of particles 


The total number-flux vector and stress-energy tensor for all particles in the swarm 
near 9 are obtained by summing over all categories: 


s = 2 au«u>; ( 5 - 17 ) 

A 

T = 2 m u> N u) u u> 0 °C4) = E Pu) 0 s u>- ( 5 - 18 ) 

A A 


§5.5. STRESS-ENERGY TENSOR FOR A PERFECT FLUID 

There is no simpler example of a fluid than a gas of noninteracting particles (“ideal Ideal gas defined 
gas”) in which the velocities of the particles are distributed isotropically. In the 
Lorentz frame where isotropy obtains, symmetry argues equality of the diagonal 
space-space components of the stress-energy tensor. 


T = T = T = V 

1 xx 1 yy zz 


m OQ» AA) 

(1 - V, 




U) 


2 ) 1/2 (1 


r U) 


2y/2 ’ 


(5.19) 


and vanishing of all the off-diagonal components. Moreover, (5.19) represents a 
product: the number of particles per unit volume, multiplied by velocity in the 
jc-direction (giving flux in the x-direction) and by momentum in the x-direction, 
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Stress-energy tensor for 
gas or perfect fluid 

Perfect fluid defined 


giving the standard kinetic-theory expression for the pressure, p . Therefore, the 
stress-energy tensor takes the form 


p 

0 

0 

0 

0 

p 

0 

0 

0 

0 

p 

0 

0 

0 

0 

p 


(5.20) 


in this special Lorentz frame—the ‘Test frame” of the gas. Here the quantity p has 
nothing directly to do with the rest-masses of the constituent particles. It measures 
the density of rest-plus-kinetic energy of these particles. 

Rewrite (5.20) in terms of the 4-velocity u a = (1,0,0,0) of the fluid in the gas’s 
rest frame, and find 


p 

0 

0 

0 


0 

0 

0 

0 

0 

0 

0 

0 


0 

p 

0 

0 

0 

0 

0 

0 

+ 

0 

0 

p 

0 

0 

0 

0 

0 


0 

0 

0 

p 


= P U « U P + PiVafit + 


or, in frame-independent, geometric language 

T = pg -f (p + p)u ® u. (5.21) 

Expression (5.21) has general application. It is exact for the “ideal gas” just consid¬ 
ered. It is also exact for any fluid that is “perfect” in the sense that it is free of 
such transport processes as heat conduction and viscosity, and therefore (in the rest 
frame) free of shear stress (diagonal stress tensor; diagonal components identical, 
because if they were not identical, a rotation of the frame of reference would reveal 
presence of shear stress). However, for a general perfect fluid, density p of 
mass-energy as measured in the fluid’s rest frame includes not only rest mass plus 
kinetic energy of particles, but also energy of compression, energy of nuclear binding, 
and all other sources of mass-energy [total density of mass-energy as it might be 
determined by an idealized experiment, such as that depicted in Figure 1.12, with 
the sample mass at the center of the sphere, and the test particle executing oscillations 
of small amplitude about that location, with co 2 = (4tt/3)p], 


§5.6. ELECTROMAGNETIC STRESS-ENERGY 

Faraday, with his picture of tensions along lines of force and pressures at right angles 
to them (Figure 5.2), won insight into new features of electromagnetism. In addition 
to the tension E 2 /%tt (or B 2 /Stt) along lines of force, and an equal pressure at right 
angles, one has the Poynting flux (E X B)/4tt and the Maxwell expression for the 
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Figure 5.2. 

Faraday stresses at work. When the electromagnet is connected 
to an alternating current, the aluminum ring flies into the air. 


energy density, ( E 2 + All these quantities find their places in the Maxwell 

stress-energy tensor, defined by 


4mT* v = F fm F\ 


\r p F a ^. 


Stress-energy tensor for 
' ' ' electromagnetic field 


Exercise 5.1 . EXERCISE 

Show that expression (5.22), evaluated in a Lorentz coordinate frame, gives 

T 00 = (E 2 + B 2 )/Sv, T 0 ’ = P° = (EX B)’/4v, 

i r i i (5-23) 

V k = -(£#£* + BiB k ) + J (E 2 + B 2 )8’ k I. 

Show that the stress tensor does describe a tension (E 2 4- B 2 )/87t along the field lines and 
a pressure (E 2 + B 2 )/ 8-7T perpendicular to the field lines, as stated in the text. 


§5,7. SYMMETRY OF THE STRESS-ENERGY TENSOR 

All the stress-energy tensors explored above were symmetric. That they could not 
have been otherwise one sees as follows. 

Calculate in a specific Lorentz frame. Consider first the momentum density Proof that stress-energy 
(components P°) and the energy flux (components T °*). They must be equal because tensor > s symmetric 
energy = mass (“£ — Me 2 = AT’): 

T oi = (energy flux) 

= (energy density) X (mean velocity of energy flow)* 

= (mass density) X (mean velocity of mass flow)*’ 

= (momentum density) = P°. 

Only the stress tensor T }k remains. For it, one uses the same standard argument 
as in Newtonian theory. Consider a very small cube, of side L, mass-energy T°°L 3 , 
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and moment of inertia — r°°Z/\ With the space coordinates centered at the cube, 
the expression for the z-component of torque exerted on the cube by its surround¬ 
ings is 



(— T yz L 2 ) 

(L/2) 

-f 

(T^L 2 ) 

{-L/2) 

ly- component^ 

j lever \ 


/y-component\ 

j lever \ 

of force on 

j arm to ! 


of force on 

j arm to 

\ -j- x face j 

U-x facej 


\ —x face j 

\—x face/ 


- ( -T*vL>) (L/2) - (T**L 2 ) j-L/2) 

( A-component\ / lever \ /A-component\ / lever \ 
of force on arm to j of force on tf arm to | 
+y face j\yy face/ \ — y face /\— y face/ 


_ T VX )L 3 . 


Since the torque decreases only as L 3 with decreasing L, while the moment of inertia 
decreases as L 5 , the torque will set an arbitrarily small cube into arbitrarily great 
angular acceleration—which is absurd. To avoid this, the stresses distribute them¬ 
selves so the torque vanishes: 

yyx = t x v 


Put differently, if the stresses were not so distributed, the resultant infinite angular 
accelerations would instantaneously redistribute them back to equilibrium. This 
condition of torque balance, repeated for all other pairs of directions, is equivalent 
to symmetry of the stresses: 

P k = T k K (5.25) 


§5.8. CONSERVATION OF 4-MOMENTUM: 
INTEGRAL FORMULATION 


Integral conservation law for 
4-momentum: 

<£ r* d 3 i = o 

c'T 


Energy-momentum conservation has been a cornerstone of physics for more than 
a century. Nowhere does its essence shine forth so clearly as in Einstein’s geometric 
formulation of it (Figure 5.3,a). There one examines a four-dimensional region of 
spacetime V bounded by a closed, three-dimensional surface 3T. As particles and 
fields flow into V and later out, they carry 4-momentum. Inside T the particles 
collide, break up, radiate; radiation propagates, jiggles particles, produces pairs. But 
at each stage in this complex maze of physical processes, total energy-momentum 
is conserved. The energy-momentum lost by particles goes into fields; the energy- 
momentum lost by fields goes into particles. So finally, when the “river” of 4-mo- 
mentum exits from IT, it carries out precisely the same energy-momentum as it 
carried in. 

Restate this equality by asking for the total flux of 4-momentum outward across 
3T. Count inflowing 4-momentum negatively. Then “inflow equals outflow” means 
“total outflow vanishes”; 




(a) A four-dimensional region of spacetime ’V bounded by a closed three-dimensional surface 3‘T~. The 
positive sense of 3*7* is defined to be everywhere outward (away from ‘7*). Conservation of energy- 
momentum demands that every bit of 4-momentum which flows into T’ through 3T* must somewhere 
flow back out; none can get lost inside; the interior contains no ‘'sinks.” Equivalently, the total flux 
of 4-momentum across 3‘T in the positive (outward) sense must be zero: 


<j> P“ <P2„ = 0. 

av 

Figures (b), (c). (d), and (e) depict examples to which the text applies this law of conservation of 
4-momentum. All symbols T' (or S) in these figures mean spacetime volumes (or spacelike 3-volumes) 
with standard orientations. The dotted arrows indicate the positive sense of the closed surface c“7* used 
in the text’s discussion of 4-momentum conservation. How 3‘7* is constructed from the surfaces ,S and 
T is indicated by formulas below the figures. For example, in case (b), r7* = S. 2 - means that cT' 
is made by joining together with its standard orientation and with reversed orientation. 
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Total flux of 4-momentum outward across a closed 
three-dimensional surface must vanish. ' ‘ 

To calculate the total outward flux in the most elementary of fashions, approximate 
the closed 3-surface dT by a large number of flat 3-volumes (“boiler plates”) with 
positive direction oriented outward (away from T). Then 

Ptotaio U t= 2 r(..., r U) ) = o, (5.27) 

boiler plates A 


where -F U) is the volume 1-form of boiler plate A. Equivalently, in component 
notation 

PVa,out = X T^S U)a . (5.27') 

A 

To be slightly more sophisticated about the calculation, take the limit as the number 
of boiler plates goes to infinity and their sizes go to zero. The result is an integral 
(Box 5.3, at the end of this section), 


P'm aiout=f T** d 3 Z a = 0. (5.28) 

d-V 

Think of this (like all component equations) as a convenient way to express a 
coordinate-independent statement: 


Special cases of integral 
conservation law: 


Ptotal out = j T‘ d 3 Z = 0. (5.29) 

J br 

To be more sophisticated yet (not recommended on first reading of this book) and 
to simplify the computations in practical cases, interpret the integrands as exterior 
differential forms (Box 5.4, at the end of this section). 

But however one calculates it, and however one interprets the integrands, the 
statement of the result is simple: the total flux of 4-momentum outward across a 
closed 3-surface must vanish. 

Several special cases of this “integral conservation law,” shown in Figure 5.3, are 
instructive. There shown, in addition to the general case (a), are: 


Case (b) 

The closed 3-surface dT is made up of two slices taken at constant time t of a specific 
Lorentz frame, plus timelike surfaces at “infinity” that join the two slices together. 
The surfaces at infinity do not contribute to d*2 a if the stress-energy tensor 

dies out rapidly enough there. The boundary dT of the standard-oriented 4-volume 
T, by definition, has its positive sense away from T. This demands nonstandard 
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orientation of S 1 (positive sense toward past), as is indicated by writing dT — 
S 2 — S a ; and it produces a sign flip in the evaluation of the hypersurface integral 

0=f r*(P2 = -f T a0 dxdydz+f T a0 dx dy dz. 

J s x J s 2 

Because T a0 is the density of 4-momentum, this equation says 



( total 4-momentum in \ 
all of space at time t 1 ) 


J T a0 dx dy dz 
S 1 

( total 4-momentum in 
all of space at time t 2 



T a0 dx dy dz. 


(5.30) 


Total 4-momentum conserved 
in time 


Case (c) 

Here one wants to compare hypersurface integrals over 5 and 5, which are slices 
of constant time, t = const and t = const in two different Lorentz frames. To form 
a closed surface, one adds time-like hypersurfaces at infinity and assumes they do 
not contribute to the integral. The orientations fit together smoothly and give a closed 
surface 

0T = 5 — 5 + (surfaces at infinity) 

only if one takes T = C V' 2 — C V 1 —i.e., only if one uses the nonstandard 4-volume 
orientation in (See part AT of Box 5.3 for “standard” versus “non-standard” 
orientation.) The integral conservation law then gives 

0 = jT T • d 3 Z — jT T-d 3 I, 


or, equivalently, 


| T • d 3 F — (total 4-momentum p on 5) 

a 


f T m d 3 I = (total 4-momentum p on 


Total 4-momentum the same 
in all Lorentz frames 


(5.31) 


This says that observers in different Lorentz frames measure the same total 4-mo¬ 
mentum p. It does not mean that they measure the same components ( p a ^ p a )\ 
rather, it means they measure the same geometric vector 


Pons = P a *a = Pons = 


a vector whose components are connected by the usual Lorentz transformation law 

P a = (5.32) 



146 


5. STRESS-ENERGY TENSOR AND CONSERVATION LAWS 


Total 4-momentum 
independent of hypersurface 
where measured 


Change with time of 
4-momentum in a box equals 
flux of 4-momentum across 
its faces 


Differential conservation law 
for 4-momentum: ▼■7=0 


Case (d) 

Here the contribution to the integral comes entirely from two arbitrary spacelike 
hypersurfaces, 5^ and S B , cutting all the way across spacetime. As in cases (a) and 
(b), the integral form of the conservation law says 

PonS A PonS B ’ (5.33) 

i.e., the total 4-momentum on a spacelike slice through spacetime is independent of 
the specific slice chosen —so long as the energy-momentum flux across the “hypersur¬ 
face at infinity” connecting S A and S B is zero. 

Case (e) 


This case concerns a box whose walls oscillate and accelerate as time passes. The 
three-dimensional boundary dT is made up of (1) the interior S of the box, at an 
initial moment of time t = constant in the box’s initial Lorentz frame, taken with 
nonstandard orientation; (2) the interior 5 of the box, at 7 — constant in its final 
Lorentz frame, with standard orientation; (3) the 3-volume ff swept out by the box’s 
two-dimensional faces between the initial and final states, with positive sense oriented 
outward. The integral conservation law f- dr T* d 3 Z = 0 says 


/total 4-momentum 
\in box at S 


( total 4-momentum\ 
in box at 5 / 

_ /total 4-momentum that enters box through 
V its faces between states 5 and 5 


(534) 


§5.9. CONSERVATION OF 4-MOMENTUM: 

DIFFERENTIAL FORMULATION 

Complementary to any “integral conservation law in flat spacetime” is a “differential 
conservation law” with identical information content. To pass back and forth between 
them, one can use Gauss’s theorem. 

Gauss’s theorem in four dimensions, applied to the law of 4-momentum conserva¬ 
tion, converts the surface integral of into a volume integral of 7> a a : 

0 = T* a d 3 Z a = f T a dt dx dy dz. (5.35) 

J 9 r J r 

(See Box 5.3 for elementary discussion; Box 5.4 for sophisticated discussion.) If the 
integral of a is to vanish, as demanded, for any and every 4-volume T, then 
T m a must itself vanish everywhere in spacetime: 

T m a — 0; i.e., ▼ * T = 0 everywhere. (5.36) 

(continued on page 152) 
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Box 5.3 VOLUME INTEGRALS, SURFACE INTEGRALS, AND 
GAUSS’S THEOREM IN COMPONENT NOTATION 

A. Volume Integrals in Spacetime 

1. By analogy with three-dimensional space, the volume of a “hyperparallele¬ 
piped” with vector edges A, B, C, D is 


4-volume = i2~= a B l3 C y D s = det 


A 0 

A 1 

A 2 

A 3 

B° 

B 1 

B 2 

B 3 

C° 

C 1 

C 2 

C 3 

D° 

D 1 

D 2 

D 3 


= *{A A B A C A D). 


Here, as for 3-volumes, orientation matters; interchange of any two edges 
reverses the sign of i2. The standard orientation for any 4-volume is the one 
which makes i2 positive; thus, e 0 A e, A e 2 A e 3 has standard orientation if 
e 0 points toward the future and e v e 2 , e 3 are a righthanded triad. 

2. The “volume element” whose edges in a specific, standard-oriented Lorentz 
frame are 

A a = {At, 0,0,0), B a = (0, Ax, 0,0), C a = (0,0, Ay, 0), D a = (0,0,0, Az) 
has a 4-volume, according to the above definition, given by 
A*S2 = c 0123 At Ax Ay Az = At Ax Ay Az. 


3. Thus, the volume integral of a tensor S over a four-dimensional region T of 
spacetime, defined as 


M= Lim S ^ at center of (volume of <Z), 

( number of\ /elementary\ 
elementary 1 I volumes ct ) 
volumes I l in T / 


can be calculated in a Lorentz frame by 


MV = 



J S a p y d 4 i2 - f S a p y dt dx dy dz. 
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Box 5.3 (continued) 

B. Integrals over 3-Surfaces in Spacetime 

1. Introduce arbitrary coordinates a , b , c on the 
three-dimensional surface. The elementary 
volume bounded by coordinate surfaces 

a 0 < a < a 0 + Aa, b 0 < b < b 0 + Ab , 
c 0 < c < c 0 + Ac 




has edges 
dx a 


A a = 


Aa, BP =^Ab,a = ^f-Ac; 
da db dc 



so its volume 1-form is 

/i3 v „ dx a dx^ bx'*' ^ al a 

*z,= t mnsris-sr*‘ M ‘ lc - 

2. The integral of a tensor S over the 3-surface 5 thus has components 

X r dx^ dx p dx^ 

„ S V^y = l 

An equivalent formula involving a Jacobian is often used (see exercise 5.5): 


N« t 


- f y 1 dix^x^x*) 

-J** IT € 


fiv\ ' 


d(a 9 b , c) 


da db dc. 


C. Gauss's Theorem Stated 

1. Consider a bounded four-dimensional region 
of spacetime T with closed boundary 3T. 
Orient the volume 1-forms on bV so that the 
“positive sense” is away from T. 

2. Choose a tensor field S. Integrate its diver¬ 
gence over V, and integrate it itself over 3T. 
The results must be the same ( Gauss's 
theorem ): 

f sy >7 d*a = f s%yd 3 i r 

r * J dr 


^^Positive sense 
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D. 



Proof of Gauss's Theorem 


1. The indices a and ft of S a p y “go along for a free ride,” so one can suppress 
them from the proof. Then the equation to be derived is 

f S y dt dx dydz = <f S y d 3 I y . 

*V ’ “ar 

2. Since the integral of a derivative is just the original function, the volume integral 
of S% is 



3. The surface integral f dr S°d 3 Z 0 can be reduced to the same set of terms: 

a. Use x, y, z as coordinates on 3T. On the “up” side, d 3 I 0 must be positive 
to achieve a “positive” sense pointing away from T, so (see part B above) 

d 3 z o = £ 0a p y ^^717 dxd y dz = e oi 23 dx d y dz = dx d y dz - 

b. On the “down” side, d 3 2 0 must be negative, so 

d 3 Z 0 = — dx dy dz. 


c. Hence, 


f S°d 3 I o=f S°dxdydz-f S°dxdydz. 

* / 9*y “up” “down” 

4. Equality is proved for the other components in the same manner. Adding 
components produces the result desired; 




j 


f ST'yd*0 = f s y d 3 l r 

■r ’ J -&r 



FOR THE READER WHO HAS STUDIED CHAPTER 4 

Box 5.4 I. EVERY INTEGRAL IS THE INTEGRAL OF A FORM. 

II. THE THEOREM OF GAUSS IN THE LANGUAGE OF FORMS. 


I. Every integral encountered in Chapter 5 can be interpreted as the integml of an 
exterior differential form. This circumstance shows up in foil Hold and threefold 
integrals, for example, in the fact that 

d 4 tt = e = *1 = e 0123 dt A dx A dy A dz 

and 

d3S n = Wvl dx<x A dx& A dxl 

are basis 4- and 3-forms. (Recall: the indices afty between vertical bars are to be 
summed only over 0 < a < ft < y < 3.) A more extensive glossary of notations is 
found in C below. 

II. Gauss's Theorem for a tensor integral in flat space reads 


f (V-S)d 4 S2 = S'dZ 

J r >r 

for any tensor, such as S = S a (3 y e a ® w 1 * <8> e y (see Box 5.3 for component form). 
It is an application of the generalized Stokes Theorem (Box 4.1), and depends on 
the fact that the basis vectors e a and at & of a global Lorentz frame are const ant s y 
i.e., are independent of x. The definitions follow in A; the proof is in B. 

A. Tensor-valued integrals can be defined in flat spaces because one uses constant 
basis vectors. Thus one defines 

f S- d 3 I = e a <S> w* f S a ff d 3 I y 

for a tensor of the indicated rank. One justifies pulling basis vectors and forms outside 
the integral sign because they are constants, independent of location in spacetime. 
Each of the numbers fS a p y d 3 2 y (for a, ft = 0,1,2,3) is then evaluated by substi¬ 
tuting any properly oriented parametrization of the hypersurface into the 3-form 
S a p y d 3 Z y as described in Box 4.1 (arbitrary curvilinear parametrization in the part 
of the calculation not involving the “free indices” a and ft). In other words, 
S' d 3 £ — e a ® ujP ® S a p y d 3 2 y is considered a “tensor-valued 3-form.” Under 
an integral sign, it is contracted with the hyperplane element tangent to the 3-surface 
f?(A\ A 2 , A 3 ) of integration to form the integral 


/ S •= / (S ■ <W, § A § A §) dX‘ d» dX‘ 


= e„ <8> 


fs°, 


y p 

(3 


d(x x ,x fl , x v ) 
J( A 1 , A 2 , A 3 ) 


dX 1 d\ 2 dX 3 . 


Jacobian 

determinant 


Although constant basis vectors e a , W& derived from rectangular coordinates are 
essential here, a completely general parametrization of the hypersurface may be used . 
B. The proof of Gauss's Theorem is a computation: 



( e a , are constant) 




(Stokes Theorem) 


(see below) 



The missing computational step above is 


(merely notation) 


d(S a 0 y d 3 Z y ) = ( dS a 0 y/dx*>)dxf> A d 3 Z y 

= (ds a /3 v/dxy)*\. 

Here the first step uses d{d 3 Z y ) — 0 (which follows from £ m(iy = const in flat 
spacetime). The second step uses 


dx*> A d 3 Z y = «**!. 

[Write the lefthand side of this identity as e 7( ^ X | dx p A dx* A dx v A dx x . The only 
possible non-zero term in the sum over fiv\ is the one with p < v < X all different 
from p. The righthand side is the value of this term.] 

C. Glossary of notations . 

Charge density 3-form: 


*J = J*d 3 Z„ = J • d 3 Z 


= J\ e fiapy dx a A A dxV3! ; 


(V) ai8y 




Maxwell and Faraday 2-forms: 


= d% p ; 

F = \ F HV dx>1 A dx “- 



Basis 2-forms: 

dx a A dx&; (one way to label) 

d 2 S flv = e^ ja/3| dx a A dxP. (dual way to label) 

Energy-momentum density 3-form: 

T* d 3 Z = efT^ d 3 Z v = *7"; 

dual on last index, (*7')' i a(8y - T^e va ^ r 

Angular momentum density 3-form: 

A e J^ a d^S a = V; 

(V)^y = 
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Newtonian fluid characterized 
by \v>\ < 1 , p < p 


Stress-energy tensor and 
equation of motion for a 
Newtonian fluid 


Application of V • T = 0 to 
an electrically charged, 
vibrating rubber block 


(In the frame-independent equation ▼ • T = 0, one need not worry about which 
slot of T to take the divergence on; the slots are symmetric, so either can be used.) 

The equation ▼ • T = 0 is the differential formulation of the law of 4-momentum 
conservation. It is also called the equation of motion for stress-energy, because it places 
constraints on the dynamic evolution of the stress-energy tensor. To examine these 
constraints for simple systems is to realize the beauty and power of the equation 

▼ • r = o. 


§5.10. SAMPLE APPLICATIONS OF V - T = 0 


The equation of motion V • T = 0 makes contact with the classical (Newtonian) 
equations of hydrodynamics, when applied to a nearly Newtonian fluid. Such a fluid 
has low velocities relative to the Lorentz frame used, |tP| < 1; and in its rest frame 
its pressure is small compared to its density of mass-energy, p/p = p/pc 2 < 1. For 
example, the air in a hurricane has 


£ 

P 


\v*\ ~~ 100 km/hour — 3,000 cm/sec — 10 7 c = 10 7 < 

1 atmosphere 10 6 dynes/cm 2 _ cm 2 ^ iQ -12 c 2 _ 
10 -3 g/cm 3 10 -3 g/cm 3 sec 2 


1 , 

10- 12 < 1 . 


The stress-energy tensor for such a fluid has components 


r 00 = (p + p)u°u° - ^ p, 

(5.37a) 

T 0 ’ — T*° = (p + p)u°u j zz piP, 

(5.37 b) 

T )k = (p + p)u*u k + p 8* k zz pv*v k + p 8^ k ; 

(5.37c) 


and the equation of motion V * T — 0 has components 

T°° 0 + T° j • = dp/dt + V ■ (pv) = 0 

(5.38a) 

(“equation of continuity”); 


and 

P% + T’ k k = d(pv’)/dt + d(pv i v k )/dx k + dp/dx’ = 0, 
or, equivalently (by combining with the equation of continuity), 

— + (v V)v = —- Vp (“Euler’s equation”). (5.38b) 

dt p L 

Box 5.5 derives and discusses these results from the Newtonian viewpoint. 

As a second application of V • T = 0, consider a composite system: a block of 
rubber with electrically charged beads imbedded in it, interacting with an electro¬ 
magnetic field. The block of rubber vibrates, and its accelerating beads radiate 
electromagnetic waves; at the same time, incoming electromagnetic waves push on 
the beads, altering the pattern of vibration of the block of rubber. The interactions 
shove 4-momentum back and forth between beaded block and electromagnetic field. 
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Box 5.5 NEWTONIAN HYDRODYNAMICS REVIEWED 

Consider a classical, nonrelativistic, perfect fluid. Apply Newton’s law F = ma to 
a “fluid particle”; that is, to a small fixed mass of fluid followed in its progress 
through space: 


— (momentum per unit mass) = (force per unit mass) 


(force per unit volume) —(gradient of pressure) 


(density) 


(density) 


or 


dv 

dt 



( 1 ) 


Translate from time-rate of change following the fluid to time-rate of change as 
measured at a fixed location, finding 



(Latin indices run from 1 to 3; summation convention; upper and lower indices used 
indifferently for space dimensions in flat space!) This is Euler's fundamental equation 
for the hydrodynamics of a perfect fluid. 

Two further equations are needed to complete the description of a perfect fluid. 
One states the absence of heat transfer by requiring that the specific entropy (entropy 
per unit mass) be constant for each fluid “particle”: 

f = 0, or |+(rV) S = 0. (3) 

The final equation expresses the conservation of mass: 

~ + V • (pp) = 0, (4) 



dp 

dt 


+ ( P v k),k = 
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Box 5.5 (continued) 


it is analagous in every way to the equation that expresses conservation of charge 
in electrodynamics and that bears the same name, “equation of continuity 

The Newtonian stress-energy tensor, like its relativistic counterpart, is linked to 
conservation of momentum and mass. Therefore examine the time-rate of change 
of the density of fluid momentum, pv i9 contained in a unit volume; thus, 

d(pVi)/dt = -(pViV k \ k -p ti . (5) 

Momentum flows into the little volume element on the left (“force equals time-rate 
of change of momentum”) and out on the right; similarly at the other faces. Therefore 
the righthand side of (5) must represent the divergence of this momentum flux: 

d(pv i )/ct= -T iKk . (6) 

Consequently, we take for the momentum flux itself 


T ik = T ik = 


pv,v k + S ik p . 

“convection” “push” 


For the momentum density, the Newtonian value is 

r 01 = p° = pu { . (8) 

With this notation, the equation for the time-rate of change of momentum becomes 

dPv/dx* 1 = 0; (9) 

and with T 00 = p, the equation of continuity reads 

3T°V3^ = 0. (10) 

In conclusion, these Newtonian considerations give a reasonable approximation to 
the relativistic stress-energy tensor: 


I (p + p)u°u° - p 


pv i v i + 


I (p + p^V 


(p + p)u°u’ 


(p + p)wV + S ij p I 
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The 4-momentum of neither block nor field is conserved; neither ▼ • 7J block) nor 
▼ • 7(em field) vanishes. But total 4-momentum must be conserved, so 

▼ • (W> + 7(emfield)) must vanish. (5.39) 

For a general electromagnetic field interacting with any source, ▼ • 7J emfield) has the 
form 

Afield),, = - (5-40) 


(This was derived in exercise 3.18 by combining = 0 with expression 5.22 for 
the electromagnetic stress-energy tensor, and with Maxwell’s equations.) For our 
beaded block, J is the 4-current associated with the vibrating, charged beads, and 
F is the electromagnetic field tensor. The time component of equation (5.40) reads 


rjem field),» = = ~ E J 

/rate at which electric field E does work 
— \on a unit volume of charged beads 


(5.41) 


For comparison, 7^° lock)f0 is the rate at which the block’s energy density changes 
with time, — 7^ lock) ^ is the contribution of the block’s energy flux to this rate of 
change of energy density, and consequently their difference has the meaning 

rate at which mass-energy of block per\ 
unit volume increases due to actions 
other than internal mechanical forces ’ $A2) 

between one part of block and another/ 

Hence, the conservation law 


r rOv _ 

1 (block), v 


(^em field) + ^Tblock)), v — ® 

says that the mass-energy of the block increases at precisely the same rate as the 
electric field does work on the beads. A similar result holds for momentum: 


^femfield), v^k = = ~(J°E + J X B) 

( Lorentz force per unit volume 
acting on beads 


( rate at which momentum per unit volume 
of block increases due to actions 
other than its own stresses 


(5.43) 


(5.44) 


so the conservation law 


(^tem field) + 7&ek»).r = 0 

says that the rate of change of the momentum of the block equals the force of the 
electromagnetic field on its beads. 
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Angular momentum defined 
and its integral conservation 
law derived 


EXERCISES 


§5.11. ANGULAR MOMENTUM 

The symmetry. T^ v — T Vfi , of the stress-energy tensor enables one to define a 
conserved angular momentum J*& 9 analogous to the linear momentum p a . The 
angular momentum is defined relative to a specific but arbitrary origin—an event 
3 with coordinates, in a particular Lorentz frame, 

x a {3) = a a . (5.45) 

The angular momentum about 3 is defined using the tensor 

= ( x « _ a *)Tfiy _ ( X /J _ a P)T ay . (5.46) 

(Note that x a — a a is the vector separation of the “field point” jt a from the “origin” 
3\ T ay is here evaluated at the “field point”.) Because of the symmetry of T, f 
has vanishing divergence: 

r py t y = 8«yT0y + (X* - a a )T^\j - 80 y T«y - (x* - a^)T^ %y 

0 0 

_ ppa _ pap — Q ( 5 . 47 ) 

Consequently, its integral over any closed 3-surface vanishes 

f }*y&2 y = 0 (5.48) 

J d‘V 

(“integral form of the law of conservation of angular momentum”). 

The integral over a spacelike surface of constant time t is 

jap = f 0 dx d yfa- J _ aayjpo _ _ a /») J«0] dx dy fa ( 5 . 49 ) 

Recalling that T&° is momentum density, one sees that (5.49) has the same form 
as the equation 7 = rX p” of Newtonian theory. Hence the name “total angular 
momentum” for J a &. Various aspects of this conserved angular momentum, including 
the tie to its Newtonian cousin, are explored in Box 5.6. 


Exercise 5.2. CHARGE CONSERVATION 

Exercise 3.16 revealed that the charge-current 4-vector J satisfies the differential conservation 
law ▼ • J = 0. Write down the corresponding integral conservation law, and interpret it for 
the four closed surfaces of Fig. 5.3. 

Exercise 5.3. PARTICLE PRODUCTION 

Inside highly evolved, massive stars, the temperature is so high that electron-positron pairs 
are continually produced and destroyed. Let S be the number-flux vector for electrons and 
positrons, and denote its divergence by 


e = V • S. 


(5.50) 



Box 5.6 ANGULAR MOMENTUM 


A. Definition of Angular Momentum 

(a) Pick an arbitrary spacelike hypersurface 5 and an arbitrary event d with 
coordinates x a {d) = a a . (Use globally inertial coordinates throughout.) 

(b) Define “total angular momentum on S about d” to be 





jilv J d x dy dz . 


B. Conservation of Angular Momentum 


(a) T** v — 0 implies ^ V0L A = 0. 

(b) This means that J^ is independent of the hypersurface S on which it is calculated 
(Gauss’s theorem): 


J^(S A ) - J^(S B ) 


f f* v <* d 3 I 
•'ar 


a 


f /""“.a d * X = 

Jctr 



(Note: dT = (boundary of T) includes S A , S B , and timelike surfaces at spatial 
infinity; contribution of latter dropped—localized source.) 


C. Change of Point About Which Angular Momentum is Calculated 

Let b a be vector from d 0 to dp. b a = af — af. Then 
/'"‘'(about d ^ — /'""(about <f 0 ) 

= f T va d 3 S a + b v f d 3 1 a 

J s J s 

= -b»P v + b v P*, 
where is total 4-momentum. 




Box 5.6 (continued) 




D. Intrinsic Angular Momentum 

(a) Work, for a moment, in the system’s rest frame, where 

P° = A/, Pi = 0, x C] J = ~~ f x^T 00 d 3 x = location of center of mass. 

M J 

Intrinsic angular momentum is defined as angular momentum about any event 
{a°,x C] J) on center of mass’s world line. Its components are denoted S ^ and work 
out to be 


S°> = 0, S jk = €’ ki S l , 


where 

S = J(x~ x CM ) x (momentum density) d s x 
EE “intrinsic angular momentum vector.” 

(b) Define “intrinsic angular momentum 4-vector” to be that 4-vector whose 
components in the rest frame are (0, S'); then the above equations say 

S* 9 = U a S^\ 

Up ~ Pp/M = 4-velocity of center of mass, 

UpS* = 0 . 


E. Decomposition of Angular Momentum into Intrinsic 
and Orbital Parts 

(a) Pick an arbitrary event d, whose perpendicular displacement from center- 
of-mass world line is — Y a , so 
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(b) Then, by Part C, the angular momentum about d is 

ju-v — UaS ^ + - Y^P*. 


S"" (intrinsic) Lf" (orbital) 

(c) Knowing the angular momentum about d, and the 4-momentum (and hence 
4-velocity), one can calculate the vector from d to the center-of-mass world line, 

Y* = -J»’P,/M 2 , 

and the intrinsic angular momentum 

S = —U a J^t 

2 l apfp- 

V___ J 


Use Gauss’s theorem to show that e is the number of particles created (minus the number 
destroyed) in a unit four-dimensional volume of spacetime. 

Exercise 5.4. INERTIAL MASS PER UNIT VOLUME 

Consider a stressed medium in motion with ordinary velocity |v| < 1 with respect to a specific 
Lorentz frame. 

(a) Show by Lorentz transformations that the spatial components of the momentum density 
are 


T 0 ’ = ^ m )k v k , (5.51) 

k 

where 

m jk - T oo 8 jk _j_ T Jk (5.52) 

and 7*' are the components of the stress-energy tensor in the rest frame of the medium. 
Throughout the solar system 7™ > \T&\ (see, e.g., discussion of hurricane in §5.10), so one 
is accustomed to write T 0 ’ — T m v*, i.e., “(momentum density) = (rest-mass density) x (ve¬ 
locity)”. But inside a neutron star may be of the same order of magnitude as T&, so 
one must replace “(momentum density) = (rest-mass density) X (velocity)” by equations 
(5,51) and (5.52), at low velocities. 

(b) Derive equations (5.51) and (5.52) from Newtonian considerations plus the equivalence 
of mass and energy. (Hint: the total mass-energy carried past the observer by a volume V 
of the medium includes both the rest mass T^V and the work done by forces acting across 
the volume’s faces as they “push” the volume through a distance.) 

(c) As a result of relation (5.51), the force per unit volume required to produce an 
acceleration dv k /dt in a stressed medium, which is at rest with respect to the man who applies 
the force, is 


F* = dT°i/dt = ^ m )k dv k /dt. 

k 


(5.53) 
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This equation suggests that one call m jk the “inertial mass per unit volume” of a stressed 
medium at rest. In general m jk is a symmetric 3-tensor. What does it become for the special 
case of a perfect fluid? 

(d) Consider an isolated, stressed body at rest and in equilibrium (T afi 0 = 0) in the 
laboratory frame. Show that its total inertial mass, defined by 

= j m'i dx dy dz , (5.54) 

stressed 

body 


is isotropic and 


Exercise 5.5. 

(a) Write out 


equals the rest mass of the body 

A/h — J* T 00 dx dy dz. 

DETERMINANTS AND JACOBIANS 

explicitly the sum defining d 2 S 01 in 

, 2 c _ 0JC a dx& j jl 

d^^^ — ^dadb. 


(5.55) 


Thereby establish the formula 




9 (x a ,x p ) 

S{a^j 


dadb = 


1 8(x a ,x0) 

d(a,b) 


dadb. 


(Expressions such as these should occur only under integral signs. In this exercise one may 
either supply an f... wherever necessary, or else interpret the differentials in terms of the 
exterior calculus, dadb —*- da A db; see Box 5.4.) The notation used here for Jacobian 
determinants is 

MH = 

d(a, b) 


8 f 3 / 


da 

db 

dg 


da 

db 


(b) By a similar inspection of a specific case, show that 


d 3 !., 




dx a dxP dx y . JL , 1 d(x a , xP, x y ) 

1 5'lTir‘ fc ** a, 3F < ^ d(a,b,c) 


da db dc . 


(c) Cite a precise definition of the value of a determinant as a sum of terms (with suitably 
alternating signs), with each term a product containing one factor from each row and 
simultaneously one factor from each column. Show that this definition can be stated (in the 
4x4 case, with the p X p case an obvious extension) as 

det.4 = det ll/l x p ll = ( afiyS A a 0 A fi 1 A y 2 A* 3 . 


(d) Show that 


det 


(for a definition of Sffi y a s , see exercises 3.13 and 4.12). 
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(e) Use properties of the 6-symbol to show that the matrix A ~ 1 inverse to A has entries 
( A~ l y a given by 


(^ _1 ) M a(det A) 

(f) By an “index-mechanics” computation, from the formula for det A in part (d) derive 
the following expression for the derivative of the logarithm of the determinant 

</ln|deM| = traced' 1 dA). 

Here dA is the matrix WdA*J\ whose entries are 1-forms. 


Exercise 5.6. CENTROIDS AND SIZES 

Consider an isolated system with stress-energy tensor T^, total 4-momentum P a , magnitude 
of 4-momentum M — ( —P*P) 1/2 , intrinsic angular momentum tensor S a P, and intrinsic 
angular momentum vector S a . (See Box 5.6.) An observer with 4-velocity u a defines the 
centroid of the system, at his Lorentz time x° = t and in his own Lorentz frame, by 

X’Jf) — (1 /P°) f x i T 00 d 3 x in Lorentz frame where u = 0^/0jc°. (5.56) 

This centroid depends on (i) the particular system being studied, (ii) the 4-velocity u of 
the observer, and (iii) the time t at which the system is observed. 

(a) Show that the centroid moves with a uniform velocity 

dX’Jdt = Pi/P°, (5.57) 


corresponding to the 4-velocity 


U = P/M. (5.57') 

Note that this “4-velocity of centroid” is independent of the 4-velocity u used in defining 
the centroid. 

(b) The centroid associated with the rest frame of the system (i.e., the centroid defined 
with u = U) is called the center of mass; see Box 5.6. Let f u be a vector reaching from 
any event on the center-of-mass world line to any event on the world line of the centroid 
associated with 4-velocity u, thus the components of ( u in any coordinate system are 

= x u- (5.58) 


Show that ( u satisfies the equation 


- P a %) - = 0. (5.59) 


[Hint; perform the calculation in a Lorentz frame where u = 0^/0*°.] 

(c) Show that, as seen in the rest-frame of the system at any given moment of time, the 
above equation reduces to the three-dimensional Euclidean equation 

— (v X S)/M y (5.59') 

where v = u/u° is the ordinary velocity of the frame associated with the centroid. 

(d) Assume that the energy density measured by any observer anywhere in spacetime is 



162 


5. STRESS-ENERGY TENSOR AND CONSERVATION LAWS 


non-negative ( u-T-u > 0 for all timelike u). In the rest frame of the system, construct 
the smallest possible cylinder that is parallel to S' and that contains the entire system (T a p = 0 
everywhere outside the cylinder). Show that the radius r 0 of this cylinder is limited by 

r 0 > \S\/M. _ (5.60) 

Thus, a system with given intrinsic angular momentum S and given mass M has a minimum 
possible size r 0min = \S\/M as measured in its rest frame. 



CHAPTER 


■\ 


ACCELERATED OBSERVERS 

The objective world simply is; it does not happen. Only to 
the gaze of my consciousness, crawling upward along the life 
line [world line] of my body, does a section of this world 
come to life as a fleeting image in space 
which continuously changes in time. 

HERMAN WEYL (1949, p. 116) 




§6.1. ACCELERATED OBSERVERS CAN BE ANALYZED 
USING SPECIAL RELATIVITY 


It helps in analyzing gravitation to consider a situation where gravity is mocked 
up by acceleration. Focus attention on a region so far from any attracting matter, 
and so free of disturbance, that (to some proposed degree of precision) spacetime 
there can be considered to be flat and to have Lorentz geometry. Let the observer 
acquire the feeling that he is subject to gravity, either because of jet rockets strapped 
to his legs or because he is in a rocket-driven spaceship. How does physics look 
to him? 

Dare one answer this question? At this early stage in the book, is one not too 
ignorant of gravitation physics to predict what physical effects will be measured by 
an observer who thinks he is in a gravitational field, although he is really in an 
accelerated spaceship? Quite the contrary; special relativity was developed precisely 
to predict the physics of accelerated objects—e.g., the radiation from an accelerated 
charge. Even the fantastic accelerations 


Accelerated motion and 
accelerated observers can be 
analyzed using special 
relativity 


^nuclear ~~ 1)2 / r ~~ 10 31 cm/sec 2 — 10 28 “earth gravities” 


suffered by a neutron bound in a nucleus, and the even greater accelerations met 
in high-energy particle-scattering events, are routinely and accurately treated within 


Box 6.1 GENERAL RELATIVITY IS BUILT ON SPECIAL RELATIVITY 




A tourist in a powered interplanetary rocket feels 
“gravity.” Can a physicist by local effects convince 
him that this “gravity” is bogus? Never, says Ein¬ 
stein’s principle of the local equivalence of gravity 
and accelerations. But then the physicist will make 
no errors if he deludes himself into treating true 
gravity as a local illusion caused by acceleration. 
Under this delusion, he barges ahead and solves 
gravitational problems by using special relativity: 
if he is clever enough to divide every problem into 
a network of local questions, each solvable under 
such a delusion, then he can work out all influ- 

___ 


ences of any gravitational field. Only three basic 
principles are invoked: special-relativity physics, 
the equivalence principle, and the local nature of 
physics. They are simple and clear. To apply them, 
however, imposes a double task: (1) take space- 
time apart into locally flat pieces (where the prin¬ 
ciples are valid), and (2) put these pieces together 
again into a comprehensible picture. To undertake 
this dissection and reconstitution, to see curved 
dynamic spacetime inescapably take form, and to 
see the consequences for physics: that is general 
relativity. 


the framework of special relativity. The theoretician who confidently applies special 
relativity to antiproton annihilations and strange-particle resonances is not about 
to be frightened off by the mere illusions of a rocket passenger who gullibly believed 
the travel brochures advertising “earth gravity all the way.” When spacetime is flat, 
move however one will, special relativity can handle the job. (It can handle bigger 
jobs too; see Box 6.1.) The essential features of how special relativity handles the 
job are summarized in Box 6.2 for the benefit of the Track-1 reader, who can skip 
the rest of the chapter, and also for the benefit of the Track-2 reader, who will find 
it useful background for the rest of the chapter. 


( 

Box 6.2 ACCELERATED OBSERVERS IN BRIEF 


'\ 


An accelerated observer can carry clocks and measuring rods with him, and can 
use them to set up a reference frame (coordinate system) in his neighborhood. 

His clocks, if carefully chosen so their structures are affected negligibly by acceler¬ 
ation (e.g., atomic clocks), will tick at the same rate as unaccelerated clocks moving 
momentarily along with him: 


At i 


time interval ticked off \ 
by observer’s clocks as he 
| moves a vector displacement 
f along his world line / 


= [-*{, f)] 1/2 - 


And his rods, if chosen to be sufficiently rigid, will measure the same lengths as 
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momentarily comoving, unaccelerated rods do. (For further discussion, see § 16.4, 
and Boxes 16.2 to 16.4.) 

Let the observer’s coordinate system be a Cartesian latticework of rods and clocks, 
with the origin of the lattice always on his world line. He must keep his latticework 
small: 

^ — /spatial dimensions^ ^ / the acceleration measured \ -1 _ 1 

V of lattice / \by accelerometers he carries/ g' 

At distances / away from his world line, strange things of dimensionless magnitude 
gl happen to his lattice—e.g., the acceleration measured by accelerometers differs 
from g by a fractional amount ~~gl (exercise 6.7); also, clocks initially synchronized 
with the clock on his world line get out of step (tick at different rates) by a fractional 
amount ~gl (exercise 6.6). (Note that an acceleration of one “earth gravity” corre¬ 
sponds to 

g -1 ~ 10~ 3 sec 2 /cm ~ 10 18 cm ~ 1 light-year, 

so the restriction / 1 /g is normally not severe.) 

To deduce the results of experiments and observations performed by an accelerated 
observer, one can analyze them in coordinate-independent, geometric terms, and 
then project the results onto the basis vectors of his accelerated frame. Alternatively, 
one can analyze the experiments and observations in a Lorentz frame, and then 
transform to the accelerated frame. 

As deduced in this manner, the results of experiments performed locally (at 
l < 1 /g) by an accelerated observer differ from the results of the same experiments 
performed in a Lorentz frame in only three ways: 

(1) There are complicated fractional differences of order gl < 1 mentioned above, 
that can be made negligible by making the accelerated frame small enough. 

(2) There are Coriolis forces of precisely the same type as are encountered in 
Newtonian theory (exercise 6.8). These the observer can get rid of by carefully 
preventing his latticework from rotating—e.g., by tying it to gyroscopes that 
he accelerates with himself by means of forces applied to their centers of 
mass (no torque!). Such a nonrotating latticework has “Fermi-Walker trans¬ 
ported” basis vectors (§6.5), 

de a . 

~J T ~ = "( a ’ <V) ~ a (“ ’ a *')> (1) 

where u = 4-velocity, and a = du/di = 4-acceleration. 

(3) There are inertial forces of precisely the same type as are encountered in 
Newtonian theory (exercise 6.8). These are due to the observer’s acceleration, 
and he cannot get rid of them except by stopping his accelerating. 
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-\ 

The rest of this chapter is 
Track 2. 

It depends on no preceding 
Track-2 material. 

It is needed as preparation 
for 

(1) the mathematical 
analysis of gyroscopes 
in curved spacetime 
(exercise 19.2, §40.7), 
and 

(2) the mathematical 
theory of the proper 
reference frame of an 
accelerated observer 
(§13.6). 

It will be helpful in many 
applications of gravitation 
theory (Chapters IB-40). 

__ J 


Uniformly accelerated 
observer moves on hyperbola 
in spacetime diagram 


§6.2. HYPERBOLIC MOTION 

Study a rocket passenger who feels “gravity” because he is being accelerated in fiat 
spacetime. Begin by describing his motion relative to an inertial reference frame. 
His 4-velocity satisfies the condition ur — — 1. To say that it is fixed in magnitude 
is to say that the 4-acceleration, 


a — du/dj , 


( 6 . 1 ) 


is orthogonal to the 4-velocity: 

0 = {d/dj){-\/2) = ( d/dj ) |yic(/J = 3'u. 


( 6 . 2 ) 


This equation implies that a 0 = 0 in the rest frame of the passenger (that Lorentz 
frame, where, at the instant in question, u = e 0 ); in this frame the space components 
of av reduce to the ordinary definition of acceleration, a { — d 2 x i /dt 2 . From the 
components a M = (0; a *) in the rest frame, then, one sees that the magnitude of the 
acceleration in the rest frame can be computed as the simple invariant 


o 

3" 


= = ( drx/dt 2 ) 2 


as measured in rest frame* 


Consider, for simplicity, an observer who feels always a constant acceleration g . 
Take the acceleration to be in the x 1 direction of some inertial frame, and take 
x 2 = x 3 = 0. The equations for the motion of the observer in that inertial frame 
become 



o 





(6.3) 


Write out the three algebraic equations 

= - 1 , 

= — u°a° + u l a l = 0, 


and 


= g 2 - 

Solve for the acceleration, finding 


o du° i 

a =17 = gu ’ 


i du 1 o 
a 1 = — = gu°. 


(6.4) 


These linear differential equations can be solved immediately. The solution, with 
a suitable choice of the origin, reads 

t — g - 1 sinhgT, x ~ g~ l coshgr. (6.5) 

Note that x 2 ~ t 2 — g~ 2 . The world line is a hyperbola in a spacetime diagram 
(“hyperbolic motion”; Figure 6.1). Several interesting aspects of this motion are 
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Figure 6.1. 

Hyperbolic motion. World line of an object that (or an ob¬ 
server who) experiences always a fixed acceleration g with 
respect to an inertial frame that is instantaneously comoving 
(different inertial frames at different instants!). The 4-acceler¬ 
ation a is everywhere orthogonal (Lorentz geometry!) to the 
4-velocity u. 


treated in the exercises. Let the magnitude of the constant acceleration g be the 
acceleration of gravity, g = 980 cm/sec 2 experienced on earth: g ~ (10 3 cm/sec 2 )/ 
(3 X 10 10 cm/sec) 2 = (3 x 10 7 sec • 3 x 10 10 cm/sec) -1 = (1 light-year) -1 . Thus the 
observer will attain relativistic velocities after maintaining this acceleration for 
something like one year of his own proper time. He can outrun a photon if he has 
a head start on it of one light-year or more. 


Exercise 6.1. A TRIP TO THE GALACTIC NUCLEUS EXERCISES 

Compute the proper time required for the occupants of a rocket ship to travel the —30,000 
light-years from the Earth to the center of the Galaxy. Assume that they maintain an 
acceleration of one “earth gravity” (10 3 cm/sec 2 ) for half the trip, and then decelerate at 
one earth gravity for the remaining half. 

Exercise 6.2. ROCKET PAYLOAD 

What fraction of the initial mass of the rocket can be payload for the journey considered 
in exercise 6.1? Assume an ideal rocket that converts rest mass into radiation and ejects all 
the radiation out the back of the rocket with 100 per cent efficiency and perfect collimation. 

Exercise 6.3. TWIN PARADOX 

(a) Show that, of all timelike world lines connecting two events (I and fri, the one with the 
longest lapse of proper time is the unaccelerated one. (Hint: perform the calculation in the 
inertial frame of the unaccelerated world line.) 

(b) One twin chooses to move from (I tofr? along the unaccelerated world .line. Show that 
the other twin, by an appropriate choice of accelerations, can get from (f to in arbitrarily 
small proper time. 

(c) If the second twin prefers to ride in comfort, with the acceleration he feels never 
exceeding one earth gravity, g, what is the shortest proper time-lapse he can achieve between 
ct and #? Express the answer in terms of g and the proper time-lapse Jt measured by the 
unaccelerated twin. 

(d) Evaluate the answer numerically for several interesting trips. 
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Exercise 6.4. RADAR SPEED INDICATOR 

A radar set measures velocity by emitting a signal at a standard frequency and comparing 
it with the frequency of the signal reflected back by another object. This redshift measurement 
is then converted, using the standard special-relativistic formula, into the corresponding 
velocity, and the radar reads out this velocity. How useful is this radar set as a velocity-mea¬ 
suring instrument for a uniformly, accelerated observer? 

(a) Consider this problem first for the special case where the object and the radar set are 
at rest with respect to each other at the instant the radar pulse is reflected. Compute the 
redshift 1 + 2 = to c /to 0 that the radar set measures in this case, and the resulting (incorrect) 
velocity it infers. Simplify by making use of the symmetries of the situation. 

(b) Now consider the situation where the object has a non-zero velocity in the momentary 
rest frame of the observer at the instant it reflects the radar pulse. Compute the ratio of 
the actual relative velocity to the velocity read out by the radar set. 

Exercise 6.5. RADAR DISTANCE INDICATOR 

Use radar as a distance-measuring device. The radar set measures its proper time r between 
the instant at which it emits a pulse and the later instant when it receives the reflected pulse. 
It then performs the simple computation L 0 = r/2 and supplies as output the “distance” 
L 0 . How accurate is the output reading of the radar set for measuring the actual distance 
L to the object, when used by a uniformly accelerated observer? (L is defined as the distance 
in the momentary rest frame of the observer at the instant the pulse is reflected, which is 
at the observer’s proper time halfway between emitting and receiving the pulse.) Give a 
correct formula relating L 0 = r/2 to the actual distance L. Show that the reading L 0 becomes 
infinite as L approaches g -1 , where g is the observer’s acceleration, as measured by an 
accelerometer he carries. 


§6.3. CONSTRAINTS ON SIZE OF AN ACCELERATED FRAME 


Difficulties in constructing 
"the coordinate system of an 
accelerated observer": 


Breakdown in communication 
between observer and events 
at distance 
/ > (acceleration) -1 


It is very easy to put together the words “the coordinate system of an accelerated 
observer,” but it is much harder to find a concept these words might refer to. The 
most useful first remark one can make about these words is that, if taken seriously, 
they are self-contradictory. The definite article “the” in this phrase suggests that 
one is thinking of some unique coordinate system naturally associated with some 
specified accelerated observer, such as one whose world line is given in equation 
(6.5). If the coordinate system is indeed natural, one would expect that the coordi¬ 
nates of any event could be determined by a sufficiently ingenious observer by sending 
and receiving light signals. But from Figure 6.1 it is clear that the events composing 
one quarter of all spacetime (Zone III) can neither send light signals to, nor receive 
light signals from, the specified observer. Another half of spacetime suffers lesser 
disabilities in this respect: Zone II cannot send to the observer, Zone IV cannot 
receive from him. It is hard to see how the observer could define in any natural 
way a coordinate system covering events with which he has no causal relationship, 
which he cannot see, and from which he cannot be seen! 

Difficulties also occur when one considers an observer who begins at rest in one 
frame, is accelerated for a time, and maintains thereafter a constant velocity, at rest 
in some other inertial coordinate system. Do his motions define in any natural way 
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x° = 0 


Figure 6.2. 

World line of an observer who has undergone a brief period of 
acceleration. In each phase of motion at constant velocity, an inertial 
coordinate system can be set up. However, there is no way to reconcile 
these discordant coordinates in the region of overlap (beginning at 
distance g -1 to the left of the region of acceleration). 


a coordinate system? Then this coordinate system (1) should be the inertial frame 
x* 1 in which he was at rest for times x° less than 0, and (2) should be the other 
inertial frame x* for times x 0 ' > T in which he was at rest in that other frame. 
Evidently some further thinking would be required to decide how to define the 
coordinates in the regions not determined by these two conditions (Figure 6.2). More 
serious, however, is the fact that these two conditions are inconsistent for a region 
of spacetime that satisfies simultaneously x° < 0 and x 0 ' > T'. In both examples 
of accelerated motion (Figures 6.1 and 6.2), the serious difficulties about defining 
a coordinate system begin only at a finite distance g~ x from the world line of the 
accelerated observer. The problem evidently has no solution for distances from the 
world fine greater than g _1 . It does possess a natural solution in the immediate 
vicinity of the observer. This solution goes under the name of “Fermi-Walker 
transported orthonormal tetrad.” The essential idea lends itself to simple illustration 
for hyperbolic motion, as follows. 


Natural coordinates 
inconsistent at distance 
/ > (acceleration) -1 


§6.4. THE TETRAD CARRIED BY A UNIFORMLY 
ACCELERATED OBSERVER 

An infinitesimal version of a coordinate system is supplied by a “tetrad,” or “moving Orthonormal tetrad of basis 
frame” (Cartan’s “repere mobile”), or set of basis vectors e 0 ,, e v , e 2 ,, e 3 , (subscript vectors carried by uniformly 
tells which vector, not which component of one vector!) Let the time axis be the accelerated observer 
time axis of a comoving inertial frame in which the observer is momentarily at rest. 

Thus the zeroth basis vector is identical with his 4-velocity: e 0 , = u. The space axes 
e 2 and e 3 are not affected by Lorentz transformations in the 1-direction. Therefore 
take e<y and e 3 , to be the unit basis vectors of the all-encompassing Lorentz frame 
relative to which the hyperbolic motion of the observer has already been described 
in equations (6.5): e 2 > — e 2 ; e 3 , = e 3 . The remaining basis vector, e r , orthogonal 
to the other three, is parallel to the acceleration vector, e r = g~ 1 a [see equation 
(6.4)]. There is a more satisfactory way to characterize this moving frame: the time 
axis e# is the observer’s 4-velocity, so he is always at rest in this frame; and the 
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other three vectors e r are chosen in such a way as to be (1) orthogonal and (2) 
nonrotating. These basis vectors are: 

{e 0 >y = (cosh gr; sinh gr, 0, 0); 

(e r Y = (sinh gr ; cosh gr , 0, 0); 

(e 2 ,y = (0; 0, 1, 0); 

(e 3 r = (0; 0 , 0 , 1 ). (6.6) 

There is a simple prescription to obtain these four basis vectors. Take the four basis 
vectors e 0 , e v e 2 , e 3 of the original global Lorentz reference frame, and apply to 
them a simple boost in the 1-direction, of such a magnitude that e 0 * comes into 
coincidence with the 4-velocity of the observer. The fact that these vectors are all 
orthogonal to each other and of unit magnitude is formally stated by the equation 

* e v’ = V*'* ( 6 - 7 ) 


§6.5. THE TETRAD FERMI-WALKER TRANSPORTED BY 
AN OBSERVER WITH ARBITRARY ACCELERATION 


Orthonormal tetrad of 
arbitrarily accelerated 
observer: should be 
"nonrotating'' 


"Nonrotating" means 
rotation only in timelike 
plane of 4-velocity and 
4-acceleration 


Mathematics of rotation in 
3-space 


Turn now from an observer, or an object, in hyperbolic motion to one whose 
acceleration, always finite, varies arbitrarily with time. Here also we impose three 
criteria on the moving, infinitesimal reference frame, or tetrad: (1) the basis vectors 
of the tetrad must remain orthonormal [equation (6.7)]; (2) the basis vectors must 
form a rest frame for the observer at each instant (e 0 , = u); and (3) the tetrad should 
be “nonrotating.” 

This last criterion requires discussion. The basis vectors of the tetrad at any proper 
time r must be related to the basis vectors e 0 , e v e 2 , e 3 of some given inertial frame 
by a Lorentz transformation e^(r) = A p fi ,(r)e v . Therefore the basis vectors at two 
successive instants must also be related to each other by a Lorentz transformation. 
But a Lorentz transformation can be thought of as a “rotation” in spacetime. The 
4-velocity t/, always of unit magnitude, changes in direction. The very concept of 
acceleration therefore implies “rotation” of velocity 4-vector. How then is the re¬ 
quirement of “no rotation” to be interpreted? Demand that the tetrad e^(r) change 
from instant to instant by precisely that amount implied by the rate of change of 
u = e 0 ,, and by no additional arbitrary rotation. In other words, (1) accept the 
inevitable pseudorotation in the timelike plane defined by the velocity 4-vector and 
the acceleration, but (2) rule out any ordinary rotation of the three space vectors. 

Nonrelativistic physics describes the rotation of a vector (components u 4 ) by an 
instantaneous angular velocity vector (components a^). This angular velocity appears 
in the formula for the rate of change of v, 

(dv^dt) = (<o X v\ = e m a $ v k . (6.8) 


For the extension to four-dimensional spacetime, it is helpful to think of the rotation 
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as occurring in the plane perpendicular to the angular velocity vector <o. Thus rewrite 
(6.8) as 


dvjdt = -Q ik v k , (6.9) 

where 

®jk — ~®kj — ^% e ijk ( 6 . 10 ) 

has non-zero components only in the plane of the rotation. In other words, to speak 
of'‘a rotation in the (1, 2)-plane” is more useful than to speak of a rotation about 
the 3-axis. The concept of “plane of rotation” carries over to four dimensions. There 
a rotation in the (l,2)-plane will leave constant not only the u 3 but also the t; 0 
component of the velocity. The four-dimensional definition of a rotation is 

dv^ 

-j- = with Qt* = (6.11) 

To test the appropriateness of this definition of a generalized rotation or infinitesimal 
Lorentz transformation, verify that it leaves invariant the length of the 4-vector: 

d{v^)/dr = Iv^dv^/dr) = = 0 . ( 6 . 12 ) 

The last expression vanishes because £2^ is antisymmetric, whereas v^v v is symme¬ 
tric. Note also that the antisymmetric tensor Q* v (“rotation matrix”; “infinitesimal 
Lorentz transformation”) has 4 X 3/2 = 6 independent components. This number 
agrees with the number of components in a finite Lorentz transformation (three 
parameters for rotations, plus three parameters for the components of a boost). The 
“infinitesimal Lorentz transformation” here must (1) generate the appropriate Lor¬ 
entz transformation in the timelike plane spanned by the 4-velocity and the 4-accel- 
eration, and (2) exclude a rotation in any other plane, in particular, in any spacelike 
plane. The unique answer to these requirements is 

Q* 9 = a*u v - a v u*\ i.e., O = a A u. (6.13) 

Apply this rotation to a spacelike vector w orthogonal to u and a, (u • w = 0 and 
a • w = 0). Immediately compute Qf LV w p — 0. Thus verify the absence of any space 
rotation. Now check the over-all normalization of in equation (6.13). Apply the 
infinitesimal Lorentz transformation to the velocity 4-vector u of the observer. Thus 
insert v M = a M in (6.11). It then reads 

du^/dr = a* = u^a 1 ' u p ) — a^(u v ii v ) — a* 1 . 

This result is an identity, since u* u = — 1 and u • a =0. 

A vector v that undergoes the indicated infinitesimal Lorentz transformation, 

dv^/dr = (uW - u'a*)v 9 , (6.14) 

is said to experience “Fermi-Walker transport” along the world line of the observer. 


Mathematics of rotation in 
spacetime 


Fermi-Walker law of transport 
for "nonrotating" tetrad of 
basis vectors carried by an 
accelerated observer 
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Figure 6.3. 

Construction of spacelike hyperplanes (dashed) orthogonal to 
the world line (heavy line) of an accelerated particle at selected 
moments along that world line. Note crossing of hyperplanes 
at distance g _1 (T) (time-dependent acceleration!) from the 
world line. 


The natural moving frame associated with an accelerated observer consists of four 
orthonormal vectors, each of which is Fermi- Walker transported along the world line 
and one of which is e 0 . = u (the 4-velocity of the observer). Fermi-Walker transport 
of the space basis vectors e- can be achieved in practice by attaching them to 
gyroscopes (see Box 6.2 and exercise 6.9). 


§6.6. THE LOCAL COORDINATE SYSTEM OF 
AN ACCELERATED OBSERVER 


Tetrad used to construct 
"local coordinate system of 
accelerated observer" 


Extend this moving frame or “infinitesimal coordinate system” to a “local coordinate 
system” covering a finite domain. Such local coordinates can escape none of the 
problems encountered in “hyperbolic motion” (Figure 6.1) and “briefly accelerated 
motion” (Figure 6.2). Therefore the local coordinate system has to be restricted to 
a region within a distance g~ l of the observer, where these problems do not arise. 
Figure 6.3 illustrates the construction of the local coordinates At any given proper 
time r the observer sits at a specific event &(t) along his world line. Let the displace¬ 
ment vector, from the origin of the original inertial frame to his position ^(r), be 
z(t). At ^(t) the observer has three spacelike basis vectors e r (r), e 2 '(r), e 3'( r )- The 
point £P(t) plus those basis vectors define a spacelike hyperplane. The typical point 
of this hyperplane can be represented in the form 


x = P'e^T) + l 2 'e 2 .(r) + l 3 'e 3 -(r) + z(r) 

= (separation vector from origin of original inertial frame). 


(6.15) 


Here the three numbers play the role of Euclidean coordinates in the hyperplane. 
This hyperplane advances as proper time unrolls. Eventually the hyperplane cuts 
through the event ?P Q to which it is desired to assign coordinates. Assign to this event 
as coordinates the numbers = r, given by (6.75). Call these four numbers 
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“coordinates relative to the accelerated observer.” In detail, the prescription for the 
determination of these four coordinates consists of the four equations 



x" = F'KF)]" + 2 »( t), 


(6.16) 


in which the x“ are considered as known, and the coordinates r, £* are considered 
unknowns. 

At a certain distance from the accelerated world line, successive spacelike hyper¬ 
planes, instead of advancing with increasing r, will be retrogressing. At this distance, 
and at greater distances, the concept of “coordinates relative to the accelerated 
observer” becomes ambiguous and has to be abandoned. To evaluate this distance, 
note that any sufficiently short section of the world line can be approximated by 
a hyperbola (“hyperbolic motion with acceleration g”), where the time-dependent 
acceleration g(r) is given by the equation g 2 = a^a^. 

Apply the above general prescription to hyperbolic motion, arriving at the equa¬ 
tions 

*° = (r 1 + 4 r )sinh(gr y ), 

x 1 = (g -1 + £ r )cosh(g£°'), 

* 2 = F, 

x 3 = F- (6.17) 


Local coordinate system for 
uniformly accelerated 
observer 


The surfaces of constant are the hyperplanes with x°/x l = tanh gF sketched in 
Figure 6.4. Substitute expressions (6.17) into the Minkowski formula for the line 
element to find 


ds 2 = dx* dx v 

= -(1 + gF'm 0 ') 2 + (4 r ) 2 + (4 2 ') 2 + (4 3 ') 2 . (6.18) 



Figure 6.4. 

Local coordinate system associated with an ob¬ 
server in hyperbolic motion (heavy black world 
line). The local coordinate system fails for less 
than — g~ 1 . 
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EXERCISES 


The coefficients of d£ v ' in this expansion are not the standard Lorentz metric 
components. The reason is clear. The do not form an inertial coordinate system. 
However, at the position of the observer, £} = 0, the coefficients reduce to the 
standard form. Therefore these “local coordinates” approximate a Lorentz coordinate 
system in the immediate neighborhood of the observer. 


Exercise 6.6. CLOCK RATES VERSUS COORDINATE TIME 
IN ACCELERATED COORDINATES 

Let a clock be attached to each grid point, (£ l ,£ J >£ 3 ) = constant, of the local coordinate 
system of an accelerated observer. Assume for simplicity that the observer is in hyperbolic 
motion. Use equation (6.18) to show that proper time as measured by a lattice clock differs 
from coordinate time at its lattice point: 

dr/d^ = 1 + 

(Of course, very near the observer, at < g~ l , the discrepancy is negligible.) 

Exercise 6.7. ACCELERATION OF LATTICE POINTS 
IN ACCELERATED COORDINATES 

Let an accelerometer be attached to each grid point of the local coordinates of an observer 
in hyperbolic motion. Calculate the magnitude of the acceleration measured by the acceler¬ 
ometer at (£*, £ 2 , £ 3 *). 

Exercise 6.8. OBSERVER WITH ROTATING TETRAD 

An observer moving along an arbitrarily accelerated world line chooses not to Fermi-Walker 
transport his orthonormal tetrad. Instead, he allows it to rotate. The antisymmetric rotation 
tensor O that enters into his transport law 

de^/di = — O • e a > (6.19) 

splits into a Fermi-Walker part plus a spatial rotation part: 

Q» v = aW - a v u* + u a a^ a ^ ( 6 . 20 ) 



w = a vector orthogonal to 4-velocity u. 

(a) The observer chooses his time basis vector to be e 0 > = u. Show that this choice is 
permitted by his transport law (6.19), (6.20). 

(b) Show that Q^ R) produces a rotation in the plane perpendicular to u and w— i.e., that 

G(sr) ' u = 0, fi(sn) ’ w = 0- (6.21) 

(c) Suppose the accelerated observer Fermi-Walker transports a second orthonormal tetrad 
e a ". Show that the space vectors of his first tetrad rotate relative to those of his second tetrad 
with angular velocity vector equal to w. Hint: At a moment when the tetrads coincide, show 
that (in three-dimensional notation, referring to the 3-space orthogonal to the observer’s world 
line): 


d(e f — e r )/dr = co X e f . 


(6.22) 
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(d) The observer uses the same prescription [equation (6.16)] to set up local coordinates 
based on his rotating tetrad as for his Fermi-Walker tetrad. Pick an event £ on the observer’s 
world line, set r = 0 there, and choose the original inertial frame of prescription (6.16) so 
(1) it comoves with the accelerated observer at is?, (2) its origin is at £?, and (3) its axes 
coincide with the accelerated axes at <2. Show that these conditions translate into 


z"(0) = 0, e„.(0) = e a . 


(6.23) 


(e) Show that near is?, equations (6.16) for the rotating, accelerated coordinates reduce 


to: 


x° — £ 0 ' + a k ,£ k 'g>' 4- 0([£ a '] 3 ); 

x> = & + J a'! 0 ' 2 + + 0([f''] 3 ). 


(6.24) 


(f) A freely moving particle passes through the event is? with ordinary velocity v as 
measured in the inertial frame. By transforming its straight world line x* = id* 0 to the 
accelerated, rotating coordinates, show that its coordinate velocity and acceleration there are: 

(d&/dp)*t = o'; 


= —d - 2e** 1 toV + 2v>a k v k . 


inertial 

acceleration 


t t 

Coriolis relativisitc 
acceleration correction to 

inertial acceleration 


(6.25) 


Exercise 6.9. THOMAS PRECESSION 

Consider a spinning body (gyroscope, electron,...) that accelerates because forces act at its 
center of mass. Such forces produce no torque; so they leave the body’s intrinsic angular- 
momentum vector S unchanged, except for the unique rotation in the u A a plane required 
to keep S orthogonal to the 4-velocity u. Mathematically speaking, the body Fermi-Walker 
transports its angular momentum (no rotation in planes other than u A a): 

dS/dj zz (u A a) • S. (6.26) 


This transport law applies to a spinning electron that moves in a circular orbit of radius 
r around an atomic nucleus. As seen in the laboratory frame, the electron moves in the x, 
j-plane with constant angular velocity, to. At time ( = 0, the electron is at x = r, y = 0; 
and its spin (as treated classically) has components 

S° = 0 , = S v = 0 , S z = ~fi. 

V2 2 

Calculate the subsequent behavior of the spin as a function of laboratory time. S M (r). Answer: 
S x = —L ft (cos o)t cos wy t + y sin i ot sin coy/); 

V2 

S v = —h (sin <o/ cos coy/ — y cos to/ sin coy/); 

S* =-n ; 

2 


(6.27) 


5° =-— h vy sin toy/; 

V2 

Y = (i - v 2 r V2 . 


v — cor; 
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Rewrite the time-dependent spatial part of this as 


S 1 + iS y = JL Huf + ,(1 - y)sin(<oy/)£ f “ ( ]. 

V2 

The first term rotates steadily in a retrograde direction with angular velocity 

"Thomas = (V ~ 1)" 

2; y t> 2 co < to if v < 1. 


(6.28) 


(6.29) 


It is called the Thomas precession. The second term rotates in a righthanded manner for 
part of an orbit (0 < coy/ < -tt) and in a lefthanded manner for the rest (tt < toy/ < 2-r). 
Averaged in time, it does nothing. Moreover, in an atom it is very small (y — 1 < 1). It 
must be present, superimposed on the Thomas precession, in order to keep 

S • u — 5 • « — S°u° = 0, (6.30) 

and 

S 2 — S 2 — (S' 0 ) 2 = 3ft 2 /4 = constant. (6.31) 

It comes into play with righthanded rotation when S * u is negative; it goes out of play when 
S* u = 0; and it returns with lefthanded rotation when S*« turns positive. 

The Thomas precession can be understood, alternatively, as a spatial rotation that results 
from the combination of successive boosts in slightly different directions. [See, e.g., exercise 
103 of Taylor and Wheeler (1966).] For an alternative derivation of the Thomas precession 
(6.29) from “spinor formalism,” see §41.4. 



CHAPTER 


INCOMPATIBILITY OF GRAVITY 
AND SPECIAL RELATIVITY 

_ J 


§7.1. ATTEMPTS TO INCORPORATE GRAVITY 
INTO SPECIAL RELATIVITY 

The discussion of special relativity so far has consistently assumed an absence of 
gravitational fields. Why must gravity be ignored in special relativity? This chapter 
describes the difficulties that gravitational fields cause in the foundations of special 
relativity. After meeting these difficulties, one can appreciate fully the curved-space¬ 
time methods that Einstein introduced to overcome them. 

Start, then, with what one already knows about gravity, Newton’s formulation 
of its laws: 

d 2 x i /dt 2 = -d&/dx\ (7.1) 

V 2 <P = 4 vGp. (7.2) 


-. 

This chapter is entirely 
Track 2. 

It depends on no preceding 
Track-2 material. 

It is not needed as 
preparation for an,y later 
chapter, but will be 
helpful in Chapter 18 (weak 
gravitational fields), and in 
Chapters 38 and 39 
(experimental tests and other 
theories of gravity). 

V_ J 


These equations cannot be incorporated as they stand into special relativity. The 
equation of motion (7.1) for a particle is in three-dimensional rather than four-di¬ 
mensional form; it requires modification into a four-dimensional vector equation 
for d 2 x^/dT 2 . Likewise, the field equation (7.2) is not Lorentz-invariant, since the 
appearance of a three-dimensional Laplacian operator instead of a four-dimensional 
d’Alembertian operator means that the potential <P responds instantaneously to 
changes in the density p at arbitrarily large distances away. In brief, Newtonian 
gravitational fields propagate with infinite velocity. 

One’s first reaction to these problems might be to think that they are relatively 
straightforward to correct. Exercises at the end of this section study some relatively 
straightforward generalizations of these equations, in which the gravitational poten¬ 
tial 0 is taken to be first a scalar, then a vector, and finally a symmetric tensor field. 
Each of these theories has significant shortcomings, and all fail to agree with obser¬ 
vations. The best of them is the tensor theory (exercise 7.3, Box 7.1), which, however, 


Newton's gravitational laws 
must be modified into 
four-dimensional, geometric 
form 


All straightforward 
modifications are 
unsatisfactory 
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Best modification (tensor 
theory in flat spacetime) is 
internally inconsistent; when 
repaired, it becomes general 
relativity. 


is internally inconsistent and admits no exact solutions. This difficulty has been 
attacked in recent times by Gupta (1954, 1957, 1962), Kraichnan (1955), Thirring 
(1961), Feynman (1963), Weinberg (1965), Deser (1970). They show how the flat- 
space tensor theory may be modified within the spirit of present-day relativistic field 
theory to overcome these inconsistencies. By this field-theory route (part 5 of Box 
17.2), they arrive uniquely at standard 1915 general relativity. Only at this end point 
does one finally recognize, from the mathematical form of the equations, that what 
ostensibly started out as a flat-space theory of gravity is really Einstein’s theory, 
with gravitation being a manifestation of the curvature of spacetime. This book 
follows Einstein’s line of reasoning because it keeps the physics to the fore. 


EXERCISES 


EXERCISES ON FLAT-SPACETIME THEORIES OF GRAVITY 

The following three exercises provide a solid challenge. Happily, all three require similar 
techniques, and a solution to the most difficult one (exercise 7.3) is presented in Box 7.1. 
Therefore, it is reasonable to proceed as follows, (a) Work either exercise 7.1 (scalar theory 
of gravity) or 7.2 (vector theory of gravity), skimming exercise 7.3 and Box 7.1 (tensor theory 
of gravity) for outline and method, not for detail, whenever difficulties arise, (b) Become 
familiar with the results of the other exercise (7.2 or 7.1) by discussing it with someone who 
has worked it in detail, (c) Read in detail the solution to exercise 7.3 as presented in Box 
7.1, and compare with the computed results for the other two theories, (d) Develop computa¬ 
tional power by checking some detailed computations from Box 7.1. 


Exercise 7.1. SCALAR GRAVITATIONAL FIELD, <P 

A. Consider the variational principle 81 = 0, where 


C <p( dz a dzfi \ l/ *si 

=-m j ,q-t, a/3 ——) d\. 


(7.3) 


Here m = (rest mass) and z a (X) = (parametrized world line) for a test particle in the scalar 
gravitational field <P. By varying the particle’s world line, derive differential equations 
governing the particle’s motion. Write them using the particle’s proper time as the path 
parameter, 


, / dz a dz & V /2 

= i\) dx > 


so that u a = dz a /dr satisfies u a u^ a p = — 1. 

B. Obtain the field equation for <P(x) implied by the variational principle 67 = 0, where 
I = f£ d*x and 

*=- <dc *■' w J -** - ■*• <74 > 


Show that the second term here gives the same integral as that studied in part A (equa¬ 
tion 7.3). 

Discussion: The field equations obtained describe how a single particle of mass m generates 
the scalar field. If many particles are present, one includes in £ a term — fme 0 6 4 [x — z(t)] dr 
for each particle. 

C. Solve the field equation of part B, assuming a single source particle at rest. Also assume 
that e 0 = 1 is an adequate approximation in the neighborhood of the particle. Then check 
this assumption from your solution; i.e., what value does it assign to e 0 at the surface of 
the earth? (Units with c = 1 are used throughout; one may also set G = 1, if one wishes.) 
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D. Now treat the static, spherically symmetric field 0 from part C as the field of the sun 
acting as a given external field in the variational principle of part A, and study the motion 
of a planet determined by this variational principle. Constants of motion are available from 
the spherical symmetry and time-independence of the integrand. Use spherical coordinates 
and assume motion in a plane. Derive a formula for the perihelion precession of a planet. 

E. Pass to the limit of a zero rest-mass particle in the equations of motion of part A. 
Do this by using a parameter A different from proper time, so chosen that k* ~ dx^/dX 
is the energy-momentum vector, and by taking the limit m —*- 0 with k° = ym = E remain¬ 
ing finite (so u° = y —oo). Use these equations to show that the quantities q* — k^e* are 
constants of motion, and from this deduce that there is no bending of light by the sun in 
this scalar theory. 


Exercise 7.2. VECTOR GRAVITATIONAL FIELD, 0 

A. Verify that the variational principle 81 — 0 gives Maxwell’s equations by varying A and 
the Lorentz force law by varying z m (t), when 


I = 


-1 

167: 




dz** 


(7.5) 


Here is an abbreviation for A u /l — A^ v . Hint: to vary A^(x), rewrite the last term as 
a spacetime integral by introducing a delta function 6 4 [x — r(r)] as in exercise 7 . 1 , parts 
A and B. 

B. Define, by analogy to the above, a vector gravitational field 0 M with G^ = 0 Pji — 0^ v 
using a variational principle with 

' - * uSaS' a ~ a " ** + T " flSTT?- * * m $*-!£"*■ (7<i) 


(Note: if many particles are present, one must augment 7 by terms hm^dz^/dr^dz^/dr) dr + 
m^^dz^/dr) di for each particle.) Find the “Coulomb” law in this theory, and verify that 
the coefficients of the terms in the variational principle have been chosen reasonably. 

C. Compute the perihelion precession in this theory. 

D. Compute the bending of light in this theory (i.e., scattering of a highly relativistic 
particle u° = y — 00 ), as it passes by the sun, because of the sun’s 0 field. 

E. Obtain a formula for the total field energy corresponding to the Lagrangian implicit 
in part B. Use the standard method of Hamiltonian mechanics, with 


Afield — 


16? tG 


f G^G“- d*x=f £d*x\ 


F is the Lagrangian density and L = d 3 x is the Lagrangian. The corresponding Hamil¬ 
tonian density (= energy density) is 




dF 


d0 


- F . 


p,o 


Show that vector gravitational waves carry negative energy. 


Exercise 7.3. SYMMETRIC TENSOR GRAVITATIONAL FIELD, = h 

Here the action principle is, as for the vector field, 81 = 0, with 7 = 7 field 4 - 7 particle + 

interaction- ^particle is the Same as for the vector field: 


7 


particle 


1 r dz“ d: 

2 m J dr dr 


dr. 


(7.7) 
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However, / fieW and /, nteraction are different: 


4eld ” J 

. Note that 

£ <= ifi (y h ^° R " s ’ a ~ one h her i 

Us not an /i_ 

with 

(7.8a) 

(7.8b) 

h — h — — 7) • 

V*‘ - ,l IXV 2 *fiv rl O 9 

(7.8c) 

I. — — f h rl^x 

A interaction 2 J V*' 

(7-9) 

Here P“' is the stress-energy tensor for all nongravitational fields and matter present. For 
a system of point particles (used throughout this exercise), 

T»-(x) = f - *(t)] dr. 

(7.10) 

A. Obtain the equations of motion of a particle by varying z*( t) in S(/ particle + 
■^interaction) = Express your result in terms of the “gravitational force field” 


(7.11) 


derived from the tensor gravitational potentials h^ v = h Vfl . 

B. Obtain the field equations from <$(/ field + interaction) = 0; express them in terms of 

ft + JPhf* - h ap 7]^ ~ hrt ij a '. (7.12) 

Discuss gauge invariance, and the condition h^ a a = 0. 

C. Find the tensor gravitational potentials h 9 due to the sun (treated as a point mass). 

D. Compute the perihelion precession. 

E. Compute the bending of light. 

F. Consider a gravitational wave 

hP = exp(ik a x a ). (7.13) 

What conditions are imposed by the field equations? By the gauge condition 

— 0? (7.14) 

Show that, by further gauge transformations 

^nv ^tiv *F £fi,v 4* (7.15) 

that preserve the h* a a = 0 restrictions, further conditions 

u a h a » = 0, h« a = 0 (7.16) 

can be imposed, where u a is a fixed, timelike vector. It is sufficient to consider the case, 
obtained by a suitable choice of reference frame, where u a = (1; 0, 0, 0) and k a = (to; 0, 0, to). 

G. From the Hamiltonian density 

X = h'jp£/dh„)~£ (7.17) 

for the field, show that the energy density of the waves considered in part F is positive. 

H. Compute T^ v v for the stress-energy tensor of particles T* v that appears in the action 
integral I. Does T^ v v vanish (e.g., for the earth in orbit around the sun)? Why? Show that 
the coupled equations for fields and particles obtained from 81 = 0 have no solutions. 

(continued on page 187) 


Box 7.1 AN ATTEMPT TO DESCRIBE GRAVITY BY A SYMMETRIC 

TENSOR FIELD IN FLAT SPACETIME [Solution to exercise 7.3] 

Attempts to describe gravity within the framework of special relativity would natu¬ 
rally begin by considering the gravitational field to be a scalar (exercise 7.1) as it 
is in Newtonian theory, or a vector (exercise 7.2) by analogy to electromagnetism. 
Only after these are found to be deficient (e.g., no bending of light in either theory; 
negative-energy waves in the vector theory) would one face the computational 
complexities of a symmetric tensor gravitational potential, h^ v — h v ^ which has more 
indices. 

The foundations of the most satisfactory of all tensor theories of gravity in flat 
spacetime are laid out at the beginning of exercise 7.3. The choice of the Lagrangian 
made there (equations 7.8) is dictated by the demand that h^ be a “Lorentz covari¬ 
ant, massless, spin-two field.” The meaning of this demand, and the techniques of 
special relativity required to translate it into a set of field equations, are customarily 
found in books on elementary particle physics or quantum field theory; see, e.g., 
Wentzel (1949), Feynman (1963), or Gasiorowicz (1966). Fierz and Pauli (1939) 
were the first to write down this Lagrangian and investigate the resulting theory. 
The conclusions of the theory are spelled out here in the form of a solution to exercise 
7.3. 

A. Equation of Motion for a Test Particle (exercise 7.3A) 

Carry out the integration in equation (7.9), using the particle stress-energy tensor 
of equation (7.10), to find 

4+ i = Vticle + interaction = J m f dj ' (1) 


where 


z* 4 = dz^/dr. 


Then compute SI p+i , and find that the coefficient of the arbitrary variation in path 
vanishes if and only if 

(d/dr)^^ + h^z”] - y = 0. 


Rewrite this equation of motion in the form 

+ hfLr)z P + — 0 * ( 2 ) 

where 7^ ajS is defined in equation (7.11). 

B r Field Equations (exercise 7.3B) 

Use 4eid and ^interaction in the forms given in equations (7.8) and (7.9); but for the 
quickest and least messy derivation, do not use the standard Euler-Lagrange 
equations. Instead, compute directly the first-order change Si? f produced by a small 
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Box 7.1 (continued) 


variation 8h a p of the field. For the second term of it is clear (by relabeling dummy 
indices as needed) that varying each factor gives the same result, so the two terms 
from the product rule combine: 


A similar result holds for the first term of in view of the identity a^b ^ = a^ v b^ v , 
which holds for the “bar” operation of equations (7.8); each side here is just a^ v b^ l> 
— \a fi ti b v v . Consequently, 

-(32^7 G) 8£ r = 8h^ a - 2fr* , Sh^*. (3) 

Next use this expression in <5/ field ; and, by an integration by parts, remove the 
derivatives from 8h^ giving 

«/ w = (32*<?)-> f 8h„p - 2h^^8h m ]d*x. 

To find the coefficient of 8h (iv in this expression, write (from equation 7.8c) 

8h aH = ( 8 \ 8 rfi ~ 5/ W 


and then rearrange and relabel dummy (summation) indices to obtain 

S/fieid = (3277G)-1 J [h^'%8h pfi - 2h»ey8h'Jd*x. 

By combining this with 5/ interaction = ^T 11 " 8h v d*x, and by using the symmetry 
5/ V = 8h w obtain 

+ h* a <a - v + h ea ^> 1 = 16-tt-GP'. 

The definition made in equation (7.12) allows this to be rewritten as 

ap = I677 GT^. 


(4) 

(4') 


B 2 . Gauge Invariance (exercise 7.3B, continued) 

The symmetries, 


of H mv & imply an identity 

n — J-flLOth’P] = n 

— 77 y 0t((3v) — ^ 

analogous to F* 9 9fL = 0 in electromagnetism. 

Thus 7>' v ~ 0 is required of the sources, just as is J* — 0 in electromagnetism 
(exercise 3.16). These identities make the field equations (4') too weak to fix h^ v 
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r 

completely. In particular, by direct substitution in equations (4), one verifies that 
to any solution one can add a gauge field 

U (gauge) _ £ It 

* v 9 '*' (5) 

V gauge) = ^ 

without changing T^ v . 

Let vanish outside some finite spacetime volume, but be otherwise arbitrary. 
Then h^ v and h v = h^ v + /z M / gauge) both satisfy the source equation (4) for the same 
source T^ v and the same boundary conditions at infinity. We therefore expect them 
to be physically equivalent. 

By a specialization of the gauge analogous to the “Lorentz” specialization A a a = 0 
of electromagnetism (equation 3.58a; exercise 3.17), one imposes the condition 

k^ a = 0 . ( 6 ) 

This reduces the field equations (4) to the simple d’Alembertian form 

□/i> = = - 16?r (7) 

(see exercise 18.2). Here and henceforth we set G = 1 (“geometrized units”). 

C. Field of a Point Mass (exercise 7.3C) 

For a static source, the wave equation (7) reduces to a Laplace equation 

v% u = - 16 * 7 ;,. 

The stress-energy tensor for a static point mass (equation 7.10) is T 00 = MS 3 (x) and 
= 0. Put this into the Laplace equation, solve for and use equation (7.8c) 
to obtain h . The result is: 

h 00 = 2M/r; h 0k = 0; h ik = S ik (2M/r) (8) 

(see equation 18.15a). 


D. Perihelion Precession (exercise 7.3D) 


Direct substitution of the potential (8) into the equations of motion (2) is tedious 
and not very instructive. Variational principles are popular in mechanics because 
they simplify such calculations. Return to the basic variational principle 8I p+i = 0 
(equation 1), and insert the potential (8) for the sun. Convert to spherical coordinates 
so oriented that the orbit lies in the equatorial (0 = tt/2) plane: 

I p+i = fLdr; (9) 


L L = i ffl [-( 1 - 2Mr~ l )i- + (1 + 2Mr~ l )(r 2 + rV)]. 


( 10 ) 
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~\ 

Box 7.1 (continued) 

From the absence of explicit <>-, and T-dependence in L, infer three constants 
of motion: the canonical momenta 


(this defines y) and 


P t = -my = dL/di 


~ ma — dL/d$ 


(this defines a); and the Hamiltonian 


H — x^dL/dx 11 ) — L, 

which can be set equal to — \m by appropriate normalization of the path parameter 
r. From these constants of the motion, derive an orbit equation as follows: (1) 
calculate H = —\m in terms of r, r, <p, and /; (2) eliminate t and <j> in favor of the 
constants y and a; (3) as in Newtonian orbit problems, define u = M/r, and write 

du_ _ u_ _ __ Mr_ _ _ M.j + 2u)r 
d<+> <p r 2 <p a 


(4) in H , eliminate r in favor of du/d$ via the above equation, and eliminate r in 
favor of u\ (5) solve for du/d<}>. The result is 



4 - u 2 = (y 2 — 1 q- 



1 + 2u 
1 - 2 u 




( 11 ) 


Neglecting cubic and higher powers of u - GM/c 2 r ~~ (1 — y 2 ) in this equation, 
derive the perihelion shift. (For details of method, see exercise 40.4, with the y and 
a of this box renamed E and L, and with the y and ft of that exercise set equal 
to 1 and 0.) The resulting shift per orbit is 

Af> = M/r 0 + O([M/r 0 ] 2 ). (12) 

This is | the prediction of general relativity; and it disagrees with the observations 
on Mercury (see Box 40.3). 


E. Bending of Light (exercise 7.3E) 

The deflection angle for light passing the sun is, on dimensional grounds, a small 
quantity, A<fi — M 0 /R 0 ■— 10“ 6 ; from the outset, one makes approximations based 
on this smallness. A diagram of the photon trajectory, set in the jc, z-plane, shows 
that, for initial motion parallel to the z-axis, the deflection angle can be expressed 
in terms of the final momentum as A(j> = pjp z . Compute the final p x by an integral 
along the trajectory, 

f +oc 

Px = J (dp x /dz) dz , 

J 
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treating p z as essentially constant. This computation requires generalization of the 
equation of motion (2) to the case of zero rest mass. To handle the limit m —*- 0, 
introduce a new parameter \ = r/m\ then p ** = m(dz^/dr) — dz^/dX, Also define 
Pn = + h piV )p v i since this quantity appears simply in equation (2) and agrees 

with p^ in the limit r —oo, where one will need to evaluate it. Then equation (2) 
reads, for any m , including m — 0, 





On the righthand side here, since h a ^ M is small, a crude approximation t op^ suffices; 
p l = p 2 = 0, p° = p 3 = dz/dX = w = constant. Thus, 


and 


For the sun, h 00 = /? 33 


dP 1 

~dX = 2"^ 00 + ^ 03 + ^ 33 ), i^ 2 

~ 2 (/2 °° + + * 33 ^ 1 ’ 

= 2M/r, and /z 03 = 0 (equation 8), so 


A<j> = 


(p x \ _ (PA r x 2A/1 dz ‘ 2M r x ^ 4A/ 

Wfinal " U 'final “L ^ ^ ” * i, d + “ i ' 


For light grazing the sun, 1 = R 0 , this gives = 4A/ 0 //? 0 radians = 1".75, which 
is also the prediction of general relativity, and is consistent with the observations 
(see Box 40.1). 


F. Gravitational Waves (exercise 7.3F) 

The field equations (4) and gauge properties (5) of the present flat-spacetime theory 
are identical to those of Einstein’s “linearized theory.” Thus, the treatment of 
gravitational waves using linearized theory, which is presented in §§18.2, 35.3, and 
35.4, applies here. 


G. Positive Energy of the Waves (exercise 7.3G) 

Computing a general formula for 9C from equation (7.17) is tedious, but it is sufficient 
to consider only the special case of a plane wave (equation 7.13)—or better still, 





Box 7.1 (continued) 


a plane wave with only h l2 = h 2l = f(z — t). Any gravitational wave can be con¬ 
structed as a superposition of such plane waves. First compute the Langrangian for 
this case. According to equation (7.8), it reads 

£ f = (32 7 7)“ 1 P 12 , 0 ) 2 - (*i2,3) 2 ]- 

Now the full content of the formula (7.17) defining Oi is precisely the following: 
start from the Lagrangian; keep all terms that are quadratic in time derivatives; 
omit all terms that are linear in time derivatives; and reverse the sign of terms that 
contain no time derivatives. The result is 

X = (32^r 1 p 12 ,o) 2 + (^I 2 t3 ) 2 ]. (14) 

which is positive. 


H. Self-Inconsistency of the Theory (exercise 7.3H) 

From equation (7.10), find 


T » p . = m 


I 


dz * dz v 5 
dr dr dx v 


8 4 [x — z(r)] dr. 


But 6 4 (x — z) depends only on the difference — z M , so — d/dz*' can replace d/dx v 
when acting on the 6-function. Noting that 

-*,!-£**-*)l. 

rewrite T^ v v as 

T ^ v — ~ m f ^(d/dr) 8 4 [x — z(r)]dr = +m J z^ 8 4 [x — z(r)\ dr. 

(The last step is obtained by an integration by parts.) Thus T* 9 w = 0 holds if and 
only if — 0. But z* — 0 means that the gravitational fields have no effect on the 
motion of the particle. But this contradicts the equation of motion (2), which follows 
from the theory’s variational principle. Thus, this tensor theory of gravity is incon¬ 
sistent. [Stated briefly, equation (4) requires T^ v v — 0, while equation (2) excludes 
it.] 

The fact that, in this theory, gravitating bodies cannot be affected by gravity, also 
holds for bodies made of arbitrary stress-energy (e.g., rubber balls or the Earth). 
Since all bodies gravitate, since the field equations imply T^ v v — 0, and since this 
“equation of motion for stress-energy” implies conservation of a body’s total 4-mo¬ 
mentum = fT»° d 2 x, no body can be accelerated by gravity. The Earth cannot 
be attracted by the sun; it must fly off into interstellar space! 

Straightforward steps to repair this inconsistency in the theory lead inexorably 
to general relativity (see Box 17.2 part 5). Having adopted general relativity as 
the theory of gravity, one can then use the present flat-spacetime theory as an 
approximation to it (“Linearized general relativity”; treated in Chapters 18, 19, and 
35; see especially discussion at end of §18.3). 
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§7.2. GRAVITATIONAL RED SHIFT DERIVED 
FROM ENERGY CONSERVATION 

Einstein argued against the existence of any ideal, straight-line reference system such 
as is assumed in Newtonian theory. He emphasized that nothing in a natural state 
of motion, not even a photon, could ever give evidence for the existence or location 
of such ideal straight lines. 

That a photon must be affected by a gravitational field Einstein (1911) showed 
from the law of conservation of energy, applied in the context of Newtonian gravita¬ 
tion theory. Let a particle of rest mass m start from rest in a gravitational field g 
at point a and fall freely for a distance h to point It gains kinetic energy mgh. 
Its total energy, including rest mass, becomes 

m mgh . (7.18) 

Now let the particle undergo an annihilation at converting its total rest mass 
plus kinetic energy into a photon of the same total energy. Let this photon travel 
upward in the gravitational field to (!. If it does not interact with gravity, it will 
have its original energy on arrival at Ct. At this point it could be converted by a 
suitable apparatus into another particle of rest mass m (which could then repeat 
the whole process) plus an excess energy mgh that costs nothing to produce. To avoid 
this contradiction of the principal of conservation of energy, which can also be stated 
in purely classical terms, Einstein saw that the photon must suffer a red shift. The 
energy of the photon must decrease just as that of a particle does when it climbs 
out of the gravitational field. The photon energy at the top and the bottom of its 
path through the gravitational field must therefore be related by 

^bottom = AopO + S h ) = AopC 1 + geonvVc 2 ). ( 7 - 19 ) 

The drop in energy because of work done against gravitation implies a drop in 
frequency and an increase in wavelength (red shift; traditionally stated in terms of 
a red shift parameter, z = JA/A); thus, 

I _!_ z _ ^top _ ^bottom __ ^bottom _ j ^ (7.20) 

^bottom ^top ^top 

The redshift predicted by this formula has been verified to 1 percent by Pound and 
Snider (1964, 1965), refining an experiment by Pound and Rebka (1960). 


§7.3. GRAVITATIONAL REDSHIFT IMPLIES 
SPACETIME IS CURVED 

An argument by Schild (1960, 1962, 1967) yields an important conclusion: the 
existence of the gravitational redshift shows that a consistent theory of gravity cannot 
be constructed within the framework of special relativity. 


Gravitational redshift derived 
from energy considerations 
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7. INCOMPATIBILITY OF GRAVITY AND SPECIAL RELATIVITY 


Assume gravity is described 
by an (unspecified) field in 
flat spacetime . . . 


♦ 



t 



Figure 7.1. 

Successive pulses of light rising from height z v to height z 2 = z x + h against the gravitational field of 
the earth. The paths Yi> and y 2 must be exactly congruent, whether sloped at 45° (left) or having variable 
slope (right). 


Whereas Einstein’s argument (last section) was formulated in Newtonian theory, 
Schild’s is formulated in special relativity. It analyzes gravitational redshift experi¬ 
ments in the field of the Earth, using a global Lorentz frame tied to the Earth’s 
center. It makes no demand that free particles initially at rest remain at rest in this 
global Lorentz frame (except far from the Earth, where gravity is negligible). On 
the contrary, it demands that free particles be accelerated relative to the Lorentz 
frame by the Earth’s gravitational field. It is indifferent to the mathematical nature 
of that field (scalar, vector, tensor,...), but it does insist that the gravitational 
accelerations agree with experiment. And, of course, it demands that proper lengths 
and times be governed by the metric of special relativity. 

Schild’s argument proceeds as follows. Consider one observer at rest on the Earth’s 
surface at height z v and a second above the Earth’s surface at height z 2 = z x + h 
(Figure 7.1). The observers may verify that they are at rest relative to each other 
and relative to the Earth’s Lorentz frame by, for instance, radar ranging to free 
particles that are at rest in the Earth’s frame far outside its gravitational field. The 
bottom experimenter then emits an electromagnetic signal of a fixed standard 
frequency to & which is received by the observer on top. For definiteness, let the signal 
be a pulse exactly N cycles long. Then the interval of time* 5r bot required for the 
emission of the pulse is given by 2ttN = <$r bot . The observer at the top is then 
to receive these same N cycles of the electromagnetic wave pulse and measure the 
time interval* 5r top required. By the definition of “frequency,” it satisfies 2? tN = 

^ T top* The redshift effect, established by experiment (for us) or by energy conserva¬ 
tion (for Einstein), shows consequently the time intervals are different, 

5r to P > ^ T bot* Transfer this information to the special-relativity spacetime diagram of 
the experiment (Figure 7.1). The waves are light rays; so one might show them as 
traveling along 45° null fines in the spacetime diagram (Figure 7.1,A). In this 

* Proper time equals Lorentz coordinate time for both observers, since they are at rest in the Earth’s 
Lorentz frame. 
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simplified but slightly inadequate form of the argument, one reaches a contradiction 
by noticing that here one has drawn a parallelogram in Minkowski spacetime in 
which two of the sides are unequal, r top > r bot , whereas a parallelogram in flat 
Minkowski spacetime cannot have opposite sides unequal. One concludes that special 
relativity cannot be valid over any sufficiently extended region. Globally, spacetime, 
as probed by the tracks of light rays and test particles, departs from flatness (“curva¬ 
ture”; Parts III and IV of this book), despite the fine fit that Lorentz-Minkowski 
flatness gives to physics locally. 

Figure 7.1,B, repairs an oversimplification in this argument by recognizing that 
the propagation of light will be influenced by the gravitational field. Therefore 
photons might not follow straight lines in the diagram. Consequently, the world lines 
Yj and y 2 of successive pulses are curves. However, the gravitational field is static 
and the experimenters do not move. Therefore nothing in the experimental setup 
changes with time. Whatever the path y v the path y 2 must be a congruent path 
of exactly the same shape, merely translated in time. On the basis of this congruence 
and the fact that the observers are moving on parallel world lines, one would again 
conclude, if flat Minkowski geometry were valid, that r bot = r top , thus contradicting 
the observed redshift experiment. The experimenters do not need to understand the 
propagation of light in a gravitational field. They need only use their radar apparatus 
to verify the fact that they are at rest relative to each other and relative to the source 
of the gravitational field. They know that, whatever influence the gravitational field 
has on their radar apparatus, it will not be a time-dependent influence. Moreover, 
they do not have to know how to compute their separation in order to verify that 
the separation remains constant. They only need to verify that the round-trip time 
required for radar pulses to go out to each other and back is the same every time 
they measure it. 

Schild’s redshift argument does not reveal what kind of curvature must exist, or 
whether the curvature exists in the neighborhood of the observational equipment 
or some distance away from it. It does say, however, quite unambigously, that the 
flat spacetime of special relativity is inadequate to describe the situation, and it 
should therefore motivate one to undertake the mathematical analysis of curvature 
in Part III. 


This assumption is 
incompatible with 
gravitational redshift 


Conclusion: spacetime is 
curved 


§7.4. GRAVITATIONAL REDSHIFT AS EVIDENCE FOR 
THE PRINCIPLE OF EQUIVALENCE 


Einstein (1908, 1911) elevated the idea of the universality of gravitational interactions 
to the status of a fundamental principle of equivalence , that all effects of a uniform 
gravitational field are identical to the effects of a uniform acceleration of the coordinate 
system. This principle generalized a result of Newtonian gravitation theory, in which 
a uniform acceleration of the coordinate system in equation (7.1) gives rises to a 


Principle of equivalence: z 
uniform gravitational field 
indistinguishable from a 
uniform acceleration of a 
reference frame 
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Gravitational redshift derived 
from equivalence principle 


supplementary uniform gravitational field. However, the Newtonian theory only 
gives this result for particle mechanics. Einstein’s principle of equivalence asserts 
that a similar correspondence will hold for all the laws of physics, including Max¬ 
well’s equations for the electromagnetic field. 

The rules of the game the “scientific method”—require that experimental support 
be sought for any new theory or principle, and Einstein could treat the gravitational 
redshift as the equivalent of experimental confirmation of his principle of equival¬ 
ence. There are two steps in such a confirmation: first, the theory or principle must 
predict an effect (the next paragraph describes how the equivalence principle implies 
the redshift); second, the predicted effect must be observed. With the Pound-Rebka- 


Smder experiments, one is in much better shape today than Einstein was for this 
second step. Einstein himself had to rely on the experiments supporting the general 
concept of energy conservation, plus the necessity of a redshift to preserve energy 
conservation, as a substitute for direct experimental confirmation. 

The existence of the gravitational redshift can be deduced from the equivalence 
principle by considering two experimenters in a rocket ship that maintains a constant 
acceleration g. Let the distance between the two observers be h in the direction of 
the acceleration. Suppose for definiteness that the rocket ship was at rest in some 
inertial coordinate system when the bottom observer sent off a photon. It will require 
time t = h for the photon to reach the upper observer. In that time the top observer 
acquires a velocity v = gt = gh. He will therefore detect the photon and observe 
a Doppler redshift z = v = gh. The results here are therefore identical to equation 

(7.20) . The principle of equivalence of course requires that, if this redshift is observed 
in an experiment performed under conditions of uniform acceleration in the absence 
of gravitational fields, then the same redshift must be observed by an experiment 
performed under conditions where there is a uniform gravitational field, but no 
acceleration. Consequently, by the principle of equivalence, one can derive equation 

(7.20) as applied to the gravitational situation. 


§7.5. LOCAL FLATNESS, GLOBAL CURVATURE 


Equivalence principle implies 
nonmeshing of local Lorentz 
frames near Earth (spacetime 
curvature!) 


The equivalence principle helps one to discern the nature of the spacetime curvature, 
whose existence was inferred from Schild’s argument. Physics is the same in an 
accelerated frame as it is in a laboratory tied to the Earth’s surface. Thus, an 
Earth-bound lab can be regarded as accelerating upward, with acceleration g, relative 
to the Lorentz frames in its neighborhood.* Equivalently, relative to the lab and 
the Earth’s surface, all Lorentz frames must accelerate downward. But the downward 
(radial) direction is different at different latitudes and longitudes. Hence, local 
Lorentz frames, initially at rest with respect to each other but on opposite sides of 
the Earth, subsequently fall toward the center and go flying through each other. 
Clearly they cannot be meshed to form the single global Lorentz frame, tied to the 


*™ S “P ward acceleration of the laboratory, plus equation (6.18) for the line element in an accelerated 
coordinate system, explains the nonequality of the bottom and top edges of the parallelograms in Fig- 
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Earth, that was assumed in Schild’s argument. This nonmeshing of local Lorentz 
frames, like the nonmeshing of local Cartesian coordinates on a curved 2-surface, 
is a clear manifestation of spacetime curvature. 

Geographers have similar problems when mapping the surface of the earth. Over 
small areas, a township or a county, it is easy to use a standard rectangular coordinate 
system. However, when two fairly large regions are mapped, each with one coordi¬ 
nate axis pointing north, then one finds that the edges of the maps overlap each 
other best if placed at a slight angle (spacetime analog: relative velocity of two local 
Lorentz frames meeting at center of Earth). It is much easier to start from a picture 
of a spherical globe, and then talk about how small flat maps might be used as 
good approximations to parts of it, than to start with a huge collection of small 
maps and try to piece them together to build up a picture of the globe. The exposition 
of the geometry of spacetime in this book will therefore take the first approach. 
Now that one recognizes that the problem is to fit together local, flat spacetime 
descriptions of physics into an over-all view of the universe, one should be happy 
to jump, in the next chapter, into a broadscale study of geometry. From this more 
advantageous viewpoint, one can then face the problem of discussing the relationship 
between the local inertial coordinate systems appropriate to two nearby regions that 
have slightly different gravitational fields. 

There are actually two distinguishable ways in which geometry enters the theory 
of general relativity. One is the geometry of lengths and angles in four-dimensional 
spacetime, which is inherited from the metric structure ds 2 of special relativity. 
Schild’s argument already shows (without a direct appeal to the equivalence princi¬ 
ple) that the special-relativistic ideas of length and angle must be modified. The 
modified ideas of metric structure lead to Riemannian geometry, which will be 
treated in Chapters 8 and 13. However, geometry also enters general relativity 
because of the equivalence principle. An equivalence principle can already be stated 
within Newtonian gravitational theory, in which no concepts of a spacetime metric 
enter, but only the Euclidean metric structure of three-dimensional space. The 
equivalence-principle view of Newtonian theory again insists that the local standard 
of reference be the freely falling particles. This requirement leads to the study of 
a spacetime geometry in which the curved world fines of freely falling particles are 
defined to be locally straight. They play the role in a curved spacetime geometry 
that straight fines play in flat spacetime. This “affine geometry” will be studied in 
Chapters 10-12. It leads to a quantitative formulation of the ideas of “covariant 
derivative” and “curvature” and even “curvature of Newtonian spacetime”! 


Nonmeshing of local Lorentz 
frames motivates study of 
geometry 


Two types of geometry 
relevant to spacetime: 


Riemannian geometry 
(lengths and angles) 


Affine geometry ("straight 
lines" and curvature) 




PART 


III 


THE MATHEMATICS OF 
CURVED SPACETIME 

Wherein the reader is exposed to the charms of a new temptress — 
Modern Differential Geometry—and makes a decision: 

to embrace her for eight full chapters; or, 
having drunk his fill, to escape after one. 



CHAPTER 


'N 


DIFFERENTIAL GEOMETRY: 

AN OVERVIEW 


/ am coming more and more to the conviction that the necessity 
of our geometry cannot be demonstrated, at least neither by, nor 
for, the human intellect. . . . geometry should be ranked, not 
with arithmetic, which is purely a priori Stic, but with mechanics. 

(1817) 

We must confess in all humility that, while number is a product 
of our mind alone, space has a reality beyond the mind whose 
rules we cannot completely prescribe. (1830) 

CARL FRIEDRICH GAUSS 




§8.1. AN OVERVIEW OF PART III 


Gravitation is a manifestation of spacetime curvature, and that curvature shows up 
in the deviation of one geodesic from a nearby geodesic (“relative acceleration of 
test particles”). The central issue of this part of the book is clear: How can one 
quantify the “separation/' and the “rate of change" of “separation," of two “geodesics ” 
in “curved ” spacetime? A clear, precise answer requires new concepts. 

“Separation” between geodesics will mean “vector.” But the concept of vector as 
employed in flat Lorentz spacetime (a bilocal object: point for head and point for 
tail) must be sharpened up into the local concept of tangent vector , when one passes 
to curved spacetime. Chapter 9 does the sharpening. It also reveals how the passage 
to curved spacetime affects 1-forms and tensors. 

It takes one tool (vectors in curved geometry. Chapter 9) to define “separation” 
clearly as a vector; it takes another tool (parallel transport in curved spacetime. 
Chapter 10) to compare separation vectors at neighboring points and to define the 
“rate of change of separation.” No transport, no comparison; no comparison, no 
meaning to the term “rate of change”! The notion of parallel transport founds itself 


Concepts to be developed in 
Part III: 


Tangent vector 
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Geodesic 

Covariant derivative 


Geodesic deviation 


Spacetime curvature 


This chapter: a Track-1 
overview of differential 
geometry 


on the idea of “geodesic,” the world line of a freely falling particle. The special 
mathematical properties of a geodesic are explored in Chapter 10. That chapter uses 
geodesics to define parallel transport, uses parallel transport to define covariant 
derivative, and—completing the circle—uses covariant derivative to describe geo¬ 
desics. 

Chapter 11 faces up to the central issue: geodesic deviation (“rate of change of 
separation vector between two geodesics”), and its use in defining the curvature of 
spacetime. 

But to define curvature is not enough. The man who would understand gravity 
deeply must also see curvature at work, producing relative accelerations of particles 
in Newtonian spacetime (Chapter 12); he must learn how, in Einstein spacetime, 
distances (metric) determine completely the curvature and the law of parallel trans¬ 
port (Chapter 13); he must be the master of powerful tools for computing curvature 
(Chapter 14); and he must grasp the geometric significance of the algebraic and 
differential symmetries of curvature (Chapter 15). 

Unfortunately, such deep understanding requires time—far more time than one 
can afford in a ten-week or fifteen-week course, far more than a lone reader may 
wish to spend on first passage through the book. For the man who must rush on 
rapidly, this chapter contains a “Track-1” overview of the essential mathematical 
tools (§§8.4—8.7). From it one can gain an adequate, but not deep, understanding 
of spacetime curvature, of tidal gravitational forces, and of the mathematics of curved 
spacetime. This overview is also intended for the Track-2 reader; it will give him 
a taste of what is to come. The ambitious reader may also wish to consult other 
introductions to differential geometry (see Box 8.1). 


r 


Box 8.1 BOOKS ON DIFFERENTIAL GEOMETRY 


A 


There are several mathematics texts that may be 
consulted for a more detailed and extensive dis¬ 
cussion of modern differential geometry along the 
line taken here. Bishop and Goldberg (1968) is the 
no. 1 reference. Hicks (1965) could be chosen as 
a current standard graduate-level text, with 
O’Neill (1966) at the undergraduate level intro¬ 
ducing many of the same topics without presuming 
that the reader finds easy and obvious the current 
style in which pure mathematicians think and 
write. Auslander and MacKenzie (1963) at a 
somewhat more advanced level also allow for the 
reader to whom differential equations are more 

___ 


familiar than homomorphisms. Willmore (1959) is 
easy to read but presents no challenge, and leads 
to little progress in adapting to the style of current 
mathematics. Trautman (1965) and Misner (1964a, 
1969a) are introductions somewhat similar to ours, 
except for deemphasis of pictures; like ours, they 
are aimed at the student of relativity. Flanders 
(1963) is easy and useful as an introduction to 
exterior differential forms; it also gives examples 
of their application to a wide variety of topics in 
physics and engineering. 
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§8.2. TRACK 1 VERSUS TRACK 2: 

DIFFERENCE IN OUTLOOK AND POWER 


Nothing is more wonderful about the relation between Einstein’s theory of gravity 

and Newton’s theory than this, as discovered by Elie Cartan (1923, 1924): that both 

theories lend themselves to description in terms of curvature; that in both this 

curvature is governed by the density of mass-energy; and that this curvature allows 

itself to be defined and measured without any use of or reference to any concept 

of metric. The difference between the two theories shows itself up in this: Einstein’s 

theory in the end (or in the beginning, depending upon how one presents it!) does 

define an interval between every event and every nearby event; Newton’s theory 

not only does not, but even says that any attempt to talk of spacetime intervals 

violates Newton’s laws. This being the case, Track 2 will forego for a time (Chapters Preview of Track-2 

9-12) any use of a spacetime metric (“Einstein interval”). It will extract everything differentia! geometry 

possible from a metric-free description of spacetime curvature (all of Newton’s 

theory; important parts of Einstein’s theory). 

Geodesic deviation is a measurer and definer of curvature, but the onlooker is 
forbidden to reduce a vector description of separation to a numerical measure of 
distance (no metric at this stage of the analysis): what an impossible situation! 

Nevertheless, that is exactly the situation with which Chapters 9-12 will concern 
themselves: how to do geometry without a metric. Speaking physically, one will 
overlook at this stage the fact that the geometry of the physical world is always 
and everywhere locally Lorentz, and endowed with a light cone, but one will exploit 
to the fullest the Galileo-Einstein principle of equivalence: in any given locality one 
can find a frame of reference in which every neutral test particle, whatever its 
velocity, is free of acceleration. The tracks of these neutral test particles define the 
geodesics of the geometry. These geodesics provide tools with which one can do 
much: define parallel transport (Chapter 10), define covariant derivative (Chapter 

10) , quantify geodesic deviation (Chapter 11), define spacetime curvature (Chapter 

11) , and explore Newtonian gravity (Chapter 12). Only after this full exploitation 
of metric-free geodesics will Track 2 admit the Einstein metric back into the scene 
(Chapters 13-15). 

But to forego use of the metric is a luxury which Track 1 can ill afford; too little 
time would be left for relativistic stars, cosmology, black holes, gravitational waves, 

experimental tests, and the dynamics of geometry. Therefore, the Track-1 overview What the Track-1 reader will 
in this chapter keeps the Einstein metric throughout. But in doing so, it pays a heavy miss 
price: (1) no possibility of seeing curvature at work in Newtonian spacetime (Chapter 

12 ) ; (2) no possibility of comparing and contrasting the geometric structures of 
Newtonian spacetime (Chapter 12) and Einstein spacetime (Chapter 13), and hence 
no possibility of grasping fully the Newtonian-based motivation for the Einstein field 
equations (Chapter 17); (3) no possibility of understanding fully the mathematical 
interrelationships of “geodesic,” “parallel transport,” “covariant derivative,” “cur¬ 
vature,” and “metric” (Chapters 9, 10, 11, 13); (4) no possibility of introducing the 
mathematical subjects “differential topology” {geometry without metric or covariant 
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derivative, Chapter 9) and “affine geometry 99 (geometry with covariant derivative 
but no metric. Chapters 10 and 11), subjects which find major application in modern 
analytical mechanics [see, e.g., Arnold and Avez (1968); also exercise 4.11 of this 
book], in Lie group theory with its deep implications for elementary particle physics 
[see, e.g., Hermann (1966); also exercises 9.12, 9.13, 10.16, and 11.12 of this book], 
in the theory and solution of partial differential equations [see, e.g., Sternberg (1969)], 
and, of course, in gravitation theory. 


§8.3. THREE ASPECTS OF GEOMETRY: 

PICTORIAL, ABSTRACT, COMPONENT 


Geometry from three 
viewpoints: pictorial, abstract, 
component 


Gain the power in §8.4 and Chapter 9 to discuss tangent vectors, 1-forms, tensors 
in curved spacetime; gain the power in §8.5 and Chapter 10 to parallel-transport 
vectors, to differentiate them, to discuss geodesics; use this power in §8.7 and Chapter 
11 to discuss geodesic deviation, to define curvature;.... But full power this will 
be only if it can be exercised in three ways: in pictures, in abstract notation, and 
in component notation (Box 8.3). Elie Cartan (Box 8.2) gave new insight into both 


Box 8.2 ELIE CARTAN, 1869-1951 



Elie Cartan is a most remarkable figure in recent 
mathematical history. One learns from his obitu¬ 
ary [Chern and Chevalley (1952)] that he was bom 
a blacksmith’s son in southern France, and proved 
the value of government scholarship aid by rising 
through the system to a professorship at the Sor- 
bonne in 1912 when he was 43. At the age of 32 


he invented the exterior derivative [Cartan (1901)], 
which he used then mostly in differential equations 
and the theory of Lie groups, where he had al¬ 
ready made significant, contributions. He was 
about fifty when he began applying it to geometry, 
and sixty before Riemannian geometry specifically 
was the object of this research, including his text 
[Cartan (1928)], which is still reprinted and worth 
studying. Although universally recognized, his 
work did not find responsive readers until he 
neared retirement around 1940, when the “Bour- 
baki” generation of French mathematicians began 
to provide a conceptual framework for (among 
other things) Cartan’s insights and methods. This 
made Cartan communicable and teachable as his 
own writings never were, so that by the time of 
his death at 82 in 1951 his influence was obviously 
dominating the revolutions then in full swing in 
all the fields (Lie groups, differential equations, 
and differential geometry) in which he had prim¬ 
arily worked. 

The modern, abstract, coordinate-free approach 
to geometry, which is used extensively in this book, 
is due largely to Elie Cartan. He also discovered 
the geometric approach to Newtonian gravity that 
is developed and exploited in Chapter 12. 


V 


J 
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r 

Box 8.3 THREE LEVELS OF DIFFERENTIAL GEOMETRY 

(1) Purely pictorial treatment of geometry: just as one is accustomed nowadays in elec- 


tangent vector is conceived in terms of the 
separation of two points in the limit in which 
the points are indefinitely close; 
vectors are added and subtracted locally as in 
flat space; 

vectors at distinct points are compared by par¬ 
allel transport from one point to another; 
this parallel transport is accomplished by a 
“Schild’s ladder construction” of geodesics 
(Box 10.2); 

diagrams, yes; algebra, no; 
it is tied conceptually as closely as possible to 
the world of test particles and measure¬ 
ments. 

(2) Abstract differential geometry: 

treats a tangent vector as existing in its own 
right, without necessity to give its break¬ 
down into components, 

A "=■ + A^b± + A^b 2 -f- A^b^ 

___ 


tromagnetism to treat the electric vector E , 
without having to write out its components; 
uses a similar approach to differentiation 
(compare gradient operator V of elementary 
vector analysis, as distinguished from coor¬ 
dinate-dependent pieces of such an opera¬ 
tor, such as d/dx y 9 /dy, etc.); 
is the quickest, simplest mathematical scheme 
one knows to derive general results in differ¬ 
ential geometry. 

(3) Differential geometry as expressed in the lan¬ 
guage of components: 

is indispensible in programming large parts of 
general relativity for a computer; 
is convenient or necessary or both when one 
is dealing even at the level of elementary 
algebra with the most simple applications of 
relativity, from the expansion of the Fried¬ 
mann universe to the curvature around a 
static center of attraction. 


Newtonian gravity (Chapter 12) and the central geometric simplicity of Einstein’s 
field equations (Chapter 15), because he had full command of all three methods 
of doing differential geometry. Today, no one has full power to communicate with 
others about the subject who cannot express himself in all three languages. Hence 
the interplay between the three forms of expression in what follows. 

It is not new to go back and forth between the three languages, as witnesses the 
textbook treatment of the velocity and acceleration of a planet in Kepler motion Planetary orbit as example of 
around the sun. The velocity is written three v,ew P oints 

v = v r e' T 4- v*e^. (8.1) 

(The hats ” on q and e^ signify that these are unit vectors.) The acceleration 
is 


dv dv T dv* 

-dF = ~dF e ' r + ~dT e * 


+ 




a — 


( 8 . 2 ) 
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Figure 8.1. 

A Keplerian orbit in the sun’s gravitational field, as treated using the standard Euclidean-space version 
of Newtonian gravity. The basis vectors themselves change from point to point along the orbit [equations 
(8.3)1. This figure illustrates the pictorial aspect of differential geometry. Later (exercise 8.5) it will illustrate 
the concepts of “covariant derivative” and “connection coefficients.” 


The unit vectors are turning (Figure 8.1) with the angular velocity « = d<$>/dt; so 


de- d$ 

~di ~ = 




(8.3) 


Thus the components of the acceleration have the values 

dt , dt dt 2 \dt) 


and 


djA ~d<j> _ d / d<t>\ dr d<j> 

dt dt dt V dt / dt dt 


(8.4a) 

(8.4b) 


Here is the acceleration in the language of components; a was the acceleration 
in abstract language; and Figure 8.1 shows the acceleration as an arrow. Each of 
these three languages will receive its natural generalization in the coming sections 
and chapters from two-dimensional fiat space (with curvilinear coordinates) to 
four-dimensional curved spacetime, and from spacetime to more general manifolds 
(see §9.7 on manifolds). 

Turn now to the Track-1 overview of differential geometry. 


§8.4. TENSOR ALGEBRA IN CURVED SPACETIME 

§8.4. TENSOR ALGEBRA IN CURVED SPACETIME 

To see spacetime curvature at work, examine tidal gravitational forces (geodesic Tensor algebra: 
deviation); and to probe these forces, make measurements in a finite-sized laboratory. 

Squeeze the laboratory to infinitesimal size; all effects of spacetime curvature become (1) occurs in infinitesimal 
infinitesimal; the physicist cannot tell whether he is in flat spacetime or curved neighborhood of an event 
spacetime. Neither can the mathematician, in the limit as his domain of attention 
squeezes down to a single event, 9 0 . 

At the event 9 0 (in the infinitesimal laboratory) both physicist and mathematician 
can talk of vectors, of 1-forms, of tensors; and no amount of spacetime curvature (2) is same in curved 
can force the discussion to change from its flat-space form. A particle at <3 > 0 has a spacetime as in flat 
4-momentum p, with squared length 

p 2 =pp = p(p,p) = -m 2 . 

The squared length, as always, is calculated by inserting p into both slots of a linear 
machine, the metric g at 9 0 . The particle also has a 4-acceleration a at # 0 ; and, 
if the particle is charged and freely moving, then a is produced by the electromag¬ 
netic field tensor F: 

ma — eF (..., u). 

In no way can curvature affect such local, coordinate-free, geometric relations. And 
in no way can it prevent one from introducing a local Lorentz frame at and 
from performing standard, flat-space index manipulations in it: 

P = p a e a , p 2 - p a p p Va, B = p a p a , ma a = eF a ^u /3 . 

But local Lorentz frames are not enough for the man who would calculate in 
curved spacetime. Non-Lorentz frames (nonorthonormal basis vectors {e a }) often 
simplify calculations. Fortunately, no effort at all is required to master the rules 
of “index mechanics” in an arbitrary basis at a fixed event f? 0 . The rules are identical 
to those in flat spacetime, except that (1) the covariant Lorentz components 77^ of 
the metric are replaced by 

gap = e a -e p = g(e a , e p ); (8.5) 

(2) the contravariant components Tj a ^ are replaced by g a & y where 

\\g<*P\\ = Hg^lL 1 (matrix inverse); (8.6) 

i.e., 

s* T = V; ( 8 - 6 ') 

(3) the Lorentz transformation matrix \\A a 'p\\ and its inverse WAP^W are replaced 
by an arbitrary but nonsingular transformation matrix IIL^I! and its inverse 

= e a X ar /8 , — LP a <p a \ (8.7) Transformation of basis 

IILVl! = (8-8) 


(3) rules for component 
manipulation change slightly 
when using nonorthonormal 
basis 

Components of metric 
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(4) in the special case of “coordinate bases,” e a = d&/dx a , ep — W/8x&, 

L a 'p = dx a '/dxP, Z/ a , = dxP/dx*'; (8.9) 

Components of Levi-Civita and (5) the Levi-Civita tensor e , like the metric tensor, has components that depend- 

tensor on how nonorthonormal the basis vectors are (see exercise 8.3): if e 0 points toward 

the future and e v e 2 , e 3 are righthanded, then 

w = (-g) l/2 [«m, 

= g -i €afiy& = -(- g )-v*[apy8l> 

where [a/?yS] is the completely antisymmetric symbol 

( 4* 1 if afiyS is an even permutation of 0123, 

— 1 if afiyS is an odd permutation of 0123, 

0 if aftyS are not all different, 

and where g is the determinant of the matrix llg^ll 

g =detllg 0/9 ll = detlle a • e^ll. 

Read Box 8.4 for full discussion and proofs; work exercise 8.1 below for fuller 
understanding and mastery. 

Several dangers are glossed over in this discussion. In flat spacetime one often 
does not bother to say where a vector, 1-form, or tensor is located. One freely moves 
geometric objects from event to event without even thinking. Of course, the unwritten 
rule of transport is: hold all lengths and directions fixed while moving; i.e., hold 
all Lorentz-frame components fixed; i.e., “parallel-transport” the object. But in 


(8.10a) 


(8.10b) 


( 8 . 11 ) 


Box 8.4 TENSOR ALGEBRA AT A FIXED EVENT IN AN ARBITRARY BASIS 
A. Bases 

Tangent-vector basis: Pick e 0 , e v e 2 > e 3 at arbitrarily—but insist they be 

linearly independent. 

“Dual basis” for 1-forms: The basis {e a } determines a 1-form basis {o/ a } (its “dual 
basis”) by 


<“»*. e /3> = 5 "/J 

[see equation (2.19)]. 

Geometric interpretation (Figure 9.2): e 2 , e 3 , e 0 lie parallel to surfaces of a/ 1 ; 
e x pierces precisely one surface of a/ 1 . 

Function interpretation: (uf*, e = 8 a ^ determines the value of wP on any vector 
u — u&ep (number of “bongs of bell” as u pierces uf*): 
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{uP,u) = (uf^.uPep) ~ — u a . 


Special case: coordinate bases. Choose an arbitrary coordinate system \x a (^ > )}. 
At choose e a — d&/dx a as basis vectors. Then the dual basis is a/ 1 — dx a . 
Proof: the general coordinate-free relation (df v) = ^/[equation (2.17)], with 
f — x a and v — d^P/dx^, reads 

d&fdxP) = (d/dxP)x a = 5 ^. 


B. Algebra of Tangent Vectors and 1-Forms 

The Lorentz-frame discussion of equations (2.19) to (2.22) is completely unchanged 
when one switches to an arbitrary basis. Its conclusions: 
expansion, u = e a u a , o — o a cu a ; 

calculation of components, u a = < cu a , i/>, o a = <a, e a )\ 
value of form on vector, (a, i/> = o a u a . 

Application to gradients of functions: 
expansion, df = f a <" a [defines f ]; 

calculation of components, f a = < dfe a > = d e J [see equation (2.17)]. 
Raising and lowering of indices is accomplished with g a/3 and g afi [equations 
(8.5) and (8.6)]. Proof: 

u , the 1-form corresponding to i/, is defined by (u,v) = u a v for all v\ 
thus, u a = (u 9 e a ) = u • e a = u#e fi • e a = u*g fia ; 
inverting this equation yields u& — g^ a u a . 


C. Change of Basis 


The discussion of Lorentz transformations in equations (2.39) to (2.43) is applicable 
to general changes of basis if one replaces \ \A a f\\ by an arbitrary but nonsingular 
matrix \\L a 'p\\ [equations (8.7), (8.8)]. Conclusions: 


w a = L a ’p w 13 , 
v a ‘ = L a pV 13 , 

°oc' 


ep = e a ,L a 'p\ 

uj 13 = L& a .ui a '\ 
vf* = L^ a .v a '\ 
Op = 0 a .L a ' p. 


When both bases are coordinate bases, then L 13 a . = dx^/dx a \ L a 'p = dx a '/dx&. 
Proof: 


dx a ' dx a ' dx 13 dx a ‘ 6/8 ’ 


similarly e $ = — 0 *. 
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Box 8.4 (continued) 

D. Algebra of Tensors 

The discussions of tensor algebra given in §3.2 [equations (3.8) to (3.22)] and in 
§3.5 (excluding gradient and divergence) are unchanged, except that 

Vaft gafi r* g°*, JPft — L«ft, A*, —>- 

and the components of the Levi-Civita tensor are changed from (3.50) to (8.10) 
[see exercise 8.3[. 

Chief conclusions: 

expansion, S = S a fi y e a ® ujP ® cu y ; 
components, S a fiy = S(cu a , e fi9 e y ); 
raising and lowering indices, v = g^g vy s% y ; 
change of basis, S y ^ = L y a LP fU ,S a fiy \ 
machine operation, S(p , u, v) = S a ^ y o a u^v y \ 
tensor product, T = u ® v ■<=>• T a & = u a v&\ 

contraction, “AT = contraction of R on slots 1 and 3” -*=>- M^ v = R a mv \ 
wedge product, a A 0 has components a ti 0 p — 0^a v \ 

Dual, *J a0y = *F afi = *B a = lB^e XflPa . 


E. Commutators (exercise 8.2; §9.6; Box 9.2) 

If u and v are tangent vector fields, one takes the view that u = d u and v — d V9 
and one defines 


[f/, v] — [d m d v ] = d u d v - d ¥ d u . 

This commutator is itself a tangent vector field. 

Components in a coordinate basis: 

[u, v I = ( h'V p - v fi u a p yd/dx a ). 

*-[=•«] 

Commutation coefficients of a basis: 

[®at» “ C a p y G y? —— gyn' 

Coordinate basis (“holonomic”) c a/3 y = 0; 

Noncoordinate basis (“anholonomic”) some c a ^ y ^ 0 (see exercise 9.9). 


j 
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curved spacetime there is no global Lorentz coordinate system in which to hold 
components fixed; and objects initially parallel, after “parallel transport” along 
different curves cease to be parallel (“geodesic deviation”; Earth’s meridians, parallel 
at equator, cross at north and south poles). Thus, in curved spacetime one must 
not blithely move a geometric object from point to point, without carefully specifying 
how it is to be moved and by what route. Each local geometric object has its own 
official place of residence (event & 0 ); it can interact with other objects residing there 
(tensor algebra); but it cannot interact with any object at another event 2, until 
it has been carefully transported from ( 3 > 0 to 2. 

This line of reasoning, pursued further, leads one to speak of the “tangent space M 
at each event, in which that event’s vectors (arrows) and 1-forms (families of surfaces) 
lie, and in which its tensors (Unear machines) operate. One even draws heuristic 
pictures of the tangent space, as in Figure 9.1 (p. 231). 

Another danger in curved spacetime is the temptation to regard vectors as arrows 
linking two events (“point for head and point for tail”)—i.e., to regard the tangent 
space of Figure 9.1 as lying in spacetime itself. This practice can be useful for heuristic 
purposes, but it is incompatible with complete mathematical precision. (How is the 
tangent space to be molded into a warped surface?) Four definitions of a vector 
were given in Figure 2.1 (page 49): three definitions relying on “point for head 
and point for tail”; one, “d$/dX\ purely local. Only the local definition is wholly 
viable in curved spacetime, and even it can be improved upon, in the eyes of 
mathematicians, as follows. 

There is a one-to-one correspondence (complete “isomorphism”) between vectors 
u and directional derivative operators d u . The concept of vector is a bit fuzzy, but 
“directional derivative” is perfectly well-defined. To get rid of all fuzziness, exploit 
the isomorphism to the full: define the tangent vector u to be equal to the corre¬ 
sponding directional derivative 

u = d a . (8.12) 


Vectors and tensors must not 
be moved blithely from point 
to point 


Tangent space defined 


Definitions of vector in 
curved spacetime: 

(1) as d<?/d\ 


(2) as directional derivative 


(This practice, unfamiliar as it may be to a physicist at first, has mathematical power; 
so this book will use it frequently. For a fuller discussion, see §9.2.) 


Exercise 8.1. PRACTICE WITH TENSOR ALGEBRA EXERCISES 

Let t, jt,y, 2 be Lorentz coordinates in fiat spacetime, and let 

r = (x 2 + y 2 + z 2 ) 1/2 , 9 - cos ~\z/r), = tan _1 (y/*) 

be the corresponding spherical coordinates. Then 

e 0 = d&/dt y e r — d&/dr, e e = 0^/00, = 0 ^ 70 <f> 

is a coordinate basis, and 

_ d\? _ d°j> 1 0ff 1 0g 

e o- d t 9 e ~r — ’ e °- r cO' 6 * r sin 9 3<£> 


is a noncoordinate basis. 
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(a) Draw a picture of e e , e 0 , and e^ at several different points on a sphere of constant 
/, r. [Answer for e 9% e 6 should resemble Figure 9.I.] 

(b) What are the I-form bases and {cu a } dual to these tangent-vector bases? [Answer: 
uj 0 = dt, cu r = dr ; w* = d9 % cu° = </<>; cu° — dt. ut T = dr , uj 6 — rdO , lu^ = r sin 9 dg>.] 

(c) What is the transformation matrix linking the original Lorentz frame to the spherical 
coordinate frame {e Q }? [Answer: nonzero components are 

L* 0 — 1, L Z T - ~ = cos U e — = -r sin 0, 

or co 

L x r = sin 9 cos <>, U e = r cos 6 cos <>, L x 0 = — r sin 9 sin <£, 

L v r = sin 0 sin <>, L% = r cos 9 sin <>, = r sin 9 cos <>.] 

(d) Use this transformation matrix to calculate the metric components g af3 in the spherical 
coordinate basis, and invert the resulting matrix to get g a &. [Answer: 

goo = - I> grr = l gee = r 2 > g^ = r 2 sin 2 9, all other g a/3 = 0. 

g 00 = — 1, g" = 1, g** = r“ 2 , g 00 = r~ 2 sin“ 2 0, all other g af3 = 0.] 

(e) Show that the noncoordinate basis {e^} is orthonormal everywhere; i.e., that g^fi = 
i.e. that 

g = —a; 6 0 a> () -f uj r 0 cu r + uj 6 0 uj 6 + a/ 0 

(f) Write the gradient of a function /in terms of the spherical coordinate and noncoordi¬ 
nate bases. [Answer: 


df- if d, 4 - 0 / rfrJ . ^ dt) 4 - 

df ~ a7‘* + 37 ,fr + W d * + 0 ?^ 

3/ 6 0/ - 1 0/ » 13/ 

— — CU 0 4 -— 6U -!-— UJ 6 4 - - i 

dt dr r d9 r sin 9 ' 


(g) What are the components of the Levi-Civita tensor in the spherical coordinate and 
noncoordinate bases? [Answer for coordinate basis: 


«or«0 = -«rO00 = +«r9O0 = • • • = r 2 sin e, 

e or90 _ _ e rO90 _ +e r90O _ . . . _ _ r" 2 sin” 1 6».] 

Exercise 8.2. COMMUTATORS 

Take the mathematician’s viewpoint that tangent vectors and directional derivatives are the 
same thing, u = d u . Let u and v be two vector fields, and define their commutator in the 
manner familiar from quantum mechanics 

[u, v] = [d w 0„] = 0 U 0„ - 0„ 0„. (8.13a) 

(a) Derive the following expression for [i#, v\, valid in any coordinate basis, 

[u, v] = (u /3 v a p - v^u a J3 )e a . (8.13b) 

Thus, despite the fact that it looks like a second-order differential operator, [i/, v] is actually 
of first order—i.e., it is a tangent vector. 

(b) For any basis {e a }, one defines the “commutation coefficients' 9 c^ y a and Cp ya by 

[e^, ey] = Cp y a e a \ Cp ya — ga/i^py* 1 ' (8.14) 

Show that Cp y a = Cp ya = 0 for any coordinate basis. 
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(c) Calculate c^ a for the spherical noncoordinate basis of exercise 8.1. [Answer: All vanish 
except 

c-J = -cv? = -1 /r, 

C-ri? = -Cfr* = -1A> 

C V^ = -c= -cot e/r.] 

Exercise 8.3. COMPONENTS OF LEVI-CIVITA TENSOR IN 
NONORTHONORMAL FRAME 

(a) Show that expressions (8.10) are the components of e in an arbitrary basis, with e 0 
pointing toward the future and e v e 2 , e 3 right-handed. [Hints: (1) Review the discussion 
of e in Lorentz frames, given in exercise 3.13. (2) Calculate e a(iy8 and e afiy5 by transforming 
from a local Lorentz frame {e^}, e.g., 

e a(3y8 ~ 0^y^S e tLy%p m 

(3) Show that these expressions reduce to 

e a0ys = detlU^II^,, *** = detllL'-lie^. 

(4) Show, from the transformation law for the metric components, that 

(detll//“ll) 2 detail = -1. 

(5) Combine these results to obtain expressions (8.10).] 

(b) Show that the components of the permutation tensors [defined by equations (3.50h)- 
(3.50J)] have the same values [equations (3.50k)-(3.50m)] in arbitrary frames as in Lorentz 
frames. 

Additional exercises on tensor algebra: exercises 9.3 and 9.4 (page 234). 


§8.5. PARALLEL TRANSPORT, COVARIANT DERIVATIVE, 
CONNECTION COEFFICIENTS, GEODESICS 


The vehicle that carries one from classical mechanics to quantum mechanics is the 
correspondence principle. Similarly, the vehicle between flat spacetime and curved 
spacetime is the equivalence principle: “The laws of physics are the same in any 
local Lorentz frame of curved spacetime as in a global Lorentz frame of flat space- 
time.” But to apply the equivalence principle, one must first have a mathematical 
representation of a local Lorentz frame. The obvious choice is this: A local Lorentz 
frame at a given event < 3 > 0 is the closest thing there is to a global Lorentz frame at 
that event; i.e., it is a coordinate system in which 


Equivalence principle as 
vehicle between flat 
spacetime and curved 


Local Lorentz frame: 
mathematical representation 


g^o) = V- (8.15a) 

and in which gholds as tightly as possible to ij in the neighborhood of $ 0 : 

- °- ( 8 - l5b ) 

More tightly than this it cannot hold in general [g^ Vya ^ 0 ) cannot be set to zero]; 
spacetime curvature forces it to change. [Combine §11.5 with equations (8.24) and 
(8.44).] 
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Parallel transport defined 


Covariant derivative defined 


Gradient defined 


Connection coefficients 
defined 


An observer in a local Lorentz frame in curved spacetime can compare vectors 
and tensors at neighboring events, just as he would in flat spacetime. But to make 
the comparison, he must parallel-transport them to a common event. For him the 
act of parallel transport is simple: he keeps all Lorentz-frame components fixed, 
just as if he were in flat spacetime. But for a man without a local Lorentz frame— 
perhaps with no coordinate system or basis vectors at all—parallel transport is less 
trivial. He must either ask his Lorentz-based friend the result, or he must use a more 
sophisticated technique. One technique he can use—a “Schild’s ladder” construction 
that requires no coordinates or basis vectors whatsoever—is described in §10.2 and 
Box 10.2. But the Track-1 reader need not master Schild’s ladder. He can always 
ask a local Lorentz observer what the result of any given parallel transport is, or 
he can use general formulas worked out below. 

Comparison by parallel transport is the foundation on which rests the gradient 
of a tensor field, V7. No mention of parallel transport was made in §3.5, where 
the gradient was first defined, but parallel transport occurred implicitly: one defined 
V7 in such a way that its components were T a ^ y = 07^/0*? [for T a (}) tensor]; 
i.e., one asked V7 to measure how much the Lorentz-frame components of T change 
from point to point. But “no change in Lorentz components” would have meant 
“parallel transport,” so one was implicitly asking for the change in T relative to 
what T would have been after pure parallel transport. 

To codify in abstract notation this concept pf differentiation, proceed as follows. 
First define the “covariant derivative” V U T of T along a curve ^(A), whose tangent 
vector is u = dtP/dX: 

(▼ u r) at * 0 ) = Lim I Il f^WaUel-transported to ~ (g 16 ) 

(See Figure 8.2 for the special case where T is a vector field v.) Then define 77 
to be the linear machine, that gives V a T when u is inserted into its last slot: 

▼ 7"(. = Y U T. (8.17) 

The result is the same animal (“gradient”) as was defined in §3.5 (for proof see 
exercise 8.8). But this alternative definition makes clear the relationship to parallel 
transport, including the fact that 

V U T = 0 -*==*- T is parallel-transported along u = dty/dX. (8.18) 

In a local Lorentz frame, the components of V T are directional derivatives of 
the components of T : T^ ay . Not so in a general basis. If { e^)) is a basis that 
varies arbitrarily but smoothly from point to point, and {w a ($>)} is its dual basis, 
then VT = ^(T^ a e^ <8> w a ) will contain contributions from Ve^ and Va/ a , as well 
as from V7*, = dT? a = T^ a y ujy. 

To quantify the contributions from Ve^ and Va/ a , i.e., to quantify the twisting, 
turning, expansion, and contraction of the basis vectors and 1-forms, one defines 
“connection coefficients 
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Figure 8.2. 

Definition of the covariant derivative of a vector field v along a curve ^(X), 

with tangent vector u == d L f/dX: (1) choose a point ^(0) on the curve, at which to 
evaluate 7 u v, (2) Choose a nearby point on the curve. (3) Parallel-transport 
^[^(e)] along the curve back to S?(0), getting the vector v n [eP(e)]. (4) Take the 
difference 5*= Vj|[£?(e)] — vpP(0)]. (5) Then V u v is defined by 




Lim 

e -*0 


8v 


■ = Lim 

e-*0 


vm*)\ - ^(O)ii 


_ a [Note reversal of fi and y to make the"] 

py \ w ^y e ^> [ differentiating index come last on T J ^ 

Usv ey ] 

( a component of change in e^, relative\ 
to parallel transport, along e y / 

and one proves (exercise 8.12) that 

<V“,e,) = -^V ( 8 - 19b > 

In terms of these coefficients and 

T\y = Vy T\ = a U T\ = a y TP a , (8.20) 

the components of the gradient, denoted T^ a;y , are 

Tfi a;y = + r^ y T\ - r\ y T\ ( 8 . 21 ) 

(see exercise 8.13). If the basis at the event where VT is calculated were a local 
Lorentz frame, the components of VT would just be T& a T Because it is not, one 
must correct this “Lorentz-frame” value for the twisting, turning, expansion, and 
contraction of the basis vectors and 1-forms. The ‘TF terms in equation (8.21) 
are the necessary corrections—one for each index of T. The pattern of these correction 
terms is easy to remember: (1) “ + ” sign if index being corrected is up, “ — ” sign 
if it is down; (2) differentiation index (y in above case) always at end of T; (3) 
index being corrected (ft in first term, a in second) shifts from T onto r and gets 
replaced on T by a dummy summation index (ja). 


Components of gradient 
arbitrary frame 
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Components of covariant 
derivative 


Calculation of connection 
coefficients from metric and 
commutators 


Knowing the components (8.21) of the gradient, one can calculate the components 
of the covariant derivative V U T by a simple contraction into u y [see equation (8.17)]: 

= ® w a . ( 8 . 22 ) 

When u is the tangent vector to a curve u = dfP/dX, one uses the notation 
DT p a /dX for the components of V U T: 


dX 


— T 13 u y = T 13 — 

— «;y dx 

— (TP a y + "f 7 ” corrections) dx y /dX 

dTP dx y 

=- 9L + (fB jn _ rn jB \ _ 

dX + 1 M “ ay dX 


(8.23) 


The in T l3 a . y reminds one to correct T& a y with “TT” terms; similarly, the “Z>” 
in DT^JdX reminds one to correct dT^JdX with “/T terms. 

This is all well and good, but how does one find out the connection coefficients 
r a p y for a given basis? The answer is derived in exercise 8.15. It says: (1) take the 
metric coefficients in the given basis; (2) calculate their directional derivatives along 
the basis directions 


&By,ii — ^n§By 

^ [=d e J ^ a coordinate basis, = d&fdx 11 , is being used] 


(8.24a) 


(3) calculate the commutation coefficients of the basis [equations (8.14) in general; 
c nBy = ® * n s P ec i a l case of coordinate basis]; (4) calculate the “covariant connection 
coefficients” 


^nBy 2 "i” &t*y,P &By,n ^jiPy ~h ^ij-'/B ^ (8.24b) 


[ these terms are 0 fori_ 

L coordinate basis J 

(5) raise an index to get the connection coefficients: 

r %y = ( 8 - 24 c) 

[Note on terminology: a coordinate basis always has c a/Jy = 0, and is sometimes called 
holonomic; a noncoordinate basis always has some of its c a ^ y nonzero, and is 
sometimes called anholonomic . In the holonomic case, the connection coefficients 
are sometimes called Christoffel symbols.] 

The component notation, with its semicolons, commas, Z)’s, connection coefficients, 
etc., looks rather formidable at first. But it bears great computational power, one 
discovers as one proceeds deep into gravitation theory; and its rules of manipulation 
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are simple enough to be learned easily. By contrast, the abstract notation (V7, V U T, 
etc.) is poorly suited to complex calculations; but it possesses great conceptual power. 

This contrast shows clearly in the way the two notations handle the concept of 
geodesic. A geodesic of spacetime is a curve that is straight and uniformly parame- Geodesic and affine 
trized, as measured in each local Lorentz frame along its way. If the geodesic is parameter defined 
timelike, then it is a possible world line for a freely falling particle, and its uniformly 
ticking parameter A (called “affine parameter ”) is a multiple of the particle’s proper 
time, A = ar + b. (Principle of equivalence: test particles move on straight lines in 
local Lorentz frames, and each particle’s clock ticks at a uniform rate as measured 
by any Lorentz observer.) This definition of geodesic is readily translated into 
abstract, coordinate-free language: a geodesic is a curve A) that parallel-transports 

its tangent vector u = d*?/dX along itself— 

y u a = 0. (8.25) 

(See Figure 10.1.) What could be simpler conceptually? But to compute the geodesic, 
given an initial event 9 0 and initial tangent vector u(& 0 ) there, one must use the 
component formalism. Introduce a coordinate system x a (S?), in which u a = dx a /dX, 
and write the component version of equation (8.25) as 

D(dx a /dX) d(dx a /dX) / dx fX \dx y 
° = dX = dX + \ ra » y -dX!~dX 

[see equation (8.23), with one less index on T]\ i.e., 

iLjL - y fa dx y q (8.26) Geodesic equation 

dX 2 * y dX dX K ' 

This geodesic equation can be solved (in principle) for the coordinates of the geodesic, 
x a (X), when initial data [x a and dx a /dX at A = A 0 ] have been specified. 

The geodesics of the Earth’s surface (great circles) are a foil against which one 
can visualize connection coefficients; see Figure 8.3. 

The material of this section is presented more deeply and from a different view¬ 
point in Chapters 10 and 13. The Track-2 reader who plans to study those chapters 
is advised to ignore the following exercises. The Track-1 reader who intends to skip 
Chapters 9-15 will gain necessary experience with the component formalism by 
working exercises 8.4-8.7. Less important to him, but valuable nonetheless, are 
exercises 8.8-8.15, which develop the formalism of covariant derivatives and con¬ 
nection coefficients in a systematic manner. The most important results of these 
exercises will be summarized in Box 8.6 (pages 223 and 224). 



Exercise 8.4. PRACTICE IN WRITING COMPONENTS OF GRADIENT EXERCISES 

Rewrite the following quantities in terms of ordinary derivatives (f y = d e J^^ y f) and 
“FT” correction terms: (a) T. y where T is a function, (b) T a . y where T is a vector, (c) T a . y 
where T is a 1-form, (d) T a ^. y . [Answer: 


(a) T. y = T y . (b) T*. y = T\ y + r% 7 7> (c) T a . y = T a y - r\ y Tf 

+ ra fl y Tft 0 8 


(d) rv ;Y ^ r V,y 




Pfi T a e 4- r* T a V 
1 hy* fin ' 1 ny* fi& 
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Figure 8.3. " 

The why of connection coefficients, schematically portrayed. The aviator pursuing his great circle route 
from Peking to Vancouver finds himself early going north, but later going south, although he is navigating 
the straightest route that is at all open to him (geodesic). The apparent change in direction indicates 
a turning, not in his route, but in the system of coordinates with respect to which his route is described. 
The vector v of his velocity (a vector defined not on spacetime but rather on the Earth’s two-dimensional 
surface), carried forward by parallel transport from an earlier moment to a later moment, finds itself 
in agreement with the velocity that he is then pursuing; or, in the abstract language of coordinate-free 
differential geometry, the covariant derivative V v v vanishes along the route (“equation of a geodesic”). 
Though v is in this sense constant, the individual pieces of which the navigator considers this vector 
to be built, v = v*e e + i^e 0 , are not constant. 

In the language of components, the quantities v 9 and v * are changing along the route at a rate that 
annuls the covariant derivative of v; thus 


V v v = a = + a 9 e e = 0, 

or 


0 = ^ = IF + 

In this sense the connection coefficients r j mn serve as “turning coefficients” to tell how fast to “turn” 
the components of a vector in order to keep that vector constant (against the turning influence of the 
base vectors). 

Alternatively, the navigator can use an “automatic pilot system” which parallel-transports its own base 
vectors along the plane’s route: 

= 0 ; 

solid vectors at tf become dotted vectors at Then the components of v must be kept fixed to achieve 
a great-circle route, 

dv e> _ dv*' _ q 
dt ~~dT~ ’ 

and the turning coefficients are used to describe the turning of the lines of latitude and longitude relative 
to this parallel-transported basis: 

V v e e = e m r m $n v n , 

= e/V n 

The same turning coefficients enter into both viewpoints. The only difference is in how these coefficients 
are used. 




§8.5. PARALLEL TRANSPORT AND COVARIANT DERIVATIVE 


213 


Exercise 8.5. A SHEET OF PAPER IN POLAR COORDINATES 

The two-dimensional metric for a flat sheet of paper in polar coordinates (r, 6) is ds 2 = dr 2 
+ r 2 d $ 2 — or, in modem notation, g = dr dr + r 2 d<$> </<*>. 

(a) Calculate the connection coefficients using equations (8.24). [Answer: T r 00 = — r; 
r 0 r0 = r 0 0r = 1/r; all others vanish.] 

(b) Write down the geodesic equation in (r, <». coordinates. [Answer: d 2 r/d\ 2 - 
r(d$/d\) 2 = 0 ; d 2 $/d\ 2 + ( 2 /r){dr/d\)(d$/dX) = 0 .] 

(c) Solve this geodesic equation for r(X) and <£>(A), and show that the solution is a uniformly 
parametrized straight line (x = r cos <£> = ak + b for some a and b; y = r sin 4> = j\ + k for 
some j and k). 

(d) Verify that the noncoordinate basis e T = e T = df/dr, e^ — r^e^ r* 1 0^/04>, 
u) T - dr, tv* = rd<$> is orthonormal, and that < cu a , e#) = Then calculate the connection 
coefficients of this basis from a knowledge [part (a)] of the connection of the coordinate 
basis. [Answer: 

r*# = <w*. ▼*•*> = (rd<$>, 

= r(d<j>, (V r /- _1 )e 0 + = r ( d $< ~ r 2e 0> + <<*£• v r e <i> 

= -r~ l + r% r = -r-' + r 1 = 0; 

similarly, = + \/r, r' T ^ = -1/r; all others vanish.] 

(e) Consider the Keplerian orbit of Figure 8.1 and §8.3 as a nongeodesic curve in the 
sun's two-dimensional, Euclidean, equatorial plane. In place of the old notation dv/dt , de- r /dt, 
etc., use the new notation V v v, V v e T , etc. Then v = dPfdt is the tangent to the orbit, and 
a — y v | s t he acceleration. Derive equations (8.4) for a T and a 0 using component manipu¬ 
lations and connection coefficients in the orthonormal basis. 

Exercise 8.6. SPHERICAL COORDINATES IN FLAT SPACETIME 

The spherical noncoordinate basis {e^} of Exercise 8.1 was orthonormal, g- a $ = tj^ 9 but 
had nonvanishing commutation coefficients [part (c) of Exercise 8.2], 

(a) Calculate the connection coefficients for this basis, using equations (8.24). [Answer: 

r** = r** = -^#1= i/n 

r%= -r*# = cote/n 

all others vanish.] 

(b) Write down expressions for in terms of By, and verify the correctness of these 

expressions by drawing sketches of the basis vectors on a sphere of constant t and r. [Answer: 

^6 e r ” r le b ^e e 6 = — r le r» — r e & 

= (cot 6/r)e& - (cot 0/r)e § . 

All others vanish.] 

(c) Calculate the divergence of a vector, V • A = A“.^ in this basis. [Answer: 

V • A = A\ { + r~ 2 (r 2 A ? ) ? -h (sin ^(sin 0A*)j + A*j 

_ dA* 1 d(r 2 A f ) 1 9(sin 6A 9 ) 1 dA* 

“ 9/ r 2 dr rsin# 00 rsin0 0<> 

This answer should be familiar from flat-space vector analysis.] 

Exercise 8.7. SYMMETRIES OF CONNECTION COEFFICIENTS 

From equation (8.24b), the symmetry of the metric, and the antisymmetry 7 (c^ = — c y/jM ) 
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of the commutation coefficients, show that: r a[ p y] = 0 (last two indices are symmetric) in 
a coordinate basis: r\^ )y — 0 (first two indices are antisymmetric) in a globally orthonormal 
basis, g- afi — Vafi' 

SYSTEMATIC DERIVATION OF RESULTS IN §8.5 

Exercise 8.8. NEW DEFINITION OF V7 COMPARED WITH 
OLD DEFINITION 

The new definition of V7" is given by equations (8.16) and (8.17). Use the fact that parallel 
transport keeps local-Lorentz components fixed to derive, from (8.16), the Lorentz-frame 
equation = T^ ay u y e^ ® ui a . From this and equation (8.17), infer that the Lorentz- 
frame components of VT are T # ay —which accords with the old definition of VT. 

Exercise 8.9. CHAIN RULE FOR V„7" 

(a) Use calculations ifl a local Lorentz frame to show that “V u ” obeys the standard chain 
rule for derivatives: 

\(f A ® B ) = (▼„/)-* ® fl + f(V u A) ® B +fA ® (V U B). (8.27) 

Here A and B are arbitrary vectors, 1-forms, or tensors; and/is an arbitrary function. [Hint: 
assume for concreteness that A is a (}) tensor and B is a vector. Then this equation reads, 
in Lorentz-frame component notation, 

+f(A a ^ i u s )Bf + (8.27') 

(b) Rewrite equation (8.27) in component notation in an arbitrary basis. [Answer: same 
as (8.27'), except is replaced everywhere by But note that f. s u 5 -f <s u \because the 
function f “has no components to correct”.] 

Exercise 8.10. COVARIANT DERIVATIVE COMMUTES WITH 
CONTRACTION 

(a) Let S be a (|) tensor. Using components in a local Lorentz frame show that 

(contraction on slots 1 and 2 of S) = (contraction on slots 1 and 2 of VS). (8.28) 

[Hint: in a local Lorentz frame this equation makes the trivial statement 

(S “ y = £ (S'Vy" 7 )-] 

Exercise 8.11. ALGEBRAIC PROPERTIES OF V 

Use calculations in a local Lorentz frame to show that 

^ au+bv^ — ci ^ U S + b VJ V S (8.29) 

for all tangent vectors i/, v and numbers a , b\ also that 

V U (S + M) = V U S + V U M 

for any two tensor fields S and M of the same rank; also that 


(8.30) 
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*U W - ▼«*!# = [U, w]. 


(8.31) 



commutator of u and 
discussed in exercise 


w; 

8.2 




for any two vector fields u and iv. 

Exercise 8.12. CONNECTION COEFFICIENTS FOR 1-FORM BASIS 

Show that the same connection coefficients r a jiy that describe the changes in {e^} from 
point to point [definition (8.19a)] also describe the changes in {w a } 9 except for a change 
in sign [equation (8.19b)]. { Answer: (1) (a/ Q , e^) = 8 a p is a constant function (0 or 1, de¬ 
pending on whether a = /?). (2) Thus, V y (uj a , e p ) = d e <a/ a , = 0. (3) But < cv a , e^) is 

the contraction of cv a <S> e^, so equation (8.28) implies 0 = y y (contraction of cv a 9 e^) 
= contraction of [y y (o/ a 9 e^)]. (4) Apply the chain rule (8.27) to conclude 0 = contraction 
of [(V y oF a ) 9 ©£ 4- (v a 9 (VyC^)] = (V y cu a , ep) 4- ( cv a , Vye^). (5) Finally, use defini¬ 
tion (8.19a) to arrive at the desired result, (8.19b).} 

Exercise 8.13. ' TV ' CORRECTION TERMS FOR T^ ay 

Derive equation (8.21) for T^ a;y in an arbitrary basis by first calculating the components 
of VJ for arbitrary u , and by then using equation (8.17) to infer the components of V7. 
[Answer; (1) Use the chain rule (8.27) to get 

= (^ u T\)e 0 9 w« + T»jy m e $ ) 9 cu« + T^ a e 0 9 (V>«). 

(2) Write u in terms of its components, u = u y e y ; use linearity of y u in u from equation 
(8.29), to get V u — u y y y ; and use this in y U T\ 

▼J = u y {T\ y e p 9 cu« + TP a (V y e 0 ) 9 a/« + 9 (y y cu«)}. 

(3) Use equations (8.19a,b), rewritten as 

V 7 e /3 = r V,e„> ▼r"“ = (8.32) 


to put y„r in the form 

= «n^ a , v e^ 0 a/“ + ® <v° - r% y T^ fi 0 a/"}. 

(4) Rename dummy indices so that the basis tensor 0 a/" can be factored out: 

▼„?- = «nrv T + r^ y r\ - ® «/“. 

(5) By comparison with 

▼„r = vr(.® 

read off the value of 7'^ a .. / .] 

Exercise 8.14. METRIC IS COVARIANTLY CONSTANT 

Show on physical grounds (using properties of local Lorentz frames) that 


Vg = 0 


(8.33) 
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or, equivalently, that V u g = 0 for any vector u. Then deduce as a mathematical consequence 
the obviously desirable product rule 

V (4 • B) = (Y^) * B + A - (Y U B). 

[Answer: (1) As discussed following equation (8.18), the components of Vg in a local Lorentz 
frame are g^,^. Just use g for T in that discussion. But these components all vanish by equation 
(8.15b). Therefore equation (8.33) holds in this frame, and—as a tensor equation—in all 
frames. (2) The product rule is also a tensor equation, true immediately via components 
in a local Lorentz frame. (3) Prove the product rule also the hard way, to see where equation 
(8.33) enters. Use the chain rule of exercise 8.9 to write 

V u (g ® A ® B) = (Y u g) ® A ® B + g ® (Y^) ® B 
+ 9® A® (Y U B). 

Use Equation (8.33) to drop one term, then contract, forming 

A • B = contraction (c/ ® B) 

and the other inner products. Exercise 8.10 is used to justify commuting the contraction 
with Y w on the lefthand side.] 

Exercise 8.15. CONNECTION COEFFICIENTS 
IN TERMS OF METRIC 

Use the fact that the metric is covariantly constant [equation (8.33)] to derive equation (8.24b) 
for the connection coefficients. Treat equation (8.24c) as a definition of r^ y in terms of 
r a py' [Answer: (1) Calculate the components of Vg in an arbitrary frame: 

&*0;Y 0 — — r ^ayg^fi — ^^(iygfia 

~ ^p a y ^afty* 

thereby conclude that g afit y = 2 f {a(i)y . (Round brackets denote symmetric part.) (2) Construct 
the metric terms in the claimed answer for r^ y : 

~2 ““ £fty,p) “ (if})y + ^{fiy)ft ~ ^(fty)ti 

~ ~2 ^M0Y + ^ ft py + rpyft + FyiLft ~ ^ftyp *“ ?yft /J 
= rnfty + ( ^p[fty) + ^ftlny) + ^ylpft])- 

(3) Infer from equation (8.31), with u and w chosen as two basis vectors (u — e w = e ) 
that 

= [•„. e w ] = Y p e, - Y r e p = (/*%„ * r^ v )e p = 2 r^e p ; 

i.e., 

r p [ftp i — ~2 c n* P * — ~ y c pvp’ (8.34) 

(4) This, combined with step (2) yields the desired formula for r^ y .\ 
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§8.6. LOCAL LORENTZ FRAMES: 

MATHEMATICAL DISCUSSION 


An observer falling freely in curved spacetime makes measurements in his local 
Lorentz frame. What he discovers has been discussed extensively in Parts I and II 
of this book. Try now to derive his basic discoveries from the formalism of the last 
section. 

Pick an event $ ) 0 on the observer’s world line. His local Lorentz frame there is 
a coordinate system x a {^) in which 


Sap — e a* e p 




(8.35a) 


(Lorentz metric at f? 0 ), and in which 

dg af3 /dx* = 0 at 9 0 (8.35b) 

(metric as Lorentz as possible near & 0 ). [See equation (8.15).] In addition, by virtue 
of equations (8.24), 

r*n = 0 at 3> 0 (8.36) 

(no “correction terms” in covariant derivatives). Of course, the observer must be 
at rest in his local Lorentz frame; i.e., his world line must be 


x’ = x j (^ 0 ) = constant; x° varying. (8.37) 

Query: Equations (8.35) to (8.37) guarantee that the observer is at rest in a local 
Lorentz frame. Do they imply that he is freely falling? (They should!) Answer: 
Calculate the observer’s 4-acceleration a = du/dr (notation of chapter 6) = 
(notation of this chapter). His 4-velocity, calculated from equation (8.37) is 


u = (dx a /dr)e a = ( dx°/dr)e 0 = e 0 ; 

f because u and e 0 both] 

L have unit length 


(8.38) 


so his 4-acceleration is 


a — ▼„!/ — V 0 e 0 — r a 00 e a 
= 0 at 


(8.39) 


Thus, he is indeed freely falling (a =0); and he moves along a geodesic (▼„*/ = 0). 

Query: Do freely falling particles move along straight lines (d 2 x a /dT 2 = 0) in the 
observer’s local Lorentz frame at ^ 0 ? (They should!) Answer: A freely falling particle 
experiences zero 4-acceleration 

a particle “ ^u parXicle U p article ~ 

i.e., it parallel-transports its 4-velocity; i.e., it moves along a geodesic of spacetime 


Local Lorentz frame: 


Origin falls freely along a 
geodesic 


Freely falling particles move 
on straight lines 
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with affine parameter equal to its proper time. The geodesic equation for its world 
line, in local Lorentz coordinates, says 

d 2 x a fa dx* dx v 
dr 2 dr dr 

= 0 at 

The particle’s world line is, indeed, straight at <? 0 . 

Basis vectors at origin are Query : Does the freely falling observer Fermi-Walker-transport his spatial basis 

Fermi-Walker transported vectors ey 9 i.e., can he attach them to gyroscopes that he carries? (He should be 

able to!) Answer: Fermi-Walker transport (Box 6.2) would say 

= u (* * - *(«* • e,). 

f 

new 

notation 

But u = e 0 , e 0 * = 0, and a = 0 for the observer; so Fermi-Walker transport in 

this case reduces to parallel transport along e 0 : thus V 0 e ; . = 0. This is, indeed, how 
e j is transported through because 

▼<>•/ = fa i 0 *a = 0 at 9 0 . 


dj 

t 

old 

notation 


§8.7. GEODESIC DEVIATION AND 

THE RIEMANN CURVATURE TENSOR 

“Gravitation is a manifestation of spacetime curvature, and that curvature shows 
up in the deviation of one geodesic from a nearby geodesic (relative acceleration 
of test particles).” To make this statement precise, first quantify the “deviation” or 
“relative acceleration” of neighboring geodesics. 

Focus attention on a family of geodesics n)\ see Figure 8.4. The smoothly 
varying parameter n (“selector parameter”) distinguishes one geodesic from the next. 
For fixed n, n) is a geodesic with affine parameter A and with tangent vector 


u = 0^/0A; 

(8.40) 

thus V u u — 0 (geodesic equation). The vector 


n = d^/dn 

(8.41) 


measures the separation between points with the same value of A on neighboring 
geodesics. 

An observer falling freely along the “fiducial geodesic” n = 0 watches a test 
particle fall along the “test geodesic” n = 1. The velocity of the test particle relative 
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test geodesic, n = 1. 


to him he quantifies by V„n This relative velocity, like the separation vector n, 
is an arbitrary “initial condition.” Not arbitrary, however, is the “relative accelera¬ 
tion,” ▼„ V a n of the test particle relative to the observer (see Boxes 11.2 and 11.3). 
It would be zero in flat spacetime. In curved spacetime, it is given by 

V u V u n + Riemann (. .. ,u,n,u) = 0, (8-42) 

or, in component notation, 

= (8.43) 

This equation serves as a definition of the “Riemann curvature tensor;” and it can 
also be used to derive the following expressions for the components of Riemann 
in a coordinate basis: 


R a p y S = <C/-V“,[V y,^]ep> 


dr a 


pi 


d r a 


(8.44) 


Sx Y 


dx 


ix. 4. r ° — r “ „ r ^ 

x ' * ny* 08 ■* 


(35 


ix8 J py 


Riemann curvature tensor 
defined by relative 
acceleration of geodesics 


Components of Riemann 
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Effects of curvature 


(For proof, read Box 11.4, Box 11.5. and exercise 11.3, in that order.) For a glimpse 
of the man who first analyzed the curvature of spaces with three and more dimen¬ 
sions, see Box 8.5. 

Spacetime curvature causes not only geodesic deviation, but also route dependence 
in parallel transport (parallel transport around a closed curve changes a vector or 
tensor—Box 11.7); it causes covariant derivatives to fail to commute [equation 
(8.44)]; and it prevents the existence of a global Lorentz coordinate system (§11.5). 

At first sight one might think Riemann has 4x4x4x4 = 256 independent 
components. But closer examination (§13.5) reveals a variety of symmetries 


Symmetries of Riemann 


— R[ap)[ys) — ^[ysHajS]’ 


R[apy&] “ 


^a[/3ySj — 0 


(8.45) 


f 

Box 8.5 GEORG FRIEDRICH BERNHARD RIEMANN 

September 17, 1826, Breselenz, Hanover—July 20, 1866, 


Selasca, Lake Maggiore 



v. 


/ 


With his famous doctoral thesis of 1851, “Founda¬ 
tions for a general theory of functions of a single 
complex variable,” Riemann founded one branch 
of modem mathematics (the theory of Riemann 
surfaces); and with his famous lecture of three 
years later founded another (Riemannian geom¬ 
etry). These and other writings will be found in 
his collected works, edited by H. Weber (1953). 

“The properties which distinguish space from 
other conceivable triply-extended magnitudes are 
only to be deduced from experience.... At every 
point the three-directional measure of curvature 
can have an arbitrary value if only the effective 
curvature of every measurable region of space 
does not differ noticeably from zero.” [G. F. B. 
Riemann, “On the hypotheses that lie at the foun¬ 
dations of geometry,” Habilitationsvorlesung of 
June 10, 1854, on entry into the philosophical 
faculty of the University of Gottingen.] 

Dying of tuberculosis twelve years later, occu- 
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(antisymmetry on first two indices; antisymmetry on last two; symmetry under 
exchange of first pair with last pair; vanishing of completely antisymmetric parts). 
These reduce Riemann (in four dimensions) from 256 to 20 independent compo¬ 
nents. 

Besides these algebraic symmetries, Riemann possesses differential symmetries 
called “Bianchi identities/ 9 



R a p[\n'v] ~ 0* (8.46) Bianchi identities 

which have deep geometric significance (Chapter 15). 

From Riemann one can form several other curvature tensors by contraction. The 
easiest to form are the “Ricci curvature tensor/ 9 


"\ 


pied with an attempt at a unified explanation of 
gravity and electromagnetism, Riemann commun¬ 
icated to Betti his system of characterization of 
multiply-connected topologies (which opened the 
door to the view of electric charge as “lines of force 
trapped in the topology of space”), making use of 
numbers that today are named after Betti but that 
are identified with a symbol, R n , that honors Rie¬ 
mann. 

“A more detailed scrutiny of a surface might 
disclose that what we had considered an elemen¬ 
tary piece in reality has tiny handles attached to 
it which change the connectivity character of the 
piece, and that a microscope of ever greater mag¬ 
nification would reveal ever new topological com¬ 
plications of this type, ad infinitum . The Riemann 
point of view allows, also for real space, topologi¬ 
cal conditions entirely different from those realized 
by Euclidean space. I believe that only on the basis 
of the freer and more general conception of geom¬ 
etry which had been brought out by the develop¬ 
ment of mathematics during the last century, and 
with an open mind for the imaginative possibilities 
which it has revealed, can a philosophically fruitful 


attack upon the space problem be undertaken.” 
H. Weyl (1949, p. 91). 

“But.. . physicists were still far removed from 
such a way of thinking; space was still, for them, 
a rigid, homogeneous something, susceptible of ho 
change or conditions. Only the genius of Riemann, 
solitary and uncomprehended, had already won its 
way by the middle of the last century to a new 
conception of space, in which space was deprived 
of its rigidity, and in which its power to take part 
in physical events was recognized as possible.” 
A. Einstein (1934, p. 68). 

Riemann formulated the first known model for 
superspace (for which see Chapter 43), a super¬ 
space built, however, not of the totality of all 3- 
geometries with positive definite Riemannian met¬ 
ric (the dynamic arena of Einstein’s general 
relativity), but of all conformally equivalent closed 
Riemannian 2-geometries of the same topology, a 
type of superspace known today as Teichmtiller 
space, for more on Riemann’s contributions to 
which and the subsequent development of which, 
see the chapters by L. Bers and J. A. Wheeler in 
Gilbert and Newton (1970). 
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Ricci curvature tensor 


Scalar curvature 


Einstein curvature tensor 


Contracted Bianchi identities 


EXERCISE 


R^t, — R a nav 


— fa _ p a . pa p jS _ p Q rf3 

I J ^ J Pa 1 fiv 1 p v J pa' 

*—[in a coordinate frame] 


(8.47) 


and the “scalar curvature, 9 


R = R\- (8.48) 

But of much greater geometric significance is the “Einstein curvature tensor ” 

= R\ - j S* P R. (8.49) 

Of all second-rank curvature tensors one can form by contracting Riemann, only 
Einstein = G retains part of the Bianchi identities (8.46): it satisfies 

G» v . v = 0. (8.50) 

For the beautiful geometric meaning of these “contracted Bianchi identities 99 (“the 

boundary of a boundary is zero”), see Chapter 15. 

Box 8.6 summarizes the above equations describing curvature, as well as the 
fundamental equations for covariant derivatives. 


[The following exercises from Track 2 are appropriate for the Track-1 reader who wishes 
to solidfy his understanding of curvature: 11.6, 11.9, 11.10, 13.7-11, and 14.3.] 

Exercise 8.16. SOME USEFUL FORMULAS IN COORDINATE FRAMES 

In any coordinate frame, define g to be the determinant of the matrix g a(i [equation 8.11]. 
Derive the following relations, valid in any coordinate frame. 

(a) Contraction of connection coefficients: 

r a »« = (In V^g\ 0 . (8.51a) 

[Hint: Use the results of exercise 5.5.] 

(b) Components of Ricci tensor: 

R<xp = (V— gR^ap )“ 0 n (8.51b) 

(c) Divergence of a vector A a or antisymmetric tensor F a P\ 

Aa . a = -^=(V-i-4“), a . = yL=(V^F“%. (8.51c) 

(d) Integral of a scalar field 4' over the proper volume of a 4-dimensional region ‘T"; 

J * ^(proper volume) = f * y/ZJ dt dx dy dz. 


(8.51 d) 
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[Hint: In a local Lorentz frame, ^(proper volume) = dtdx dy dz. Use a Jacobian to transform 
this volume element to the given coordinate frame, and prove from the transformation law 


Safi 


dx}_ cx“ 
dx a dx B ^ 


that the Jacobian is equal to V—£■] 


— 

Box 8.6 COVARIANT DERIVATIVE AND CURVATURE: FUNDAMENTAL EQUATIONS 


Entity 


Abstract notation 


Component notation 


Covariant 

Derivative 


▼„r = vr(......., u) 


V- v=<#«> 


T B „. y u y = DT»Jdk (u = dS’/dK) 
dT B „ 

- + ( r ^J\ - r*Ti>,)u 


dh 


f.a u “ = /««* 


algebraic 
properties 
(Exercise 8.11) 


^au+bv^ ~~ & ^ 

▼„(S + M) — V u s + Y U M 

V u iv — V w u = [«, w] for u. w 

both vector fields 


T» a . y (Mi y + bv‘) = aT B tf + bT». y v y 

(S B a + M B a ). yU y = s» a . yU y + M» a . y u y 

= - J c M y P [equation (8.34)] 


chain rule 


Y u and 

contraction 

commute 


V U (A <8>B) = (Y U A) ® B + A( 8> (Y U B) 

▼„(/*) = (▼„/)* +fV u A 

Y u {contraction of S) 

= (contraction of Y u S) 


(A\B y ).^ = A a p. ll B y u‘‘ + A a pB y . tl u“ 

(2 S “ay) «* = 2 (S^-X) 


* metric 
covariantly 
constant 


▼„ff = 0 


g a g. y U y = 0 


Gradient 


▼r 


Connection 

Coefficients 


r° By = ▼*•„> 

= e /J> 


* Local Lorentz 
frame at Sf* 0 


**.;y = ^a.y + ^y^. - ^7^, 

= r“^y.* 

Pfiy = y J" £**Y,/8 — 

+ c */3y + c /xy/3 “ <W* 

C /3YM “ §fia C (3y a 

= £«<»*, [e*, e yl>* 

Coordinate system with 

o> = V' r<X fiyW*) ~ 0 


Parallel trans¬ 
port 


▼„S = 0 
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Box 8.6 ( continued ) 


Entity 

Abstract notation 

Component noiatton 

Geodesic 

Equation 

o 

II 

£ 

<T-x“idk- + r% r (dx»/dk)(dx , /dk) = 0 
in a coordinate basis 

Riemann 

Curvature 

Tensor 

Riemann (a. C, A, B) = <a. B)C ) 

‘MA, *)==[▼„. y B ] 

(not track-one formulas; see 

Chapter 11) 

R a _ ^' ra fiy 

0yt a** ix s 

+ r* u - r\ t r“ Py 

in coordinate frame 

{see equation (11.13) for formula in non¬ 
coordinate frame] 

Ricci Curvature 
Tensor 

Ricci = contraction on slots 

1 and 3 of Riemann 

R — R n — r° — r a + r* fP 

- r%,r 0 m 

in coordinate frame 

* Curvature 

Scalar 

R = (contraction of Ricci) 

R = R\ 

* Einstein 

Curvature 

Tensor 

G = Ricci — y gR 

G a 0 — Rap ~ Y SapR 

Useful formulas for computing G a e (derived in §14.2): 
C°0 = -(^ 12 !2 + ^ 23 23 + * 31 31 ), 

G°1 = *°*M + ^ 03 13> 

G 1 ! = — (/?°“o2 + ^ 03 03 + ^ 23 23)’ 

G 1 2 = R l0 20 + R 13 23 , etc. 

* Symmetries of 
Curvature 

Tensors 


R<xpy& = R[apn y«] “ R ly&}lapb R[a0y&\ = Rc&0y&) = 0 

Rap R(a0)' ^ap G( a p) 

Bianchi 

Identities 


R Q plfiv;\) — 0 

* Contracted 

Bianchi 

Identities 


- 0 

Geodesic 

Deviation 

\ P + Riemann <i) = 0 

D 2 n a 

Parallel 

Transport 
around closed 
curve (§11.4) 

8A + Riemann (.. . A, u, v) = 0 
if u , v are edges of curve 

SA a + R a pyS = 0 


* If metric is absent, these starred formulas cannot be formulated. All other formulas are valid in absence of metric. 


CHAPTER 

DIFFERENTIAL TOPOLOGY 


In analytic geometry , many relations which are independent of 
any frame must be expressed with respect to some particular 
frame. It is therefore preferable to devise new methods — 
methods which lead directly to intrinsic properties without any 
mention of coordinates. The development of the topology of 
general spaces and of the objects which occur in them, as well 
as the development of the geometry of general metric spaces , 

are steps in this direction. 

KARL MENGER, in Schilpp (1949), p. 467. 





§9.1. GEOMETRIC OBJECTS IN METRIC-FREE, 

GEODESIC-FREE SPACETIME 

Curved spacetime without metric or geodesics or parallel transport, i.e., “differential 
topology,” is the subject of this easy chapter. It is easy because all the necessary 
geometric objects (event, curve, vector, 1-form, tensor) are already familiar from 
flat spacetime. Yet it is also necessary, because one’s viewpoint must be refined when 
one abandons the Lorentz metric of flat spacetime. 


Events 

The primitive concept of an event <3* (Figure 1.2) needs no refinement. The essential 
property here is identifiability, which is not dependent on the Lorentz metric struc¬ 
ture of spacetime. 


-- 

This chapter is entirely Track 2. 

It depends on no preceding 
Track-2 material. 

It is needed as preparation 
for 

(1) Chapters 1 0-13 
(differential geometry; 
Newtonian gravity), 
and 

(2) Box 30.1 (mixmaster 
cosmology). 

It will be helpful in 

(1) Chapter 14 (calculation 
of curvature) and in 

(2) Chapter 15 (Bianchi 
identities). 

_ J 
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Metric is abandoned 

Geometric concepts must 
refined 


Old definitions of vector 
break down when metric 
abandoned 


Curves 

Again no refinement. A “curve” iJ?(A) is also too primitive to care whether spacetime 
has a metric —except that, with metric gone, there is no concept of “proper length” 
along the curve. This is in accord with Newton’s theory of gravity, where one talks 
of the lengths of curves in “space,” but never in “spacetime.” 


Vectors 

Here refinement is needed. In special relativity one could dress primitive (“identifi¬ 
able”) events in enough algebraic plumage to talk of vectors as differences <$ — 2, 
between “algebraic” events. Now the plumage is gone, and the old bilocal (“point 
for head and point for tail”) version of a vector must be replaced by a purely local 
version (§9.2). Also vectors cannot be moved around; each vector must be attached 
to a specific event (§§9.2 and 9.3). 


1-Forms 

Almost no refinement needed, except that, with metric gone, there is no way to tell 
which 1-form corresponds to a given vector (no way to raise and lower indices), 
and each 1-form must be attached to a specific event (§9.4). 


Tensors 

Again almost no refinement, except that each slot of a tensor is specific: if it accepts 
vectors, then it cannot accommodate 1-forms, and conversely (no raising and lower¬ 
ing of indices); also, each tensor must be attached to a specific event (§9.5). 


§9.2. "VECTOR" AND "DIRECTIONAL DERIVATIVE" 

REFINED INTO TANGENT VECTOR 

Flat spacetime can accommodate several equivalent definitions of a vector (§2.3): 
a vector is an arrow reaching from an event to an event 2 0 ; it is the parameterized 
straight line, ^(A) = 9 0 + X(2 0 — ^ 0 ) extending from at A = 0 to 2 0 at A = 1; 
it is the rate of change of the point ^(A) with increasing A, d&/dX. 

With Lorentz metric gone, the “arrow” definition and the “parametrized-straight 
line” definition must break down. By what route is the arrow or line to be laid out 
between and 2 0 1 There is no concept of straightness; all routes are equally straight 
or bent. 

Such fuzziness forces one to focus on the “rate-of-change-of-point-along-curve” 



§9.2. TANGENT VECTOR AND DIRECTIONAL DERIVATIVE 
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f Box 9.1 TANGENT VECTORS AND TANGENT SPACE 




A tangent vector d^P/dX is defined to be “the limit, 
when N —oo, of N times the displacement of 
9 as X ranges from 0 to 1 /N” One cannot think 
of this final displacement deP/dX as lying in space- 
time; fuzziness forbids (no concept of straightness). 
Instead, one visualizes dty/dX as lying in a “tan¬ 
gent plane” or “tangent space,” which makes con¬ 
tact with spacetime only at ^(0), the event where 
dty/dX is evaluated. All other tangent vectors at 
^(0)—e.g., d 0 ?/dp, d/P/di), d/P/d^— he in this same 
tangent space. 

To make precise these concepts of tangent space 
and tangent vector, one may regard spacetime as 
embedded in a flat space of more than four di- 

V_._ 


mensions. One can then perform the limiting 
process that leads to dtf/dX, using straight arrows 
in the flat embedding space. The result is a higher¬ 
dimensional analog of the figure shown above. 

But such a treatment is dangerous. It suggests, 
falsely, that the tangent vector d/P/dX and the 
tangent space at ZP 0 depend on how the embedding 
is done, or depend for their existence on the em¬ 
bedding process. They do not. And to make clear 
that they do not is one motivation for defining the 
directional derivative operator “ d/dX ” to be the 
tangent vector, rather than using Cartan’s more 
pictorial concept “d/P/dX”. 


definition, dPJfdX. It, under the new name “ tangent vector,” is explored briefly in 
Box 9.1, and in greater depth in the following paragraphs. 

Even “dtf/dX” is a fuzzy definition of tangent vector, most mathematicians would 
argue. More acceptable, they suggest, is this definition: the tangent vector u to a 
curve /P(X) is the directional derivative operator along that curve 

U = d u = (d/dk) A ongcurve . (9.1) 


Best new definition: "tangent 
vector equals directional 
derivative operator" 
u = d/dX 


Tangent vector equals directional derivative operator? Preposterous! A vector 
started out as a happy, irresponsible trip from to 2 0 , It ended up loaded with 
the social responsibility to tell how something else changes at ^ 0 . At what point 
did the vector get saddled with this unexpected load? And did it really change its 
character all that much, as it seems to have done? For an answer, go back and try 
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Alternative definition, 
u = dtf/dX, requires 
embedding in flat space of 
higher dimensionality 


Refinement of d9/dX into 
d/dX 


to redo the “rate-of-change-of-point” definition, d^P/dX, in the form of a limiting 
process: 

0. Choose a curve c f(X) whose tangent vector dPP jdX at X = 0 is desired. 

1. Take the displacement of 9 as X ranges from 0 to 1; that is not dpP jdX. 

2. Take twice the displacement of S? as X ranges from 0 to \\ that is not d L f/dX. 

N. Take N times the displacement of 9 as X ranges from 0 to 1 jN\ that is not 
d$>/dX. 

oo. Take the limit of such displacements as N —oo; that is dtf/dX. 

This definition has the virtue that dty/dX describes the properties of the curve £?(X), 
not over the huge range from X = 0 to X = 1, where the curve might be doing wild 
things, but only in an infinitesimal neighborhood of the point ff 0 = ^(0). 

The deficiency in this definition is that no meaning is assigned to steps 1,2,..., 
N, ..., so there is nothing, yet, to take the limit of To make each “displacement 
of 9” a definite mathematical object in a space where “limit” has a meaning, one 
can imagine the original manifold to be a low-dimensional surface in some much 
higher-dimensional flat space. Then &(\/N) — ( PP{ 0) is just a straight arrow connect¬ 
ing two points, i.e. a segment of a straight line, which, in general, will not lie in 
the surface itself—see Box 9.1. The resulting mental picture of a tangent vector makes 
its essential properties beautifully clear, but at the cost of some artifacts. The picture 
relies on a specific but arbitrary way of embedding the manifold of interest (metric- 
free spacetime) in an extraneous flat space. In using this picture, one must ignore 
everything that depends on the peculiarities of the embedding. One must think like 
the chemist, who uses tinkertoy molecular models to visualize many essential prop¬ 
erties of a molecule clearly, but easily ignores artifacts of the model (colors of the 
atoms, diameters of the pegs, its tendency to collapse) that do not mimic quantum- 
mechanical reality. 

Elie Cartan’s approach to differential geometry, including the d^P/dX idea of a 
tangent vector, suggests that he always thought of manifolds as embedded in flat 
spaces this way, and relied on insights that he did not always formalize to separate 
the essential geometry of these pictures from their embedding-dependent details. 
Acceptance of his methods of calculation came late. Mathematicians, who mistrusted 
their own ability to distinguish fact from artifact, exacted this price for acceptance: 
stop talking about the movement of the point ifself, and start dealing only with 
concrete measurable changes that take place within the manifold, changes in any 
or all scalar functions / as the point moves. The limiting process then reads: 

O. Choose a curve ^(X) whose tangent vector at X = 0 is desired. 

1. Compute the number f\flP{ 1)] — /[<?(0)], which measures the change in / as 
the point ^(X) moves from = ^(0) to = <?( 1). 

2. Compute 2 {/[^®] —/p?(0)]), which is twice the change in/ as the point 
goes from ^(0) to 

N. Compute N{f[ < 3 ) (\/N)\ — /[<?(0)]), which is N times the change in/as the 
point goes from ^(0) to 
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oo. Same in the limit as N —>- oo: (change in/) = df/dX. 

0. The vector is not itself the change in/ It is instead the operation d/dX, which, 
when applied to f gives the change df/dX. Thus 

tangent vector = d/dX 


[cf. definition (9.1)]. 

The operation d/dX clearly involves nothing but the last steps N —oo in this 
limiting process, and only those aspects of these steps that are independent of / 
But this means it involves the infinitesimal displacements of the point 9 and nothing 
more. 

One who wishes both to stay in touch with the present and to not abandon Cartan’s 
deep geometric insight (Box 9.1) can seek to keep alive a distinction between: 

(A) the tangent vector itself in the sense of Cartan, the displacement dW/dX of 
a point; and 

(B) the “tangent vector operator,” or “directional derivative operator,” telling 
what happens to a function in this displacement: (tangent vector operator) 
= d/dX. 

However, present practice drops (or, if one will, “slurs”) the word “operator” in (B), 
and uses the phrase “tangent vector” itself for the operator, as will be the practice 
here from now on. The ideas (A) and (B) should also slur or coalesce in one’s mind, 
so that when one visualizes an embedding diagram with arrows drawn tangent to 
the surface, one always realizes that the arrow characterizes an infinitesimal motion 
of a point ddP/dX that takes place purely within the surface, and when one thinks 
of a derivative operator d/dX , one always visualizes this same infinitesimal motion 
of a point in the manifold, a motion that must occur in constructing any derivative 
df{^P)/dX. In this sense, one should regard a vector dtf/dX — d/dX as both “a 
displacement that carries attention from one point to another” and “a purely geo¬ 
metric object built on points and nothing but points.” 

The hard-nosed physicist may still be inclined to say “Tangent vector equals 
directional derivative operator? Preposterous!” Perhaps he will be put at ease by 
another argument. He is asked to pick an event ^ 0 . At that event he chooses any 
set of four noncoplanar vectors (vectors defined in whatever way seems reasonable 
to him); he names them e 0 , e v e 2 , e 3 ; and he uses them as a basis on which to 
expand all other vectors at ^ 0 : 


u — u a e a , v ~ v a e a . 


(9.2) 


He is asked to construct the four directional derivative operators d a = d e along his 

four basis vectors. As in fiat spacetime, so also here; the same expansion coefficients Isomorphism between 

that appear in u = u a e a also appear in the expansion for the directional derivative: d'rectiona! derivatives and 

vectors 


d„ = u a d n 


9 , = V* d a . 


(9.3) 
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Tangent space defined 


Coordinate-induced basis 
defined 


Changes of basis: 
transformation matrices 
defined 


Hence, every relation between specific vectors at induces an identical relation 
between their differential operators: 

u = aw -f bv ■<=*- u a = aw a -f bv a 

~ 2 L ^ ( 9 . 4 ) 

d u — a d w -f- b 8„. 

There is a complete “isomorphism” between the vectors and the corresponding 
directional derivatives. So how can the hard-nosed physicist deny the hard-nosed 
mathematician the right to identify completely each tangent vector with its direc¬ 
tional derivative? No harm is done; no answer to any computation can be affected. 

This isomorphism extends to the concept “tangent space” Because linear relations 
(such as d u — a d w + b 8„) among directional derivatives evaluated at one and the 
same point $ 0 are meaningful and obey the usual addition and multiplication rules, 
these derivative operators form an abstract (but finite-dimensional) vector space 
called the tangent space at In an embedding picture (Box 9.1) one uses these 
derivatives (as operators in the flat embedding space) to construct tangent vectors 
u = v = dj:P, in the form of straight arrows. Thereby one identifies the abstract 
tangent space with the geometrically visualized tangent space. 


§9.3. BASES, COMPONENTS, AND 

TRANSFORMATION LAWS FOR VECTORS 

An especially useful basis in the tangent space at an event is induced by any 
coordinate system [four functions, xx 2 (f), x 3 ^)}: 


( . I directional derivative along the\ 

—o') = curve with constant (x l , x 2 , x 3 ) 

X xl » x2 ’ x3 \and with parameter A = x° J at ^ o , (9.5) 




8 

8 * 1 ’ 


e 2 


8 

dx 2 9 




J 


8 

dx 3 


(See Figure 9.1.) 

A transformation from one basis to another in the tangent space at ^ 0 , like any 
change of basis in any vector space, is produced by a nonsingular matrix, 


o' = e P LP 


a'> 


(9.6) 


and, as always (including the Lorentz frames of flat spacetime), the components of 
a vector must transform by the inverse matrix 


= L a \uP\ 


\\L a ' p \\ = II L^y\\-\ i.e„ 


'L«\L\ = 5“V, 

L\L a \ = *V 


(9.7) 

(9.8) 



§9.4. 


1-FORMS 
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Figure 9.1. 

The basis vectors induced, by a coordinate sys¬ 
tem, into the tangent space at each event. Here 
a truncated, two-dimensional spacetime is shown 
(two other dimensions suppressed), with coordi¬ 
nates xi?) a nd 'P(&), and with corresponding 
basis vectors d/d% and d/d$. 


This “inverse” transformation law guarantees compatibility between the expansions 
u — e a ,u a ' and u — e^u^: 


u = e a .u«' = (eyLy^Lo’oU*) = eJPM* 


= e p uP - 


In the special case of transformations between coordinate-induced bases, the 
transformation matrix has a simple form: 


dx& d 


(by usual rules of calculus), 


so 


dx a ' dx a ’ dx& 

ffi , — (dx^/'dx a 1 

<x v /at event ^o wh ere tangent space lies’ 


(9.9) 


(Note: this generalizes the Lorentz-transformation law x& = A& a ,x a \ which has the 
differential form A$ a , — dx^/dx a '\ also, it provides a good way to remember the 
signs in the A matrices.) 


§9.4. 1-FORMS 

When the Lorentz metric is removed from spacetime, one must sharpen up the 
concept of a 1-form a by insisting that it, like any tangent vector i/, be attached 
to a specific event ?P 0 in spacetime. The family of surfaces representing a resides 
in the tangent space at not in spacetime itself. The piercing of surfaces of a 
by an arrow u to produce the number <a, i/> (“bongs of bell”) occurs entirely in 
the tangent space. 
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Dual basis of 1-forms defined 


Component-manipulation 

formulas 



Figure 9.2. 

The basis vectors e a and dual basis 1-Forms uj& in the tangent space of an event (? 0 . The condition 

<»** O = 8 \ 

dictates that the vectors e 2 and e 3 lie parallel to the surfaces of cu l , and that e x extend from one surface 
of w l to the next (precisely 1.00 surfaces pierced). 

Notice that this picture could fit perfectly well into a book on X-rays and crystallography. There the 
vectors e v e 2 , e 3 would be the edges of a unit cell of the crystal; and the surfaces of a/ 1 , a/ 2 , tu 3 would 
be the surfaces of unit cells. Also, for an X-ray diffraction experiment, with wavelength of radiation 
and orientation of crystal appropriately adjusted, the successive surfaces of w l would produce Bragg 
reflection. For other choices of wavelength and orientation, the surfaces of cu 2 or uj 3 would produce 
Bragg reflection. 


Given any set of basis vectors {e 0 , e v e 2 , e 3 } at an event ^ 0 , one constructs the 
“dual basis ” of 1-forms {uj 0 , uj 1 , uj 2 , uj 3 } by choosing the surfaces of uj& such that 
that 

e a > = SV (9.10) 

See Figure 9.2. A marvelously simple formalism for calculating and manipulating 
components of tangent vectors and 1-forms then results: 


u = e a u a 

(definition of components of t/), 

(9.11a) 

a = OpUJ 13 

(definition of components of a), 

(9.11b) 

U a = (Ul a , £/> 

(way to calculate components of u), 

(9.11c) 

°/3 = <<*» e j3> 

(way to calculate components of a), 

(9.1 Id) 

<cr, u> = o a u a 

(way to calculate <a, i/> using 
components), 

(9. lie) 

w a ' = L a ' pUJ 13 

(transformation law for 1-form 
basis, corresponding to equation 9.6), 

(9.1 If) 

<V = o j S L\, 

(transformation law for l-form 
components). 

(9-1 lg) 


(Exercise 9.1 below justifies these equations.) 
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In the absence of a metric, there is no way to pick a specific 1-form u at an event 
and say that it corresponds to a specific tangent vector u at ^ 0 . The correspond¬ 
ence set up in flat spacetime, 

<i/, v) = u- v for all v , 

was rubbed out when was rubbed out. Restated in component language: the 
raising of an index, u a — is impossible because the 77 ^ do not exist; similarly, 

lowering of an index, u^ is impossible. 

The 1-form gradient df was introduced in §2.6 with absolutely no reference to 
metric. Consequently, it and its mathematical formalism are the same here, without 
metric, as there with metric, except that, like all other 1 -forms, df now resides in 
the tangent space rather than in spacetime itself. For example, there is no change 
in the fundamental equation relating the projection of the gradient to the directional 
derivative: 

<*//,«> = 3„/= !#[/]. (9.12) 


f old notation for 

J L 

new notation; 

j_directional derivative 


jecall u = d u .~ 


Similarly, there are no changes in the component equations, 

df = f a cu a (expansion of df in arbitrary 

basis), 

% 

f a = d a f = e a [f] (way to calculate components 

of df), 

f a = df/dx a if ( e a } is a coordinate basis, 

except that they work in arbitrary bases, not just in Lorentz bases. And, as in Lorentz 
frames, so also in general: the one-form basis ( dx a } and the tangent-vector basis 
{d/dx a }, which are induced into a tangent space by the same coordinate system, 
are the duals of each other, 


(9.13a) 

(9.13b) 


(dx a ,d/dxPy - 8 a p. (9.14) 

(See exercise 9.2 for proofs.) Also, most aspects of Cartan’s “Exterior Calculus” (parts 
A, B, C of Box 4.1) are left unaffected by the removal of metric. 


§9.5. TENSORS 

A tensor S, in the absence of Lorentz metric, differs from the tensors of fiat, Lorentz 
spacetime in two ways. (1) S must reside at a specific event ^ 0 , just as any vector 
or 1-form must. (2) Each slot of S is specific; it will accept either vectors or 1-forms, 
but not both, because it has no way to convert a 1 -form u into a “corresponding 


Correspondence between 
vectors and 1-forms rubbed 
out 


Gradient of a function 


* 


Specificity of tensor slots 
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EXERCISES 


vector” u as it sends u through its linear machinery. Thus, if S is a (J) tensor 


S( . 


insert 1-form here- 


J t Li 


(9.15) 


insert vector here 


insert vector here 


then it cannot be converted alternatively to a 0 tensor, or a (g) tensor, or a (§) tensor 
by the procedure of §3.2. In component language, the indices ofS cannot be raised 
and lowered. 

Except for these two restrictions (attachment to a specific event; specificity of slots), 
a tensor S is the same linear machine as ever. And the algebra of component 
manipulations is the same: 

S a p y = S(cu a , e y ) (S, u) <x , must all reside at same event) (9.16) 

S = S<* fiy e a ® cut ® <v y , (9.17) 

S(cr, w, v) = S a p y o a nPv y . (9-18) 


Exercise 9.1. COMPONENT MANIPULATIONS 

Derive equations (9.11c) through (9.1 lg) from (9.10), (9.11a, b), (9.6), (9.7), and (9.8). 

Exercise 9.2. COMPONENTS OF GRADIENT, AND DUALITY OF 
COORDINATE BASES 

In an arbitrary basis, define f a by the expansion (9.13a). Then combine equations (9.1 Id) 
and (9.12) to obtain the method (9.13b) of computing^. Finally, combine equations (9.12) 
and (9.13b) to show that the bases {dx a } and {d/dx&} are the duals of each other. 

Exercise 9.3. PRACTICE MANIPULATING TANGENT VECTORS 

Let be the point with coordinates (x = 0,y = l,z = 0) in a three-dimensional space; 
and define three curves through by 

<?(\) = (\,U), 

^(0 = (sin f, cos f, f)> # 

^(p) = (sinh p, cosh p, p + p 3 ). 

(a) Compute ( d/d\)f (d/dl)f and ( d/dp)f for the function/= x 2 — y 2 + z 2 at the point 
^ 0 . (b) Calculate the components of the tangent vectors d/dX , d/di, and d/dp at ^ 0 , using 
the basis {d/dx, d/dy , d/dz). 

Exercise 9.4. MORE PRACTICE WITH TANGENT VECTORS 

In a three-dimensional space with coordinates (x,y, 2 ), introduce the vector field v = y 2 
d/dx — x d/dz , and the functions f = xy, g = z 3 . Compute 

(a) v[f] (c) v[fg] (e) v[f 2 + g 2 ] 

(b) v[g] (d) fv[g] - gv[f] (f) v{v[f}} 
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Exercise 9.5. PICTURE OF BASIS 1-FORMS INDUCED BY COORDINATES 

In the tangent space of Figure 9.1, draw the basis 1-forms dp and dx induced by the 
\p, x -coor ^ nate system. 

Exercise 9.6. PRACTICE WITH DUAL BASES 

In a three-dimensional space with spherical coordinates r, 6 , <£, one often likes to use, instead 
of the basis d/dr, d/d8 , d/d<p, the basis 

-_JL _ _J_JL _ i a 

r 0r’ * r d6 9 * r sin 8 d<p’ 

(a) What is the 1-form basis {co r , u/ 9 y cu^} dual to this tangent-vector basis? (b) On the 
sphere r= 1, draw pictures of the bases {d/dr, d/00, d/d<p} y {e^e§,e^}, [dr, dS, d<p}, 
and { iv r , io 9 , co^}. 


§9.6. COMMUTATORS AND PICTORIAL TECHNIQUES 

A vector u 0 given only at one point suffices to compute the derivative u 0 [f] = 
d t /, which is simply a number associated with the point L f 0 . In contrast, a vector 
field u provides a vector u(^P )—which is a differential operator 3^—at each point 
in some region of spacetime. This vector field operates on a function/ to produce 
not just a number, but another function u[f] = d u f A second vector field v can 
perfectly well operate on this new function, to produce yet another function 

«'{«[/]} = 3 „( 9 «/)- 

Does this function agree with the result of applying v first and then ul Equivalently, 
does the “commutator” 


[U,v][f] = u{v[f]} - v{u[f]} (9.19) 


vanish? 

The simplest special case is when^i/ and v are basis vectors of a coordinate system, 
u = d/dx a , v = d/dx&. Then the commutator does vanish, because partial deriva¬ 
tives always commute: 

[d/dx a , d/dx £][/] = d 2 f/dx& dx a - d 2 f/dx a dx? = o. 


Commutator defined 


But in general the commutator is nonzero, as one sees from a coordinate-based 
calculation: 
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Commutator of two vector 
fields is a vector field 


Notice however, that the commutator [i/, v], like u and v themselves, is a vector 
field, i.e., a linear differential operator at each event: 


[u, v] = (i/[y£] - y[ M 0]) JL- = (u a v^ a - v<XuP , a )-^J- ( 9 - 20 ) 

Such results should be familiar from quantum theory’s formalism for angular mo¬ 
mentum operators (exercise 9.8). 

The three levels of geometry—pictorial, abstract, and component—yield three 
different insights into the commutator. (1) The abstract expression [i/, v] suggests 
the close connection to quantum theory, and brings to mind the many tools devel¬ 
oped there for handling operators. But recall that the operators of quantum theory 
need not be first-order differential operators. The kinetic energy is second order and 
the potential is zeroth order in the familiar Schrodinger equation. Only first-order 
operators are vectors. (2) The component expression u a vP a — valid in any 

coordinate basis, brings the commutator into the reaches of the powerful tools of 
index mechanics. (3) The pictorial representation of [w, v] (Box 9.2) reveals its 
Commutator as a "closer of fundamental role as a “closer of curves”—a role that will be important in Chapter 
curves f lFs analysis of curvature. 

Commutators find application in the distinction between a coordinate-induced 
basis, (e a ) = {d/dx a }, and a noncoordinate basis. Because partial derivatives always 
commute, 


[e a , e^] = [d/dx a , d/dx&] = 0 in any coordinate basis. (9.21) 


[ Box 9.2 THE COMMUTATOR AS A CLOSER OF QUADRILATERALS 




A. 

1 . 

2 . 

3. 

4. 



Pictorial Representation in Flat Spacetime 


For ease of visualization, consider flat space- 
time, so the two vector fields u(^P) and v{^P) 
can be laid out in spacetime itself. 

Choose an event f^ 0 where the commutator 
[i/, v] is to be calculated. 

Give the names <? 2 , ^ 3 , to the events 
pictured in the diagram. 

Then the vector — ^ 3 , which measures 
how much the four-legged curve fails to close, 
can be expressed in a coordinate basis as 


turns out to 
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6 


& 4-<?3 = [«W + «W] - [*W + K^o)] 

= m&i) - k^o)] - [«e?2) - 

= - (<«”"«*))*„ + errors 
= [«, v]<? o + errors. 

terms such as ^uWep] 


5. Notice that if u and v are halved everywhere, then [u, v] is cut down by a 
factor of 4, while the error terms in the above go down by a factor of 8. Thus, 
[u, v] represents accurately the gap in the four-legged curve (“quadrilateral”) 
in the limit where u and v are sufficiently short; i.e., [«, v] “closes the quadrila¬ 
teral” whose edges are the vectors fields u and v .. 


B. Pictorial Representation in Absence of Metric, 
or in Curved Spacetime with a Metric 



1. The same picture must work, but now one 
dares not (at least initially) lay out the vector 
fields in spacetime itself Instead one lays out 
two families of curves: the curves for which 
u( c f) is the tangent vector; and the curves for 
which v(eP) is the tangent vector. 

2. The gap “?P 4 — c f 3 ” in the four-legged curve 
can be characterized by the difference 
f(& 4 ) — /(^ 3 ) in the values of an arbitrary 
function at 0 J 4 and & 3 . That difference is, in 
a coordinate basis, 
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Box 9.2 ( continued ) 

(/>“ + \f, a pv a v^ (/>“ + 

- im 2) -/(^o)] - \m 3 ) ~/(^)] 

(/>“ + y/a/3^) (/o““ + y/a/3" 11 ^) 

*"* ‘ r O ^*2 

= Kf,aV a \ 0 ti fi - (/a wa ),/3 i;/? k + “ cubic errors” 

= \{u fi v a fi — ^ /J w a >/3 )0//0x a ]^ o + “cubic errors” 

= {[w, v][/]} t?o + “cubic errors.” 

Here “cubic errors” are cut down by a factor of 8, while [u, i^]/is cut down 
by one of 4, whenever u and v are cut in half. 

3. The result 

/(^ 4 ) -/(^ 3 ) = {[". + “cubic errors” 

says that [u, v) is a tangent vector at that describes the separation between 
the points <? z and i? 4 . Its description gets arbitrarily accurate when u and v 
get arbitrarily short. Thus, [ 11 , v] closes the quadrilateral whose edges are the 
projections of u and v into spacetime. 

C. Philosophy of Pictures 

1. Pictures are no substitute for computation. Rather, they are useful for (a) 
suggesting geometric relationships that were previously unsuspected and that 
one verifies subsequently by computation; (b) interpreting newly learned 
geometric results. 

2. This usual noncomputational role of pictures permits one to be sloppy in 
drawing them. No essential new insight was gained in part B over part A, when 
one carefully moved the tangent vectors into their respective tangent spaces, 
and permitted only curves to lie in spacetime. Moreover, the original picture 
(part A) was clearer because of its greater simplicity. 

3. This motivates one to draw “sloppy” pictures, with tangent vectors lying in 
spacetime itself—so long as one keeps those tangent vectors short and occa¬ 
sionally checks the scaling of errors when the lengths of the vectors are halved. 
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Conversely, if one is given a field of basis vectors (“frame field”) {effP)}, but one 
does not know whether a coordinate system exists in which (e a ) = {d/dx a }, 

one can find out by a simple test: calculate all (4 x 3)/2 = 6 commutators [ e a , e^]; 
if they all vanish, then there exists such a coordinate system. If not, there doesn’t. 
Stated more briefly, (e a (£P)} is a coordinate-induced basis if and only if[e a , ep] — 0 
for all e a and e$. (See exercise 9.9 for proof; see §11.5 for an important application.) 
Coordinate-induced bases are sometimes called “ holonomic In an “anholonomic 
basis” (noncoordinate basis), one defines the commutation coefficients c^* by 

[e M , e v ] = Cji fe a . (9.22) 

They enter into the component formula for the commutator of arbitrary vector fields 
u and v: 


l U , v] = (i u[vP] - v[uP] + u^c^ep (9.23) 

(see exercise 9.10). 

[ Warning! In notation for functions and fields, mathematicians and physicists often 
use the same symbols to mean contradictory things. The physicist may write I when 
considering the length of some critical component in an instrument he is designing, 
then switch to i(T) when he begins to analyze its response to temperature changes. 
Thus i is a number, whereas i(T) is a function. The mathematician, in contrast, 
will write / for a function that he may be considering as an element in some 
infinite-dimensional function space. Once the function is supplied with an argument, 
he then contemplates /(x), which is merely a number: the value of/ at the point 
x. Caught between these antithetical rituals of the physics and mathematics sects, 
the authors have adopted a clear policy: vacillation. Usually physics-sect statements, 
like “On a curve c f(K )...are used; and the reader can translate them himself 
into mathematically precise language: “Consider a curve 6 on which a typical point 
is 9 — 6fK)\ on this curve...But on occasion the reader will encounter a pedan¬ 
tic-sounding paragraph written in mathematics-sect jargon (Example: Box 23.3). 
Such paragraphs deal with concepts and relationships so complex that standard 
physics usage would lead to extreme confusion. They also should prevent the reader 
from becoming so conditioned to physics usage that he is allergic to the mathematical 
literature, where great advantages of clarity and economy of thought are achieved 
by consistent reliance on wholly unambiguous notation.] 


Exercise 9.7. PRACTICE WITH COMMUTATORS 

Compute the commutator [e*. e^] of the vector fields 

_ _j_ 

* r d9 ’ ^ r sin 6 

Express your result as a linear combination of e$ and e^. 


Vanishing commutator: a test 
for coordinate bases 

Commutation coefficients 
defined 


Physicists' notation vs. 
mathematicians' notation 


EXERCISES 
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Exercise 9.8. ANGULAR MOMENTUM OPERATORS 

In Cartesian coordinates of three-dimensional Euclidean space, one defines three “angular- 
momentum operators" (vector fields) L ■ by 

L 5 = t ikl x*(d/dx l y 

Draw a picture of these three vector fields. Calculate their commutators both pictorially and 
analytically. 

Exercise 9.9. COMMUTATORS AND COORDINATE-INDUCED BASES 

Let u and v be vector fields in spacetime. Show that in some neighborhood of any given 
point there exists a coordinate system for which 

u = d/dx 1 , v — d/dx 2 , 

if and only if u and v are linearly independent and commute: 

[«/, v] = 0. 

First make this result plausible from the second figure in Box 9.2; then prove it mathe¬ 
matically. Note: this result can be generalized to four arbitrary vector fields e 0 , e v e 2 , e 3 . 
There exists a coordinate system in which e a — d/dx a if and only if e 0 , e v e 2 , e 3 are linearly 
independent and [e^, e v ] = 0 for all pairs e , e v . 

Exercise 9.10. COMPONENTS OF COMMUTATOR IN NON-COORDINATE BASIS 

Derive equation (9.23). 

Exercise 9.11. LIE DERIVATIVE 

The “Lie derivative” of a vector field v(&) along a vector field «/(£P) is defined by 

£ u v = [u, v ]. (9.24) 

Draw a space-filling family of curves (a " congruence”) on a sheet of paper. Draw an arbitrary 
vector v at an arbitrary point & 0 on the sheet. Transport that vector along the curve through 
by means of the “Lie transport law” £ u v = 0, where u = d/dt is the tangent to the curve. 
Draw the resulting vector v at various points $*(t) along the curve. 

Exercise 9.12. A CHIP OFF THE OLD BLOCK 

(a) Prove the Jacobi identity 

lU , [v, IV]] + [v y [IV, u]] + [IV, [u, v}} = 0 (9.25) 

by picking out all terms of the form d u d v d w , showing that they add to zero, and arguing 
from symmetry that all other terms, e.g., d w d u d v terms, must similarly cancel. 

(b) State this identity in index form. 

(c) Draw a picture corresponding to this identity (see Box 9.2). 


§9.7. MANIFOLDS AND DIFFERENTIAL TOPOLOGY 

Spacetime is not the only arena in which the ideas of this chapter can be applied. 
Points, curves, vectors, 1-forms, and tensors exist in any “differentiable manifold.” 
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Their use to study differentiable manifolds constitutes a branch of mathematics called 
“differential topology” —hence the title of this chapter. 

The mathematician usually begins his development of differential topology by 
introducing some very primitive concepts, such as sets and topologies of sets, by 
building a fairly elaborate framework out of them, and by then using that framework 
to define the concept of a differentiable manifold. But most physicists are satisfied 
with a more fuzzy, intuitive definition of manifold: roughly speaking, an w-dimen- 
sional differentiable manifold is a set of “points” tied together continuously and 
differentiably, so that the points in any sufficiently small region can be put into a 
one-to-one correspondence with an open set of points of R n . [£" is the number space 
of n dimensions, i.e., the space of ordered ^-tuples (jc 1 , x 2 ,..., x n ).] That corre¬ 
spondence furnishes a coordinate system for the neighborhood. 

A few examples will convey the concept better than this definition. Elementary 
examples (Euclidean 3-spaces, the surface of a sphere) bring to mind too many 
geometric ideas from richer levels of geometry; so one is forced to contemplate 
something more complicated. Let R 3 be a three-dimensional number space with the 
usual advanced-calculus ideas of continuity and differentiability. Points £ of R 3 are 
triples, £ = (£ 2 , £ 2 , £ 3 ), of real numbers. Let a ray 9 in R 3 be any half-line from the 
origin consisting of all £ of the form £ = \r) for some fixed r\ ^ 0 and for all positive 
real numbers X > 0. (See Figure 9.3.) A good example of a differentiable manifold 
then is the set S 2 of all distinct rays. If / is a real-valued function with a specific 
value f{9) for any ray 9 [so one writes /: S 2 — R: 9 — f(9)] y it should be 
intuitively (or even demonstrably) clear that we can define what we mean by saying 
f is continuous or differentiable. In this sense S 2 itself is continuous and differen¬ 
tiable. Thus S 2 is a manifold, and the rays 9 are the points of S 2 . There are many 
other manifolds that differential topology finds indistinguishable from S 2 . The 
simplest is the two-dimensional spherical surface (2-sphere), which is the standard 
representation of S 2 ; it is the set of points £ of R 3 satisfying (£j) 2 + (£ 2 ) 2 + (£ 3 ) 2 = 1. 
Clearly a different point 9 of S 2 (one ray in R 3 ) intersects each point of this standard 
2-sphere surface, and the correspondence is continuous and differentiable in either 
direction (ray to point; point to ray). The same is true for £ny ellipsoidal surface 
in R 3 enclosing the origin, and for any other surface enclosing the origin that has 


Differentiable manifold 
"defined" 


Examples of differentiable 
manifolds 


The manifold S 2 



Figure 9.3. 

Three different representations of the differentiable manifold 
S 2 . The first is the set of all rays emanating from the origin; 
the second is the sphere they intersect; the third is an odd¬ 
shaped, closed surface that each ray intersects precisely once. 
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The manifold T 2 


The manifold 50(3) (rotation 
group) 


Affine geometry and 
Riemannian geometry defined 


EXERCISES 


a different ray through each point of itself. They each embody the same global 
continuity and differentiability concepts, and represent the same abstract differ¬ 
entiable manifold 5 2 , the 2-sphere. They, and the bundle of rays we started with, 
all have the same geometric properties at this rudimentary level of geometry. A 
two-dimensional manifold that has a different geometric structure at this level (a 
different "differentiable structure”) is the torus T 2 , the surface of a donut. There 
is no way to imbed this surface smoothly in R 3 so that a distinct ray *p £ S 2 intersects 
each of its points; there is no invertible and differentiable correspondence between 
T 2 and 5 2 . 

Another example of a manifold is the rotation group 50(3), whose points are 
all the 3 X 3 orthogonal matrices of unit determinant, so = ll/yi with y> T y> = 1 
and det if = 1. This is a three-dimensional space (one often uses the three Euler- 
angle parameters in computations), where differential ideas (e.g., angular velocity) 
are employed; hence, it is a manifold. So is the Lorentz group. 

The differentiability of a manifold (i.e., the possibility of defining differentiable 
functions on it) permits one to introduce coordinate systems locally, if not globally, 
and also curves, tangent spaces, tangent vectors, 1-forms, and tensors, just as is done 
for spacetime. But the mere fact that a manifold is differentiable does not mean 
that such concepts as geodesics, parallel transport, curvature, metric, or length exist 
in it. These are additional layers of structure possessed by some manifolds, but not 
by all. Roughly speaking, every manifold has smoothness properties and topology, 
but without additional structure it is shapeless and sizeless. 

That branch of mathematics which adds geodesics, parallel transport, and curva¬ 
ture (shape) to a manifold is called affine geometry; that branch which adds a metric 
is called Riemannian geometry . They will be studied in the next few chapters. 


EXERCISES ON THE ROTATION GROUP 

As the exposition of differential geometry becomes more and more sophisticated in the 
following chapters, the exercises will return time and again to the rotation group as an 
example of a manifold. Then, in Box 30.1, the results developed in these exercises will 
be used to analyze the “Mixmaster universe,” which is a particularly important cosmological 
solution to Einstein’s field equation. 

Before working these exercises, the reader may wish to review the Euler-angle parametri- 
zation for rotation matrices, as treated, e.g., on pp. 107-109 of Goldstein (1959). 

Exercise 9.13. ROTATION GROUP: GENERATORS 

Let 3f t be three 3x3 matrices whose components are ( K t ) mn = c lmn . 

(a) Display the matrices (. t r i ) 3 , and (5T X ) 4 . 

(b) Sum the series 


«,(*) = exp(^) = 2 W- (9.26) 

n = 0 

Show that $ x {6) is a rotation matrix and that it produces a rotation through an angle 
0 about the x-axis. 
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(c) Show similarly that #*(<£) = exp($f 3 <£) and #„(x) = exp(3T 2 x) are rotation matrices, 
and that they produce rotations through angles <p and x about the z- and y-axes, 
respectively. 

(d) Explain why 9 — defines the Euler-angle coordinates, ip, 9, <p for 

the generic element £ SO( 3) of the rotation group. 

(e) Let 6 be the curve = $l z (t) through the identity matrix, 6(0) = $ £ SO( 3). Show 
that its tangent, (d6/dt)(0) = 6( 0) does not vanish by computing 6(0 )/ 12 , where f 12 
is the function / 12 (^) = P T2 i whose value is the 12 matrix element of < 3 > . 

(f) Define a vector field e 3 on SO( 3) by letting e 3 (^) be the tangent (at t = 0) to the 

curve 6(t) = through Show that e 3 (^) is nowhere zero. Note: e 3 (^) is called 

the “generator of rotations about the z-axis,” because it points from c f toward neigh¬ 
boring rotations, which differ from $ by a rotation about the z-axis. 

(g) Show that e 3 = (d/d\p) e< p. 

(h) Derive the following formulas, valid for t < 1: 

— t s in ^ cot 9)& x (9 + t cos^)^(<> + rsin^/sintf); 
& y (t)6L x ty)€lJ(Q)& g ($) = $ z (\p + t cos\p cot9)<% x (9 + tsimp)$ x (<p — t cos^/sintf). 

(i) Define e x (&) and e 2 (&) to be the tangent vectors (at t = 0) to the curves 6(t) — $t x (t)?P 
and 6(t) — 6l y (t)$, respectively. Show that 

, a • , l a i a \ 

1 Y d8 y \ dip sin 8 d<f>/ 

e 0 = sin ip 4- COS ^ ( cot ^ ^ - r~r “-• ) • 

2 Y 38 y \ dxp sin (9 3$/ 

e 1 and e 2 are the “generators of rotations about the x- and j-axes.” 


Exercise 9.14. ROTATION GROUP: STRUCTURE CONSTANTS 

Use the three vector fields constructed in the last exercise, 


— r* AC « r t 

_ci t*i </, 1 

^ co t 0 i - 

1 

A) 

6^ LU j y 

30 un *\ 

l cot % 

sin 8 

d<j>) 

jr%, — ci n 1 1f 

^ i nn*e . ft 1 

(cot 0 — 

1 

J_\ 

c?2 Mil y 

3 0 + C0S,H 

l COt % 

sin 6 

d<f>) 


0 

* 3= a*’ 


(9.27) 


as basis vectors for the manifold of the rotation group. The above equations express this 
“basis of generators” in terms of the Euler-angle basis. Show that the commutation coefficients 
for this basis are 


c 7 — _ f 

L afi *-a(2yy 


(9.28) 


independently of location in the rotation group. These coefficients are also called the 
structure constants of the rotation group. 
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AFFINE GEOMETRY: 

GEODESICS, PARALLEL TRANSPORT, 

AND COVARIANT DERIVATIVE 

Galilei's Principle of Inertia is sufficient in itself to prove 
conclusively that the world is affine in character. 

HERMANN WEYL 


V 


- N \ 

This chapter is entirely Track 2. 

Chapter 9 is necessary 
preparation for it. 

It will be needed as 
preparation for 

(1) Chapters 11-13 
(differential geometry; 
Newtonian gravity), 

(2) the second half of 
Chapter 14 (calculation 
of curvature), and 

(3) the details, but not the 
message, of Chapter 15 
(Bianchi identities). 

___ J 


Freely falling particles and 
their clocks 


§10.1. GEODESICS AND THE EQUIVALENCE PRINCIPLE 

Free fall is the “natural state of motion,” so natural, in fact, that the path through 
spacetime of a freely falling ,, neutral test body is independent of its structure and 
composition (the “weak equivalence principle” of Einstein, Eotvos, Dicke; see 
Box 1.2 and §38.3). 

Picture spacetime as filled with free-fall trajectories. Pick an event. Pick a velocity 
there. They determine a unique trajectory. 

Be more precise. Ask for the maximum amount of information tied up in each 
trajectory. Is it merely the sequence of points along which the test body falls? No; 
there is more. Each test body can carry a clock with itself (same kind of 
clock—“good” clock in sense of Figure 1.9—regardless of structure or composition 
of test body). The clock ticks as the body moves, labeling each event on its trajectory 
with a number: the time A the body was there. Result: the free-fall trajectory is 
not just a sequence of points; it is a parametrized sequence, a “ curve 99 ^(A). 

But is the parametrization unique? Not entirely. Quite arbitrary are (1) the choice 
of time origin, ^(0); and (2) the units (centimeters, seconds, furlongs,...) in which 
clock time A is measured. Hence, A is unique only up to linear transformations 


^new ^old 


( 10 . 1 ) 
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Figure 10.1. 

A geodesic viewed as a rule for “straight-on parallel transport.” Pick an event and a tangent vector 
u = d/dX there. Construct the unique geodesic ^(A) that (1) passes through e?(0) = ^ 0 ; and (2) 
has u as its tangent vector there: (dtP /d \) x=0 = u. This geodesic can be viewed as a rule for picking 
up u from ^(0) and laying it down again at its tip, *?(1), in as straight a manner as possible, 

"x=i = (d#/d\\ =1 ; 

and for then picking it up and laying it down as straight as possible again at ^(2), 

u \-2 = 

etc. This sequence of “straight as possible,” “tail-on-tip” transports gives meaning to the idea that 
(c/^/dA) x=17 and u = (d6 /c/A) x=0 are “the same vector” at different points along the geodesic; or, 
equivalently, that one has been obtained from the other by “ straight-on parallel transport .” 


b (“new origin of clock time”) is a number independent of location on this specific 
free-fall trajectory, and a (“ratio of new units to old”) is also. 

In the curved spacetime of Einstein (and in that of Cartan-Newton, Chapter 12), 
these parametrized free-fall trajectories are the straightest of all possible curves. 
Consequently, one gives these trajectories the same name, “geodesics” that mathe¬ 
maticians use for the straight lines of a curved manifold; and like the mathematicians, 
one uses the name “affine parameter” for the parameter X along a free-fall geodesic. 
Equation (10.1) then says “the affine parameter of a geodesic is unique up to linear 
transformations.” 

The affine parameter (“clock time”) along a geodesic has nothing to do, k priori, 
with any pietric. It exists even in the absence of metric (e.g., in Cartan-Newtonian 
spacetime). It gives one a method for comparing the separation between events on 
a geodesic ($ and tf are “twice as far apart” as 61 and 2 if [A$ — X a ] = 2[X (jl — X^]). 
But the affine parameter measures relative separations only along its own geodesic. 
It has no means of reaching off the geodesic. 

The above features of geodesics, and others, are summarized in Figure 10.1 and 
Box 10.1. 


Geodesic defined as a 
free-fall trajectory 


Affine parameter defined 
clock time along free-fall 
trajectory 


§10.2. PARALLEL TRANSPORT AND COVARIANT 
DERIVATIVE: PICTORIAL APPROACH 

Two test bodies, initially falling through spacetime on parallel, neighboring geodesics, 
get pushed toward each other or apart by tidal gravitational forces (spacetime 
curvature). To quantify this statement, one must quantify the concepts of “parallel” 
and “rate of acceleration away from each other.” Begin with parallelism. 
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----v 

Box 10.1 GEODESICS 

Geodesic in brief Give point, give tangent vector; get unique, affine-parametrized curve 

(“geodesic”). 

Geodesic: in context of World line of a neutral test particle (“Einstein’s geometric theory of 

gravitation physics gravity”; also “Cartan’s translation into geometric terms of Newton’s 

theory of gravity”): 

(1) “given point”: some event on this world line; 

(2) “given vector”: vector (“displacement per unit increase of parameter”) 
tangent to world line at instant defined by that event; 

(3) “unique curve”: every neutral test particle with a specified initial 
position and a specified initial velocity follows the same world line, 
regardless of its composition and regardless of its mass (small; test 
mass!; “weak equivalence principle of Einstein-Eotvos-Dicke”); 

(4) “affine parameter”: in Cartan-Newton theory, Newton’s “universal 
time” (which is measured by “good” clocks); in the real physical 
world, “proper time” (as measured by a “good” clock) along a timelike 
geodesic; 

(5) “parametrized curve”: (a) affine parameter unique up to a transfor¬ 
mation of the form X —>- aX + b y where a and b are constants (no 
arbitrariness along a given geodesic other than zero of parameter and 
unit of parameter); or equivalently (b) given any three events d, 

6 on the geodesic, one can find by well-determined physical con¬ 
struction (“clocking”) a unique fourth event fy on the geodesic such 
that (Xq — X e ) is equal to (A ( ^ — X a )\ or equivalently (c) [differential 
version] given a tangent vector with components ( dx a /dX)j at 
point d, one can find by physical construction (again “clocking”) 

“the same tangent vector” at point G with uniquely determined 
components (dx a /dX) e (vector “equal”; components ordinarily not 
equal because of twisting and turning of arbitrary base vectors be¬ 
tween d and G). 

L J 


Comparison of vectors at 
different events by parallel 
transport 


Consider two neighboring events S and SB connected by a curve 9(X). A vector 
v a lies in the tangent space at S, and a vector v# lies in the tangent space at S3. 
How can one say whether v a and v (fl are parallel, and how can one compare their 
lengths? The equivalence principle gives an answer: an observer travels (using rocket 
power as necessary) through spacetime along the world line S?(A). He carries the 
vector v a with himself as he moves, and he uses flat-space Newtonian or Minkows- 
kian standards to keep it always unchanging (flat-space physics is valid locally 
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according to the equivalence principle!). On reaching event # the observer compares 
his “parallel-transported vector’' v a with the vector If they are identical, then 
the original vector v a was (by definition) parallel to v t1] , and they had the same length. 
(No metric means no way to quantify length; nevertheless, parallel transport gives 
a way to compare length!) 

The equivalence principle entered this discussion in a perhaps unfamiliar way, 
applied to an observer who may be accelerated, rather than to one who is freely 
falling. But one cannot evade a basic principle by merely confronting it with an 
intricate application. (Ingenious perpetual-motion machines are as impossible as 
simpleminded ones!) The equivalence principle states that no local measurement 
that is insensitive to gravitational tidal forces can detect any difference whatsoever 
between flat and curved spacetime. The spaceship navigator has an inertial guidance 
system (accelerometers, gyroscopes, computers) capable of preserving an inertial 
reference frame in flat spacetime; and in flat spacetime it can compute the attitude 
and velocity of any object in the spaceship relative to a given inertial frame. The 
purchaser may specify whether he wants a guidance computer programmed with 
the laws of zero-gravity Newtonian mechanics, or with those of special-relativity 
physics. Use this same guidance system—including the same computer program—in 
curved spacetime. A vector is being parallel transported if the guidance system’s 
computer says it is not changing. 

Will the result of transport in this way be independent of the curve used to link 
d and <@? Clearly yes, in gravity-free spacetime, since this is a principal performance 
criterion that the purchaser of an inertial guidance system can demand of the 
manufacturer. But in a curved spacetime, the answer is “NO!” If v a agrees with 
v after parallel transport along one curve, it need not agree with v$ after parallel 
transport along another. Spacetime curvature produces discrepancies. But one is not 
ready to study and quantify those discrepancies (Chapter 11), until one has developed 
the mathematical formalism of parallel transport, which, in turn, cannot be done 
until one has made precise the “flat-space standards for keeping the vector v^ always 
unchanging” as it is transported along a curve. 

The flat-space standards are made precise in Box 10.2. They lead to (1) a “Schild’s 
ladder” construction for performing parallel transport; (2) the concept “covariant 
derivative,” V u v , of a vector field v along a curve with tangent u ; (3) the “equation 
of motion” V u i/ = 0 for a geodesic, which states that “a geodesic parallel transports 
its own tangent vector along itself;” and (4) a link between the tangent spaces at 
adjacent events (Figure 10.2). 


Parallel transport defined 
using inertial guidance 
systems and equivalence 
principle 


Result of parallel transport 
depends on route 


Schild's ladder for performing 
parallel transport; its 
consequences 


§10.3. PARALLEL TRANSPORT AND COVARIANT 
DERIVATIVE: ABSTRACT APPROACH 


From the “Schild’s ladder” construction of Box 10.2, one learns the following 
properties of spacetime’s covariant derivative: 


(continued on page 252 ) 
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Box 10.2 FROM GEODESICS TO PARALLEL TRANSPORT TO COVARIANT 
DIFFERENTIATION TO GEODESICS TO . . . 


"Parallel transport" as 
defined by geodesics 



/ 

/ 

/ 

/ 

/ 

/ 

/ 



A. Transport any sufficiently short stretch of a 
curve dX (i.e., any tangent vector) parallel to 
itself along curve d*B to point $ as follows: 


1. Take some point 911 along d$ close to d. 
Take geodesic X9JI through % and 9IL. Take 
any affine parametrization X of f X9JI and 
define a unique point 91 by the condition 
A# = Ai*) (“equal stretches of time 

in XX and XXl”). 



2. Take geodesic that starts at d and passes 
through X, and extend it by an equal pa¬ 
rameter increment to point 


3. Curve Xlty gives vector d% as propagated 
parallel to itself from d to * (for suffi¬ 
ciently short d c X and dXl). This construc¬ 
tion certainly yields parallel transport in flat 
spacetime (Newtonian or Einsteinian). 
Moreover, it is local (vectors dX, dXl, etc., 
very short). Therefore, it must work even in 
curved spacetime. (It embodies the equival¬ 
ence principle.) 
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Repeat process over and over, and eventu¬ 
ally end up with d% propagated parallel to 
itself from d to fS. Call this construction 
“Schild’s Ladder,” from Schild’s (1970) sim¬ 
ilar construction. [See also Ehlers, Pirani, 
and Schild (1972).] Note that curve d$ need 
not be a geodesic. There is no requirement 
that be the straight-on continuation of 
d L M similar to the geodesic requirement in 
the “cross-brace” that be the straight-on 
continuation of d%. 


"Schild’s Ladder" 



5. Result of propagating d% parallel to itself 
from d to 3 depends on choice of world line 
d $ (“evidence of curvature of spacetime”). 

B. Ask how rapidly a vector field v is changing 
along a curve with tangent vector u = d/dX . 
The answer, dv/dX = V u v = “rate of change 
of v with respect to X” = “covariant derivative 
of v along i/,” is constructed by the following 
obvious procedure: (1) Take v at X = X 0 + e. 
(2) Parallel transport it back to X = X 0 . (3) 
Calculate how much it differs from v there. (4) 
Divide by e (and take limit as e —>- 0): 


"Covariant differentiation" as 
defined by parallel transport 




Lim f [^^0 ^parallel transported to \ 0 1 

f —> 0 1 £ J 


If u = d/dX is short compared to scale of inho¬ 
mogeneities in the vector field v , then V u v can 
be read directly off drawing I, or, equally well, 
off drawing II. 
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L Consequence of this (as seen by following 
through definition of covariant derivative, 
and by noting that any vector u can be 
regarded as the tangent vector to a freely 
falling world line): 

Y u (v + w) - V u v + Y u \v 

for any vector u and vector fields v and w. 

2. Consequence of this, combined with sym¬ 
metry of covariant derivative, and with 
additivity of the “closer of quadrilaterals” 

[«,*)- 


V v = V V + V v 

Y u+n v 

(See exercise 10.1.) This can be inferred, 
alternatively, from the equivalence princi¬ 
ple: in a local inertial frame, as in special 
relativity or Newtonian theory, the change 
in v along u + n should equal the sum of 
the changes along u and along n. 

3. Consequence of above: choose n to be a 
multiple of u\ thereby conclude 

V v — aV v. 

au u 


F. The “Schild’s ladder” construction process for 
parallel transport (beginning of this box), ap¬ 
plied to the tangent vector of a geodesic (exer¬ 
cise 10.6) guarantees: a geodesic parallel trans¬ 
ports its own tangent vector along itself. 
Translated into covariant-derivative language: 

( u = d/d\ is a tangent 
vector to a curve, and 
V u u=0 

Thus closes the circle: geodesic to parallel 
transport to covariant derivative to geodesic. 


( the curve is\ 
a geodesic / 


Geodesics as defined by 
parallel transport or covariant 
differentiation 
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Covariant derivative: basic 
properties 


Symmetry: 

V u v — ^v u = \ u , v] f° r any vector fields u and v; 

(10.2a) 

Chain rule: 

V u (fv) = fV u v + v d u f for any function/ 
vector field v , and vector u\ 

(10.2b) 

Additivity: 

V u (v + W) = V u v + V u w for any vector 
fields v and iv, and vector u; 

(10.2c) 


V au +bn v = aV u v + bV n v for any vector 

field v , vectors or vector fields u and /?, 
and numbers or functions a and b. 

(10.2d) 



Figure 10.2. 

The link between the tangent spaces at neighboring points, made possible by a parallel-transport law. 
Choose basis vectors e l and e 2 at the event d. Parallel transport them to a neighboring event (Schild’s 
ladder for transport of e 1 is shown in the figure.) Then any other vector v that is parallel transported 
from d to f# will have the same components at the two events (parallel transport cannot break the legs 
of a triangle; see Box 10.2): 


v = v 1 e l + v 2 e 2 at d 


Hsame numerically as at d] 

rn 

v = v l e, + v-e 9 at $. 

t t t 


'—{parallel transported from d to $] 


Thus, parallel transport provides a unique and complete link between the tangent space at d and the 
tangent space at $. It identifies a unique vector at $ with each vector at d in a way that preserves 
all algebraic relations. Similarly (see §10.3), it identifies a unique 1-form at $ with each 1-form at d, 
and a unique tensor at $ with each tensor at d , preserving all algebraic relations such as <<r, if) = 19.9 
and S(<r, if, w) = 37 1. 

Actually, all this is true only in the limit when d and $ are arbitrarily close to each other. When 
d and $ are close but not arbitrarily close, the result of parallel transport is slightly different for different 
paths; so the link between the tangent spaces is slightly nonunique. But the differences decrease by a 
factor of 4 each time the affine-parameter distance between d and $ is cut in half; see Chapter 11. 
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Any “rule” V, for producing new vector fields from old, that satisfies these four 
conditions, is called by differential geometers a “symmetric covariant derivative 
Such a rule is not inherent in the more primitive concepts (Chapter 9) of curves, 
vectors, tensors, etc. In the arena of a spacetime laboratory, there are as many ways 
of defining a covariant derivative rule V as there are of rearranging sources of the 
gravitational field. Different free-fall trajectories (geodesics) result from different 
distributions of masses. 

Given the geodesics of spacetime, or of any other manifold, one can construct 
a unique corresponding covariant derivative by the Schild’s ladder procedure of Box 
10.2. Given any covariant derivative, one can discuss parallel transport via the 
equation 


dv/dX = V u v = 0 the vector field v is parallel transported (10.3) 
along the vector u = d/dX; 

and one can test whether any curve is a geodesic via 

Y u u = 0 the curve <?(X) with tangent vector u = d/dX 

parallel transports its own tangent vector u (10.4) 
-<=►■ i?( X ) is a geodesic. 

Thus a knowledge of all geodesics is completely equivalent to a knowledge of the 
covariant derivative. 

The covariant derivative V generalizes to curved spacetime the flat-space gradient 
V. Like its flat-space cousin, it can be viewed as a machine for producing a number 
(a , V u v ) out of a 1-form a, a vector </, and a vector field v . This machine viewpoint 
is explored in Box 10,3. Note there an important fact: despite its machine nature, 
V is not a tensor; it is a nontensorial geometric object. 

In curved as in flat spacetime, V can be applied not only to vector fields, but 
also to functions, 1-form fields, and tensor fields. Its action on functions is defined 
in the obvious manner: 


▼/=<//; ▼„/= a„/= u[f] = (df «/>. (10.5) 

Its action on 1-form fields and tensor fields is defined by the curved-space generali¬ 
zation of equation (3.39): VS is a linear machine for calculating the change in output 
of S, from point to point, when “constant 11 (i.e., parallel transported) vectors are 
inserted into its slots. Example: the gradient of a (^) tensor, i.e., of a 1-form field 
a. Pick an event pick two vectors u and v in the tangent space at ^ 0 ; construct 
from v a “constant” vector field v (^) by parallel transport along the direction of 
i/, V u v — 0. Then Va is a (£) tensor, and Y u a is a (?) tensor defined at 9 0 by 

▼ o(v, u) = < V u a,v) = y u «cr, v» = {a, v >, (10.6) 

where u — d/dX. This defines Va and because it states their output for any 

{continued on page 257) 


Equation for parallel transport 


Knowledge of all geodesics is 
equivalent to knowledge of 
covariant derivative 


Covariant derivative 
generalizes flat-space gradient 


Action of covariant derivative 
on functions, 1-forms, and 
tensors 
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Box 10.3 COVARIANT DERIVATIVE VIEWED AS A MACHINE; 

CONNECTION COEFFICIENTS AS ITS COMPONENTS 

A. The Machine View 

1* The covariant derivative operator Y, like most other geometric objects, can 
be regarded as a machine with slots. There is one such machine at each event 
in spacetime. In brief, the machine interpretation of Y at eP Q says 

▼( ? < v(f) , u) = <a. Y^). 


a vector u that “the covariant de¬ 
resides in the rivative of the vec- 

tangent space at fP 0( tor field v with 

Respect to u ,”_ 

[Note: this slot notation for Y serves no useful purpose except to emphasize the 
“machine”-nature of V. This box is the only place it will be used.] 

2. Geometrically, the output of the machine, (a, Y», is obtained as follows: 
(a) Calculate the rate of change of v, y u v, along the vector u, when u and v 
are infinitesimally small, the calculation can be represented pictorially: 



V u v. a vector that resides 
in the tangent space at 


(b) Count how many surfaces of the 1-form a are pierced by the vector 
(piercing occurs in tangent space at if 0 ) 




First slot; insert 
a 1-form a that 
resides in the tan¬ 
gent space at S? 0 



Second slot; insert 
a vector field v{ c f) 
defined on a neigh¬ 
borhood of 
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r 

This number is the output of the machine V, when a, v{9) and u are inserted into 
its slots. 

3. Another, equivalent, statement of covariant derivative as a machine. Leave first 
slot empty (no mention of any 1-form a); get a new vector field from original vector 
field v: 

▼( , v(n 

empty 

= “covariant derivative of vector field v along vector t/.” 

4. A third machine operation. Leave first and third slots empty (no mention of 
any 1-form o\ no mention of any vector u along which to differentiate); get a ({) 
tensor field from original vector field v: 

▼( , y(f) 9< J = Vv 

empty empty 

= “covariant derivative” or “gradient” of vector field v . 

This tensor field, Vv, is the curved-space generalization of the flat-space Vv studied 
in §3.5. It has two slots (the two left empty in its definition). Its output for given 
input is 

= ▼«y 

empty 

▼if(a,i#) = (a, V„ v >. 

5. Summary of the quantities defined above: 

(a) ▼ is a covariant derivative operator; to get a number from it, insert a, v{ f?), 
and u\ the result is <a, ▼/). 

(b) yv is the gradient of v\ to get a number from it, insert a and u\ the result 
is <a, V u v) [same as in (a)]. 

(c) Y u v is the covariant derivative of v along u ; to get a number from it, insert 
a; the result is <a, V u v) [same as in (a) and (b)]. 


B. How V Differs from a Tensor 


The machine ▼ differs from a tensor in two ways. (1) The middle slot of ▼ will 
not accept a vector; it demands a vector field—the vector field that is to be differ¬ 
entiated. (2) ▼ is not a linear machine (whereas a tensor must be linear!): 
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Box 10.3 ( continued ) 


V bu) = (ao, V 6u » 

= abf(a, V u v) + ab(o. v) ▼„/ 


this would be absent if V were a 
linear machine. 


C. The "Connection Coefficients" as Components of V 

Given a tensor S of rank (£), a basis of tangent vectors {e a } at the event f? 0 where 
S resides, and the dual basis of 1-forms {o/ a }, one defines the components of S 
by 

S^ y = S(cv a ,e e ,e y ). 

One defines the components of V similarly, except that for V one needs not only 
a basis {e a } at the event but also a basis (e a (^)} at each event in its neighbor¬ 
hood: 

r a p y = components of ▼ = V(o/ a , e y ) 

=«v^> 

( “a-component of change in basis vector e^, when\ 
in evaluating one moves from tail to tip of e y / 

These components of V are called the “connection coefficients ” of the basis {e a }. 
They are the “coordinate representation” of the covariant derivative operator ▼. 

The covariant derivative operator ▼ and the connection coefficients r a ^ v provide 
different mathematical representations of the same geometric animal? Preposterous! 
The one animal runs from place to place and barks, or at least bites (takes difference, 
for example, between vector fields at one place and at a nearby place). The other 
animal, endowed with forty faces (see exercise 10.9) sits quietly at one spot. It would 
be difficult for two animals to look more different. Yet they do the same jobs in any 
world compatible with the equivalence principle: (1) they summarize the properties 
of all geodesics that go through the point in question; and, so doing, (2) they provide 
a physical means (parallel transport) to compare the values of vector fields and tensor 
fields at two neighboring events. 
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given input vectors v and u. If v(f) is not constrained to be “constant” along 
u = d/d\, then (d/dX) <a, v) has contributions from both the change in v and the 
change in a: 


< a , v> = V„<a, v> = <V u a, v> + <a, V„v> (10.7) 

(see exercise 103). 

Similarly, if S is a © tensor field, then its gradient VS is a © tensor field defined 
as follows. Pick an event pick three vectors u, v, w, and a 1-form a in the tangent 
space at 9^ turn v, w, and a into “constant” vector fields and a “constant” 1-form 
field near 9 0 by means of parallel transport (V„v = V u w = V„a = 0 at S? 0 ); then 
define 


VS(a, v, w, u) = (V u S)(a, v, w) = ▼„[S(a, v, w)] 
= 3«,[5(a, v, iv)]. 


( 10 . 8 ) 


Exercise 10.1. ADDITIVITY OF COVARIANT DIFFERENTIATION 

Show that the commutator (“closer of quadrilaterals”) is additive: 

[u, V + w] - [u, V] + [u, w\, [u + /», v] = [u, v] + [/», v}. 

Use this result, the additivity condition V u (v + w) = V u v + V u w, and symmetry of the 
covariant derivative, V u v — V v u = [u, v], to prove that 

= Vu* + Vn* 

Exercise 10.2. CHAIN RULE FOR COVARIANT DIFFERENTIATION 

Use pictures, and the “take-the-difference-and-take-the-limit” definition of V u v (Box 10.2) 
to show that 


+ v 3«[/]. (10.9) 

Exercise 10.3. ANOTHER CHAIN RULE 

Derive equation (10.7), using the “take-the-difference-and-take-the-limit” definitions of 
derivatives. Hint: Before taking the differences, parallel transport a[^(X)] and vfsP(X)] back 
from £?(X) to ‘f(O). 

Exercise 10.4. STILL ANOTHER CHAIN RULE 

Show that, as in flat spacetime, so also in curved spacetime, 

▼„(v <S> w) = (V u v) ® w + v ® (V u iv). (10.10) 

Write down the more familiar component version of this equation in flat spacetime. 

Solution to first part of exercise: Choose 1-forms a and p at the event c f 0 in question, 
and extend them along the vector u = d/d\ by parallel transport, V u p = V u a - 0. Then 


EXERCISES 
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[▼„(* ® w)](P- O) = -S-[(v ® w)(p,a )] 


= d\^ P ' V ^^°' w ^ 

d(p , v) J(a, w) 

- - <a, iv> + (p, v> - 


<A 


<A 


(def of V w on a tensor) 
(def of tensor product “®”) 
(chain rule for derivatives) 


= <P< w> + <P, v)(cr, V u iv> 

(by equation 10.7 with p, a const) 
= [(▼„*) ® w){p . a) + [v ® (▼ l# iv)](p, a) 

(def of tensor product “®”). 


Exercise 10.5. ONE MORE CHAIN RULE 

Show, using techniques similar to those in exercise 10.4, that 


V„(a ® p ® v) = (V u a) ® p ® v + a ® (V u p) ® v + a ® p <S> (\v). (10.11) 


Exercise 10.6. GEODESIC EQUATION 

Use the “Schild’s ladder” construction process for parallel transport (beginning of Box 10.2) 
to show that a geodesic parallel transports its own tangent vector along itself (end of Box 
10 . 2 ). 


§10.4. PARALLEL TRANSPORT AND COVARIANT 
DERIVATIVE: COMPONENT APPROACH 

The pictorial approach motivates the mathematics; the abstract approach makes the 
pictorial ideas precise; but usually one must use the component approach in order 
to actually do complex calculations. 

To work with components, one needs a set of basis vectors { e a } and the dual 
set of basis 1-forms {o/ a }. In flat spacetime a single such basis suffices; all events 
can use the same Lorentz basis. Not so in curved spacetime! There each event has 
its own tangent space, and each tangent space requires a basis of its own. As one 
travels from event to event, comparing their bases via parallel transport, one sees 
the bases twist and turn. They must do so. In no other way can they accommodate 
themselves to the curvature of spacetime. Bases at points *P 0 and which are the 
same when compared by parallel transport along one curve, must differ when 
compared along another curve (see “Curvature”; Chapter 11). 

To quantify the twisting and turning of a “field” of basis vectors {ej*?)} and 
forms {o/“(^)}, use the covariant derivative. Examine the changes in vector fields 
along a basis vector e^, abbreviating 

▼e„ = ^ (def of V^); (10.12) 

and especially examine the rate of change of some basis vector: V^e^. This rate 
of change is itself a vector, so it can be expanded in terms of the basis: 
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v p e a = (def of r\ p ); 

note reversal of order of a and /?! 


(10.13) 



Connection coefficients 
defined 


and the resultant “connection coefficients” T ^ aj8 can be calculated by projection on 
the basis 1-forms: 


V (10.14) 

(See exercise 10.7; also Box 10.3.) Because the basis 1-forms are “locked into” the 
basis vectors {{cu v , e a ) = S" a ), these same connection coefficients r* a £ tell how the 
1-form basis changes from point to point: 

(10.15) 

(10.16) 

(See exercise 10.8.) 

The connection coefficients do even more. They allow one to calculate the compo¬ 
nents of the gradient of an arbitrary tensor S. In a Lorentz frame of flat spacetime, 
the components of VS are obtained by letting the basis vectors e a = d^P/dx a = 
d/dx a act on the components of S. Thus for a Q) tensor field S one finds that 

VS has components S a p y B = -~-j[S a 0 y ]> 


= -r\ $ . 


Not so in curved spacetime, or even in a non-Lorentz basis in flat spacetime. There 
the basis vectors turn, twist, expand, and contract, so even if S were constant 
(VS = 0), its components on the twisting basis vectors would vary. The connection 
coefficients, properly applied, will compensate for this twisting and turning. As one 

learns in exercise 10.10, the components of VS, called S a p y;6 so that Components of the gradient 

of a tensor field 

VS = S a py . s e a a/ 5 , (10.17) 

can be calculated from those of S by the usual flat-space method, plus a correction 
applied to each index (i.e., to each basis vector): 


[“ + ” when correcting “up” indices] interchange and sum ] 

_on index being correctedj 

-[differentiating index] 


s %r.» = + S ^y r \'s T - SW (10.18) 


[“ —” when correcting “down” indices 

interchange and sum 
on index being corrected 


vs i ^ vy* 08 

^ vt 


differentiating 

index 


Here 




(10.19) 
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Components of the covariant 
derivative of a tensor field 


Chain rule for gradient 


EXERCISES 


Equation (10.18) looks complicated; but it is really very simple, once the pattern 
has been grasped. 

Just as one uses special notation, S a p y;S , for the components of VS, so one 
introduces special notation, DS a p y /dX , for components of the covariant derivative 
V U S along u — d/dX: 


DS a 


dX 


V U S = (DS a p y /dX)e a ® LU$ ® a/ y ; 

= S a p y . s u s — (S a p y 8 4- correction terms)w 5 . 


( 10 . 20 ) 


Since for any / 


= 9 „/= df/dk 


this reduces to 
DS a 


dS a 


dX 


-= + y - * W“ s - ( io - 21 ) 


The power of the component approach shows up clearly when one discusses chain 
rules for covariant derivatives. The multitude of abstract-approach chain rules 
(equations 10.2b, 10.7, 10.10, 10.11) all boil down into a single rule for components: 
The gradient operation obeys the standard partial-differentiation chain rule of 
ordinary calculus. Example: 

</"")» +/»’:, 00.22a) 

1-1 = / because / has no indices to correct] 


(contract this with to get chain rule 10.2b). Another example: 
K y % = V a + ° a v a ;il 

t 


(10.22b) 


-[ = (a a v a )^ because o a v a has no free indices to correct] 

(contract this with u* to get chain rule 10.7). Another example: 

K PfiV' 1 )-,* = ° a; »Pp vy + ° a pp-^ y + ° a PftV y ;ll (10.22c) 


(contract this with to get chain rule 10.11). Another example: see Exercise (10.12) 
below. 


Exercise 10.7. COMPUTATION OF CONNECTION COEFFICIENTS 

Derive equation (10.14) for from equation (10.13). 

Exercise 10.8. CONNECTION FOR 1-FORM BASIS 

Derive equations (10.15) and (10.16), which relate V (i uj v to T\ fiy from equation (10.14). 
Hint: use equation (10.7). 
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Exercise 10.9. SYMMETRY OF CONNECTION COEFFICIENTS 

Show that the symmetry of spacetime’s covariant derivative (equation 10.2a) is equivalent 
to the following symmetry condition on the connection coefficients: 

(antisymmetric part of 7^) = 1 (/> a/s - 

= r, Wj = -J <•»*, [•«. «/?) > =-J V- (10.23) 

[commutator of basis vectors]— 

As a special case, r M a/8 is symmetric in a and ft when a coordinate basis (e a = d/dx a ) is used . 
Show that in a coordinate basis this symmetry reduces the number of independent connection 
coefficients at each event from 4 X 4 X 4 = 64 to 4 X 10 = 40. 

Exercise 10.10. COMPONENTS OF GRADIENT 

Derive equation (10.18) for the components of the gradient, S a p y;i . Hint: Expand S in terms 
of the given basis, and then evaluate the righthand side of 

V U S = V u (S% y e a aft), 

for an arbitrary vector i/. Use the chain rules (10.2b) and (10.11). By comparing the result 
with 


V u S = S a p y;B ii s e a ® ® w y , 

read off the components S a p y 8 . 

Exercise 10.11. DIVERGENCE 

Let T be a Q tensor field, and define the divergence on its second slot by the same process 
as in flat spacetime: V • T = contraction of V7"; i.e., 

(V • Tf = (10.24) 

Write the components T a $. B in terms of T a & ^ plus correction terms for each of the two 
indices of T. 

[Answer: 

t**# = T'fij + r a ^T^ + r\ e T^.\ 

Exercise 10.12. VERIFICATION OF CHAIN RULE 

Let S a & y be components of a (f) tensor field, and M p y be components of a (J) tensor field. 
By contracting these tensor fields, one obtains a vector field S a ^ y M /3 y . The chain rule for 
the divergence of this vector field reads 

(S“\A//). a = S a P y . a Mp y + S«e y M B ' :a . 

Verify the validity of this chain rule by expressing both sides of the equation in terms of 
directional derivatives (< e ) plus connection-coefficient corrections. Hint: the left side becomes 

(s*' T V)* = (S a<s y*V>.« + r^srtyMf). 


s a \ a M fi y + s“VV.« 

by chain rule for directional derivative 



Geodesic equation: 
version 
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The right side has many more correction terms (three on S aP y . a ; two on M p y . a ), but they 
must cancel against each other, leaving only one. 

Exercise 10.13. TRANS FORMATION LAW FOR CONNECTION COEFFICIENTS 

Let {e a } and {e^.} be two different fields of basis vectors related by the transformation law 

e^*) = L'f&yeJ?). ( 10 . 25 ) 

Show that the corresponding connection coefficients are related by 

r *rr = ( 10 . 26 ) 

standard transformation law 
for components of a tensor 

Exercise 10.14. POLAR COORDINATES IN FLAT 2-DIMENSIONAL SPACE 

On a sheet of paper draw an (r, $) polar coordinate system. At neighboring points, draw 
the basis vectors e- = d/dr and e^ = r -1 3/3<£. (a) Use this picture, and Euclid’s version 
of parallel transport, to justify the relations 

= 0, T f . A = 0, 

(b) From these relations write down the connection coefficients, (c) Let A = A r e f + A*e^ 
be a vector field. Show that its divergence, V * A = A a .* = A*^ + can be calculated 

using the formula 

y . A = 1 dA* l 1 d(rA') 
r d<p r dr 

(which should be familiar to most readers). 


§10.5. GEODESIC EQUATION 

Geodesics—the parametrized paths of freely falling particles—were the starting point 
of this chapter. From them parallel transport was constructed (Schild’s ladder; Box 
10.2); and parallel transport in turn produced the covariant derivative and its 
abstract connection coefficients. Given the covariant derivative, one recovered the geodesics: 

they were the curves whose tangent vectors, u = dftP/dX, satisfy V u u = 0 (u is 
parallel transported along itself). 

Let a coordinate system {jc a (&)} be given. Let it induce basis vectors e a = d/dx a 
into the tangent space at each event. Let the connection coefficients r a ^ y for this 
“coordinate basis” be given. Then the component version of the “geodesic equation” 
V u u — 0 becomes a differential equation for the geodesic x a (X): 

(1) u = -lL = =►- components of u are u a - 

d\ dX dx a r dX 


§10.5. GEODESIC EQUATION 


263 


(2) then components of ▼„(/ = 0 are 

o = u a . pU p = (u a >$ + r a yP u y )u^ 

_ 9 / dx^ \dx^_ ra 
dxP\ dk) dk 

which reduces to the differential equation 


dx y dx& 
yP d\ dX ’ 


d 2 x a 

dX 2 


+ r a 


dx y dxl 
yP dX dX 


= 0. 


(10.27) 


This component version of the geodesic equation gives an analytic method 
(“translation” of Schild’s ladder) for constructing the parallel transport law from 
a knowledge of the geodesics. Pick an event and set up a coordinate system in 
its neighborhood. Watch many clock-carrying particles pass through (or arbitrarily 
close to) & 0 . For each particle read off the values of d 2 x a /dX 2 and dx a /dX at 9 0 . 
Insert all the data for many particles into equation (10.27), and solve for the connec¬ 
tion coefficients. Do not be disturbed that only the symmetric part of T a yp is obtained 
thereby; the antisymmetric part, r a lyp] , vanishes identically in any coordinate frame! 
(See exercise 10.9.) Knowing r a yf} , use them to parallel transport any desired vector 
along any desired curve through fP 0 : 


y u v = o 


dv a 

dX 


+ f a y pv y 


dxf_ 

dX 


= 0. 


(10.28) 


Exercise 10.15. COMPONENTS OF PARALLEL-TRANSPORT LAW 

Show that equation (10.28) is the component version of the law for parallel transporting 
a vector v along the curve S?(X) with tangent vector u = d^P/dX. 

Exercise 10.16. GEODESICS IN POLAR COORDINATES 

In rectangular coordinates on a flat sheet of paper, Euclid’s straight lines (geodesics) satisfy 
d 2 x/d\ 2 = d 2 y/d \ 2 = 0. Transform this geodesic equation into polar coordinates (x - 
r cos <j>, y = r sin < p); and read off the resulting connection coefficients by comparison 
with equation (10.27). These are the connection coefficients for the coordinate basis (d/dr, 
d/d<(>). From them calculate the connection coefficients for the basis 

e - - A- e . _ ±JL 

T ~dr' r d$- 

The answer should agree with the answer to part (b) of Exercise 10.14. Hint: Use such 
relations as 


Component version 

How to construct parallel 
transport law from 
knowledge of geodesics 


EXERCISES 


^e^ e r — ^U/rXa/a0)(°/d r ) — “ ^(a/90)(^/^ r )- 
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Exercise 10.17. ROTATION GROUP: GEODESICS AND CONNECTION 
COEFFICIENTS 

[Continuation of exercises 9.13 and 9.14.] In discussing the rotation group, one must make 
a clear distinction between the Euclidean space (coordinates x,y, z; basis vectors d/dx, d/dy, 
d/dz) in which the rotation matrices act. and the group manifold 50(3) (coordinates \j/, 0, 
(p; coordinate basis d/d\p, d/dd , d/d<p; basis of “generators” e v e 2 , e 3 ), whose points /P are 
rotation matrices. 

(a) Pick a vector 

n = n x d/dx -f n y d/dv + n z d/dz 
in Euclidean space. Show that 

$„(t) = exp + n»th\ + n*X z )t) (10.29) 

is a rotation matrix that rotates the axes of Euclidean space by an angle 

t\n\ = t[(n*) 2 + (n^) 2 + («*) 2 ] 1/2 

about the direction n. are matrices defined in exercise 9.13.) 

(b) In the group manifold 50(3), pick a point (rotation matrix) & 9 and pick a tangent 
vector u = u a e a at Let u be a vector in Euclidean space with the same components as 
u has in 50(3): 


u = u l e 1 + u 2 e 2 + « 3 e 3 ; u — u 1 d/dx -f u 2 d/dy + u 3 d/dz . (10.30) 

Show that u is the tangent vector (at t = 0) to the curve 

<?(t) = ® u (t)<?. (10.31) 

The curve 6(t ) through the arbitrary point </? with arbitrary tangent vector u = (d(?/dt) t = 0 
is a very special curve: every point on it differs from 9 by a rotation & u (t) about one and 
the same direction u. No other curve in 50(3) with “starting conditions” i/} has such 
beautiful simplicity. Hence it is natural to decree that each such G(t) is a geodesic of the 
group manifold 50(3). This decree adds new geometric structure to 50(3); it converts 50(3) 
from a differentiable manifold into something more special: an affine manifold . 

One has no guarantee that an arbitrarily chosen family of curves in an arbitrary manifold 
can be decreed to be geodesics. Most families of curves simply do not possess the right 
geometric properties to function as geodesics. Most will lead to covariant derivatives that 
violate one or more of the fundamental conditions (10.2). To learn whether a given choice 
of geodesics is possible, one can try to derive connection coefficients (for some given 
basis) corresponding to the chosen geodesics. If the derivation is successful, the choice of 
geodesics was a possible one. If the derivation produces inconsistencies, the chosen family 
of curves have the wrong geometric properties to function as geodesics. 

(c) For the basis of generators {e a } derive connection coefficients corresponding to the 
chosen geodesics, G(t) = tH u (i)^P, of 50(3). Hint: show that the components u a = < cu a ,u ) 
of the tangent u = d€jdt to a given geodesic are independent of position G(t) along the 
geodesic. Then use the geodesic equation V u u = 0, expanded in the basis {e a }, to calculate 
the symmetric part of the connection r a ipy) . Finally use equation (10.23) to calculate /'°' [/3y] . 
[Answer: 

~ J £ ajSy> (10.32) 

where c a/Jy is the completely antisymmetric symbol with e 123 = +1. This answer is inde¬ 
pendent of location 9 in 50(3)!] 




§11.1. CURVATURE, AT LAST! 

Spacetime curvature manifests itself as gravitation, by means of the deviation of 
one geodesic from a nearby geodesic (relative acceleration of test particles). 

Let the geodesics of spacetime be known. Then the covariant derivative V and 
its connection coefficients / , “ j8y are also known. How, from this information, does 
one define, calculate, and understand geodesic deviation and spacetime curvature? 
The answer unfolds in this chapter, and is summarized in Box 11.1. To disclose the 
answer one must (1) define the “relative acceleration vector” V u V u n, which meas¬ 
ures the deviation of one geodesic from another (§11.2); (2) derive an expression 
in terms of V or for the “Riemann curvature tensor,” which produces the 

geodesic deviation (§11.3); (3) see Riemann curvature at work, producing changes 
in vectors that are parallel transported around closed circuits (§ 11.4); (4) see Riemann 
curvature test whether spacetime is flat (§11.5); and (5) construct a special coordinate 
system, “Riemann normal coordinates,” which is tied in a special way to the Riemann 
curvature tensor (§11.6). 

§11.2. THE RELATIVE ACCELERATION OF 
NEIGHBORING GEODESICS 

Focus attention on a family of geodesics (Figure 11.1). Let one geodesic be distin¬ 
guished from another by the value of a “selector parameter” n. The family includes 
not only geodesics n = 0, 1,2,... but also geodesics for all intervening values of 


Geometry of a family of 
geodesics: 

Selector parameter 


This chapter is entirely 
Track 2. Chapters, 9 and 10 are 
necessary preparation for it. 

It will be needed as 
preparation for 

(1) Chapters 12 and 13 
(Newtonian gravity; 
Riemannian geometry), 

(2) the second half of 
Chapter 14 (calculation 
of curvature), and 

(3) the details, but not the 
message, of Chapter 15 
(Bianchi identities). 


Overview of chapter 
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Box 11.1 GEODESIC DEVIATION AND RIEMANN CURVATURE IN BRIEF 


“Geodesic separation” n is displacement (tangent 
vector) from point on fiducial geodesic to point on 
nearby geodesic characterized by same value of 
affine parameter X. 

Geodesic separation changes with respect to X 
(i.e., changes along the tangent vector u = d/dX) 
at a rate given by the equation of geodesic deviation 

▼„ V„n *f Riemann (...,«/, n, u) = 0 (1) 

(second-order equation; see §§1.6 and 1.7; Figures 

1 . 10 , 1 . 11 , 1 . 12 ). 

In terms of components of the Riemann tensor 
the driving force (“tidal graviational force”) is 

*uPn y u & . 


in later chapters, takes the following operational- 
computational form in a given coordinate system: 

(a) Write down trial formula for dynamic evol¬ 
ution of metric coefficients g with time. 

(b) Calculate the connection coefficients from 

r\, = g a0 r^ ( 4 ) 

- 1 4. _ dg ^ \ 




dx* 


+ 


a* 1 ' dxP / 


( 5 ) 


Riemann (. 


( 2 ) 


,u,n,u) = e a R a pyS 

The components of the Riemann curvature ten¬ 
sor in a coordinate frame are given in terms of 
the connection coefficients by the formula 


(derived in Chapter 13). 

(c) Calculate Riemann curvature tensor from 
equation (3). 

(d) Calculate Einstein curvature tensor from 

= - < 6 ) 

(geometric significance in Chapter 15). 

(e) Insert into Einstein’s equations (Chapter 


Da _ 

A /3ys — 


a r a 

_ C _± _£5 


dr a 


3jc 7 

-f r a 


Hi 1 J8* 


1 a 

£I 

17): 


X 5 

ra r ji 

~ 1 n* 1 fir 

<v = o 
<v = 

(empty space), 

(when mass-energy is 

only quantifies the 


present). 


concept of “tidal gravitational force,” but also 
enters into Einstein’s law, by which “matter tells 
spacetime how to curve.” That law, to be studied 


(f) Test whether the trial formula for the dy¬ 
namic evolution of the geometry was cor¬ 
rect, and, if not, change it so it is. 


Affine parameter 


Tangent vector 


n. The typical point 9 on the typical geodesic will be a continuous, doubly differen¬ 
tiable function of the selector parameter n and the affine parameter X; thus 


The tangent vector 


or 


9 = 9(X, n). 



(Cartan notation) 



(notation of this book) 


(n.i) 


( 11 . 2 ) 


is constant along any given geodesic in this sense: the vector u at any point, trans- 
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which geodesic. Ihe affine parameter A tells where on a given 
geodesic. The two tangent vectors indicated in the diagram are u — 
d/d\ (Cartan: d&/d\) and n = d/dn (Cartan: c&/dn). 


ported parallel to itself along the geodesic, arrives at a second point coincident in 
direction and length with the u already existing at that point. 

The “separation vector” 



(Cartan notation) 


or 



(notation of this book) 


(113) 


measures the separation between the geodesic n , regarded as the fiducial geodesic, 
and the typical nearby geodesic, n + An (for small An), in the sense that 


(An)n = 




' position ' 

dn 

measures the 


An — 

dn . 

change in 

any 

t function. 


(11.4) 


brought about by transfer of attention from the one geodesic to the other at a fixed 
value of the affine parameter A. This vector is represented by the arrow in the 
first diagram in Box 11.2. 


Separation vector 


(continued on page 270) 
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Box 11.2 GEODESIC DEVIATION REPRESENTED AS AN ARROW 


“Fiducial geodesic” n . Separation vector n An — 
9IU2 leads from point 9 Z on it, to point i? with 
same value of affine parameter X (timelike quan¬ 
tity) on neighboring “test geodesic” n + An. 


Parallel transport of 91% by “Schild’s ladder con¬ 
struction” (Box 10.2) to and £<X. If the test 
geodesic n + An had kept a constant separation 
from the fiducial geodesic its tracer point would 
have arrived at d at the value (X — /IX) of the 
affine parameter, and at # at (X + AX). 


& ^n + An 


Actual location of tracer point of test geodesic at 
values of the timelike affine parameter (X — /IX), 

X, and (X + /IX). 

/ X - A\ 






Confrontation between actual course of tracer 
point on test geodesic and “canonical course”: 
course it would have had to take to keep constant 
separation from the tracer point moving along the 
fiducial geodesic. 


§11.2. RELATIVE ACCELERATION OF NEIGHBORING GEODESICS 


269 




Test geodesic same as before, except for uniform 
stretchout in scale of affine parameter. Any meas¬ 
ure of departure of the actual course of geodesic 
from the canonical course (<? 2 $), to be useful, 
should be independent of this stretchout. Hence, 
take as measure of geodesic deviation, not the 
vector alone, nor the vector ( 2 $^ but the 
stretch-independent combination d 2 = ($<%) + 
Here the sign of addition implies that the 
two vectors have been transported parallel to 
themselves, before addition, to a common point 
(2 in the diagram; L M in the differential calculus 
limit An —0, A\ —0). 


€t 




Alternative courses that the test geodesic of D 
could have taken through 2 (families of geodesics 
characterized by different degrees of divergence 
from the left or convergence towards the right). 
Tilt changes values of (!°J (to <2 C ?) and (to $S) 
individually, but not value of the sum d 2 = ($&) 
+ (<T 0 *) (“lever principle”). 


Note that arrow SBf® is of first order in AX and 
of first order in An; similarly for hence the 
combination d 2 is of second order in AX and first 
order in An. Conclude that the arrow d 2 /(AX) 2 (An) 
is the desired measure of geodesic deviation in the 
sense that: 

size of mesh (ultimately to go to zero) cancels 
out; 

parameterization of test geodesic cancels out; 

slope of test geodesic cancels out. 

Give this arrow the name “ relative-acceleration 
vector and by examining it more closely (Box 
11.3), discover the formula 

d 2 /(AKf(An) = V u n 


for it. 
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Box 11.2 illustrates what it means to speak of geodesic deviation. One transports 
the separation n An = parallel to itself along the fiducial geodesic. The tip of 
this vector traces out the canonical course that the nearby tracer point would have 
to pursue if it were to maintain constant separation from the fiducial tracer point. 
The actual course of the test geodesic deviates from this “canonical” course. The 
deviation, a vector {d9 of Box 11.2), changes with the affine parameter (dty at d, 
0 at 2, $$ at $). The first derivative of this vector with respect to the affine parameter 
is sensitive to the scale of parameterization along the test geodesic, and to its slope 
(Box 11.2, F). Not so the second derivative. It depends only on the tangent vector 
u of the fiducial geodesic, and on the separation vector n An. Divide this second 
Relative-acceleration vector derivative of the deviation by An and give it a name: the “relative-acceleration 

vector Discover (Box 11.3) a simple formula for it 

(relative-acceleration vector) = V u ▼„/». (11.5) 


§11.3. TIDAL GRAVITATIONAL FORCES AND 
RIEMANN CURVATURE TENSOR 

With “relative acceleration” now defined, turn to the “tidal gravitational force” (i.e., 
“spacetime curvature”) that produces it. Use a Newtonian analysis of tidal forces 


( ' ^ 
SECOND COVARIANT DERIVATIVE 


The arrow d 2 in Box 11.2 measures, not the rate of change of the separation of 
the test geodesic n + An from the “canonical course” d2$ as baseline, but the second 
derivative: 

(first derivative at X + — Ax\ = V n = ~ • 

V 2 / AX An AX An 9 

(first derivative at X —- dx) = V n — f~d — £*? _ — d9 . 

V 2 / " AX An AX An 

Transpose to common location X, take difference, and divide it by JX to obtain the 
second covariant derivative with respect to the vector t/; thus 

v v n - + ~~ ( v t/ w )\-j ax 

AX 

(i&tfl -f d9 s ) vectors transported to * 

_ _ common location _ 

~ (4\) 2 An -j£kfAn 

= “relative acceleration vector” for neighboring geodesics. 




J 
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(left half of Box 11.4) to motivate the geometric analysis (right half of same box). 
Thereby arrive at the remarkable equation 

+ [▼„> VJu = 0. (11.6) 

t I 

1 “relative 1 1 “tide-producing 1 
, acceleration’’ gravitational forces’’ 

This equation is remarkable, because at first sight it seems crazy. The term 
involves second derivatives of u , and a first derivative of : 

[V„, ▼> = V„ V u u - ▼„ V n u (11.7) 

It thus must depend on how u and n vary from point to point. But the relative 
acceleration it produces, V u V u n, is known to depend only on the values of u and 
n at the fiducial point, not on how u and n vary (see Box 11.2, F). How is this 
possible? 

Somehow all derivatives must drop out of the tidal-force quantity [▼„, ▼„]«• One 
must be able to regard [▼,▼]... as a purely local, algebraic machine with 
three slots, whose output is a vector. If it is purely local and not differential, then 
it is even linear (as one sees from the additivity properties of V), so it must be a 
tensor. Give this tensor the name Riemann, and give it a fourth slot for inputting 
a 1-form: 

Riemann (... ,C,A,B) = [V A , V B ]C; 

Riemann (a, C,A,B) = <cr, [V A , V B ]C>. 

This is only a tentative definition of Riemann . Before accepting it, one should 
verify that it is, indeed, a tensor. Does it really depend on only the values of A , 
B , C at the point of evaluation, and not on how they are changing there? The answer 
(derived in Box 11.5) is “almost.” It fails the test, but with a slight modification 
it will pass. The modification is to replace the commutator [V A , V s ] by the “curvature 
operator” 


*(*,«) = [▼„,▼,]- V a] , (11.8) 

where is the derivative along the vector [A,B] (commutator of A and B ). 

(<%(A, B) = [V^, V 0 ] for the fields A = n and B = u of the geodesic-deviation 
problem, because [n, u] = 0.) Then the modified and acceptable definition of the 
Riemann curvature tensor is 


Riemann (..., C, A B) = ${A, B)C\ 
Riemann (a, C,A , B) = <a, $?(>!, B)C). 


(11.9) 


To define Riemann thus, and to verify its tensorial character (exercise 11.2), does 
not by any means teach one what curvature is all about. To understand curvature, 
one must scrutinize Riemann from all viewpoints. That is the task of the rest of 
this chapter. 


Tide-producing gravitational 
forces expressed in terms of 
a commutator 


Curvature operator defined 


Riemann curvature tensor 
defined 


{continued on page 275) 
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Box 11.4 RELATIVE ACCELERATION OF TEST PARTICLES— 

GEOMETRIC ANALYSIS PATTERNED ON NEWTONIAN ANALYSIS 


Newtonian Analysis 


Geometric Analysis 


1. Consider a family of test-particle trajectories 1. Consider a family of test-particle trajectories 
xi{t, n) in ordinary, three-dimensional space: (geodesics), ^(A, n), in spacetime: “A” is affine- 

“f” is time measured by particle’s clock, or any parameter, i.e., time measured by particle’s 

clock; “w” is “selector parameter.” clock; “w” is “selector parameter.” 


v * 

■> v=- 

\x \ 


n + 4" 


2. Equation of motion for each trajectory: 2. Geodesic equation for each trajectory: 

/afxA 30 _ 0 V„u = 0. 

V 3 1 2 /_ 

[Looks like first-order equation; is actually 
where 0 is Newtonian potential. second-order because the “u” being differen¬ 

tiated is itself a derivative, u = (3f?/3\) n .] 


3. Take difference between equations of motion 3. Take difference between geodesic equations for 
for neighboring trajectories, n and n + Zlrc, and neighboring geodesics n and n + and take 
take limit as An —0—i.e., take derivative limit as An —0—i.e., take covariant deriva- 

(JLUfei) + 3tl = o. tive 

V dn / ( Lv /„ ▼„[▼„*#] = 0. 


4. When 3/3n acts on second term, rewrite it as 4. There is no second term, so leave equation in 
/ 3 \ _ / dx^\ 3 _ k 3 . form 

\dn/ t \ dn ) t dx k ” 3x fc ’ ▼„ [▼„«] = 0. 


dx k ~ n dx kl 


Thereby obtain 


dr)M(¥). 


, a 2 <P k n 

H-:- r n K — 0. 

dx } dx k 




5. To obtain equation for relative acceleration, 
move (9 /dn) t through both of the (0/3 t) n terms 
(permissible because partial derivatives com¬ 
mute!): 


This is equivalent to 


+ 


d 2 0 

dx j dx k 


_ 


0. 


5. To obtain equation for relative acceleration, 
▼„#», move V n through and through the 

d/d\ of u = d*P/d\: 

a. First step: In V n V u u = 0, move ▼„ through 
V w . The result: 

(▼„▼„ + [▼»*▼„])«= 0. 


“relative 
acceleration 


id V'\ 

VdFJ 


+ 


a 2 <p 


~k — 


= o. 


P 


dx* dx k 

^‘tide-producing 

gravitational forces” 


'commutator; must be included 1 
as protection against possibihty 
that ^ V„ V . 

j_ u n f n u _ | 

b. Second step: Move V„ through d/dX of 
u = d^/dX; i.e., write 

V »-fr = V " u = = ▼„ 

o\ ± 4* dn 





Why? Because symmetry of covariant 
derivative says V n u — V u n = [n , u] 


_ [ji_ JL] _ d 2 _ 3 2 _ 

L _ Ldn’ d\_ dn 0X aX dn 

c. Result: 


f _ I _i___ 

'“relative 1 f “tide-producing 
, acceleration”, gravitational forces”; 

, i.e.. “spacetime curvature” 


j 


f Bor 115 RIEMANN CURVATURE TENSOR 




A. Definition of Riemann Motivated by 
Tidal Gravitational Forces: 

1. Tidal forces (spacetime curvature) produce relative acceleration of test particles 
(geodesics) given by 




( 1 ) 
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r 


Box 11.5 (continued) 


2. This motivates the definition 


Riemann (. .. ,C,A,B) = [▼„, V fl ]C. 

^-[empty slot for inserting a one-form] 


( 2 ) 


B. Failure of this Definition 

1. Definition acceptable only if Riemann (... ,C.A, B) is a linear machine, inde¬ 
pendent of how A, B, C vary from point to point. 

2. Check, in part: change variations of C , but not C itself, at event f? 0 : 

^NEw(^) = /(^ > )COLd(^)* 

t-[arbitrary function except /(ff 0 ) = 1] 

3. Does this change [V^VgJC? Yes! Exercise 11.1 shows 

{ [ Y 4 5 ] ^NEw)at ] ^OLD) 6> 0 = ^OLD ^[A,B]f 


C. Modified Definition of Riemann : 

1. The term causing trouble, ^OLD ^[A. S1 /, can be disposed of by subtracting a 
“correction term” resembling it from Riemann —i.e., by redefining 


Riemann (..., C, >1, £) = &(A, B)C, (3) 

%(A,B) = [V a ,V b ]-V [AM1 (4) 

2. The above calculation then gives a result independent of the “modifying func¬ 
tion”/: 


{$(A, B)C NEW } at ? o — B)C 0 hD } at p o . 


D. Is Modified Definition Compatible with Equation 
for Tidal Gravitational Forces? 

L One would like to write V u V u n + Riemann (..., 1 /, i», u) = 0. 

2. This works just as well for modified definition of Riemann as for original defini¬ 
tion, because 
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$(n,u) — [▼„, ▼„] — 


| 0 because n = (d/dn) x and" 

u — (d/d\) n commute 




Geodesic deviation and tidal forces cannot tell the difference between u ) 

and [V„, V„], nor consequently between old and new definitions of Riemann . 


E. Is Modified Definition Acceptable? 


I.e., is Riemann (..., C y A,B) = &l(A, B)C a linear machine with output independ¬ 
ent of how A , B , C vary near point of evaluation? YES! (See exercise 11.2.) 






Take stock, first, of what one knows already about the Riemann curvature tensor. 

(1) Riemann is a tensor; despite the appearance of V in its definition (11.9), no 
derivatives actually act on the input vectors A, £, and C. (2) Riemann is a (\) tensor; 
its first slot accepts a 1-form; the others, vectors. (3) Riemann is determined entirely 
by V, or equivalently by the geodesics of spacetime, or equivalently by spacetime’s 
parallel transport law; nothing but V and the input vectors and 1-form are required 
to fix Riemann'* output. (4) Riemann produces the tidal gravitational forces that 
pry geodesics (test-particle trajectories) apart or push them together; i.e., it charac¬ 
terizes the “curvature of spacetime”: 

V u n + Riemann (..., i/, #i, u) = 0. (11.10) 

(This “ equation of geodesic deviation” follows from equations 11.6, 11.8, and 11.9, 
and the relation [u, u] = 0.) 

All these facets of Riemann are pictorial (e.g., geodesic deviation; see Boxes 11.2 
and 11.3) or abstract (e.g., equations 11.8 and 11.9 for Riemann in terms of V). 

Riemann s component facet, Components of Riemann 

expressed in terms of 

R a p yS — Riemann (iv a ,e / 3 ,e y ,e s ) = <W°, X(0 r 0 t ) a fi >, (1U1) connection coefficients 

is related to the component facet of V by the following equation, valid in any 
coordinate basis {e a } = {d/dx a }: 


Tide-producing gravitational 
forces expressed in terms of 

Riemann 


Da — 

A - 


df a 


05 


a r a 


dx^ 


07 


0JC* 




05 


^5 1 fir 


( 11 . 12 ) 


(See exercise 11.3 for derivation, and exercise 11.4 for the extension to noncoordinate 
bases.) These components of Riemann , with no sign of any derivative operator 
anywhere, may leave one with a better feeling in one’s stomach than the definition 
(11.8) with its nondifferentiating derivatives! 
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EXERCISES 


Exercise 11. 1- [▼*.▼*]C DEPENDS on derivatives of c 

(Based on Box 11.5.) Let C XEW and C 0LD be vector fields related by 

^-[arbitrary function, except f(i? 0 ) = 1] 


Show that 


{[Tfl’ - {[74’ V s] C OLD}at;P 0 = C OLD V [AS]/ 


Exercise 11.2. PROOF THAT Riemann IS A TENSOR 

Show from its definition (11.8, 11.9) that Riemann is a tensor. Hint: Use the following 
procedure. 

(a) If/($) is an arbitrary function, show that 

$(A y B)fC =f$(A,B)C. 

(b) Similarly show that 

<:/i{JA,B)C = /$(4, B)C and $(4,/S)C = /$(A,B)C. 

(c) Show that $(4, B)C is linear; i.e., 

+ a, B)C = $(A, B)C + $(a, B)C\ 
tt(A, B + 6)C = $(A, S)C + $(A, 6)C; 

$(A,S)(C + c) = $(A,S)C + $(4,S)c. 

(d) Now use the above properties to prove the most crucial feature of $(A, B)C. Modify 
the variations (gradients) of 4, B , and C in an arbitrary manner, but leave A,B C unchanged 
at$ 0 : 


4 —A + o a e a 
B —B + b a e a 
C — y- C 4- c a e a 


*“($), &“($>), c tt ($) arbitrary except 
they all vanish at $ = $ 0 . 


Show that this modification leaves $(4,fl)C unchanged at $ 0 . 
(e) From these facts, conclude that Riemann is a tensor. 


Exercise 11.3. COMPONENTS OF Riemann IN COORDINATE BASIS 

Derive equation (11.12) for the components of the Riemann tensor in a coordinate basis. 
[Solution: 


Ra fiys = Riemann (o/ a , e fi , e y , e B ) 
= <o/ a ,$(e y , e 6 )e 0 ) 

= {w a , (V Y V 5 - ▼,▼*)•*> 


[ standard way to 1 
calculate components] 

[by definition (11.9)] 

[*by definition (11.8) plus 1 
L[e y , e 8 ] = 0 in coord, basis] 


= <«'“> e ^ r>l 0s,y + - (e.r^r^) 

= ir*et. y - e„> + - r'^r^xw, e,y. 


which reduces (upon using <a;“, e M > = 8“^) to (11.12).] 
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Exercise 11.4. COMPONENTS OF RIEMANN 
IN NONCOORDINATE BASIS 

In a noncoordinate basis with commutation coefficients c a p y defined by equation (9.22), derive 
the following equation for the components of Riemann : 

R a p y8 = y — r a p y8 + r a fiy r fi p 8 — r a ll8 r ti p y — r a p fl c y8 tL . (11 . 13 ) 


§11.4. PARALLEL TRANSPORT AROUND A CLOSED CURVE 

What are the effects of spacetime curvature, and how can one quantify them? One 
effect is geodesic deviation (relative acceleration of test bodies), quantified by equa¬ 
tion (11.10). Another effect, almost as important, is the change in a vector caused 
by parallel transport around a closed curve. This effect shows up most clearly in 
the same problem, geodesic deviation, that motivated curvature in the first place. 
The relative acceleration vector 'V u V^/i is also the change 8u in the vector u caused 
by parallel transport around the curve whose legs are the vectors n and u: 

▼„ V„/i = 8u. 

(See Box 11.6 for proof) Hence, in this special case one can write 
8u + Riemann (..., u, n, u) = 0. 

The expected generalization is obvious: pick a closed quadrilateral with legs u Aa 
and v Ab (Figure 11.2; Aa and Ab are small parameters, to go to zero at end of 
discussion). Parallel transport the vector A around this quadrilateral. The resultant 
change in A should satisfy the equation 

8A -b Riemann (..., A, u Aa , v Ab) = 0; (11.14) 

or, equivalently, 

8A + AaAb$l(u,v)A =0; (11.14') 


or, more precisely, 

Lim ( — - ) + Riemann (..., A, u, v) = 0. (11.14") 

ja^oXAaAb/ 

The proof is enlightening, for it reveals the geometric origin of the correction term 
V [w ir] in the curvature operator. 

The circuit of transport (Figure 11.2) is to be made from two arbitrary vector 
fields u Aa and v Ab. However, a circuit made only of these fields has a gap in it, 
for a simple reason. The magnitude of u varies the wrong way from place to place. 
The displacement u Aa that reaches, across at the bottom of the quadrilateral from 


Change in a vector due to 
parallel transport around a 
closed curve: 

Related to geodesic deviation 


Equation for change 


Derivation of equation for 
change 




Figure 11.2. 

The change <5^ in a vector ^ asa result of parallel transport around 
a closed curve. The edges of the curve are the vector fields u Aa and 
v Ab , plus the “closer of the quadrilateral” [v Ab , u Aa] = [ v , u] Aa 
Ab (see Box 9.2). 


one line of iris to another cannot make the connection at the top of the quadrilateral. 
Similarly the iris vary the wrong way from place to place to connect the i/’s. To 
close the gap and complete the circuit, insert the “closer of quadrilaterals” [v Ab, 
u Aa] = [v, u] Aa Ab. (See Box 9.2 for why this vector closes the gap.) 

With the route now specified, the vector A is to be transported around it. One 
way to do this, “geometrical construction” by the method of Schild’s ladder applied 
over and over, is the foundation for planning a possible experiment. For planning 
an abstract and coordinate-free calculation (the present line of action), introduce 
a “fiducial field,” only to take it away at the end of the calculation. Plan: Conceive 
of A, not as a localized vector defined solely at the start of the trip, but as a vector 
field (defined throughout the trip). Purpose: To provide a standard of reference 
(comparison of A transported from the origin with A at the place in question). 
Principle: The standard of reference will cancel out in the end. Procedure: 


( Net change made in taking the vector A, originally localized at the 
start of the circuit, and transporting it parallel to itself (“mobile A ”) 
around the closed circuit. This quantity cannot be evaluated until 
completion of circuit because there is no preexisting standard of 
reference along the way. 


/A quantity subject to analysis for each leg of circuit individually. This \ 
new quantity is defined by introducing throughout the whole region 
a vector field 4 (tield) , smoothly varying, and in agreement at starting 
point with the original localized A, but otherwise arbitrary. This new 
quantity is then given by yi (tield) at starting point (same as >| focalized) 
\at starting point) minus ,4< m obiie) f^ n j s h point (after transit). / 


= 2 
legs of 
circuit 


Change inA ,fieldJ relative toA ,moblle) in the course of transport along \ 
specified leg. Value for any one leg depends on the arbitrary choice 
of A but this arbitrariness cancels out in end because of closure 
of circuit. / 



§11.4. PARALLEL TRANSPORT AROUND A CLOSED CURVE 


279 


/Change in A (field) relative to the parallel-transported A (mobile) as standard\ 

' of reference, made up of contributions along following legs of Figure ' 

11 . 2 : 

v Ab , giving y^4 (field) Ab (on line displaced u Aa from start) 

— v Ab , giving - y,A (field) Ab (on line through starting point) 

— u Aa, giving — V uJ 4 (field) Aa (on line displaced v Ab from start) 

-f -u Aa, giving V u A (field) Aa (on line through starting point) . 

\ + [v, w] Aa Ab, giving A (field) Aa Ab J 

= {▼. ▼„ -▼*▼.+ Aa Ab 

= Riemann (... ,A iheldi , u, v) Aa Ab = ( u, v)A ifield) Aa Ab. ( 11 . 15 ) 


Profit: The curvature operator 

Riemann u, v) = <%(u, v) = [V„, VJ - V [u v] , 


Box 11.6 GEODESIC DEVIATION AND PARALLEL TRANSPORT AROUND 
CLOSED CURVE: TWO ASPECTS OF SAME CONSTRUCTION 


Geodesic Deviation 


V V n — Lim 

4A 


An —> 0 

(See Boxes 11.2 and 11.3) 


f gg -f ffg 1 
o l (d\) 2 j * 


& 




Geodesic Deviation 

Same result; different construction. To 
simplify the connection with closed-curve 
transport, change the tilt and dilate the 
parametrization of geodesic in A. 

The result: B, where 9 and d coincide. 
From F of Box 11.2 one knows (fd + 
m = a# + 0S— i.e. Y u n is the 



Box 11.6 ( continued ) 


same for this family of geodesics as for the original family 


[ ‘Ml 


V Vn = Lim .- 

^ - o l (AXf An 

An 0 7 


) 


Also, to simplify discussion set An = AX = 1, and assume /i and u are small enough 
that one can evaluate V„n without taking the limit: 

V„ V„/i = 


Parallel Transport Around Closed Curve, Performed by 
Same Construction 

Plan: Parallel transport the vector u zlA = counterclockwise around the curve 
& — 91L — y- S. Execution: (1) Call transported vector i/ (m) (“m” 
for “mobile”). (2) At i2, i/ (m) = (3) At f?, i/ (m) = because 9&61 is a geodesic 

and u im) is its tangent vector. (4) At £, i/ (m) = £91 according to Schild’s ladder of 
the picture. (5) At 911, u (m) = 91191 because £9191 is a geodesic and i/ (m) is now its 
tangent vector. (6) At 2, i/ (m) = 2$ according to Schild’s ladder. Result: The change 
in i/ (m) is — $$. Had the curve been circuited in opposite direction (£ —$ —*~ 
2 — y- 911 — y- £), the change would have been -f 

(due to parallel transport up n, out u, down —#», and ^u #**• 

back along-u to starting point 


V 




applied to the vector field A (field) , gives the negative of the change in the localized 
vector A (localized) (called A (mobile) during the phase of travel) on parallel transport 
around the closed circuit. It does not give the change in A (field) on traversal of that 
circuit, for A (field) has the same value at the end of the journey as at the beginning. 
Equation (11.14') expresses that change in terms of the conveniently calculated 
differential operator, 9l(u, v) — [V^, V„] — V [l/If] . Paradox: Neither wanted nor 
evaluated is the change in the quantity A (field) acted on by this operator. Payoff: 
Ostensibly differential in the character of its action on A , the operator Riemann 
if, v) = 9l(u, v) is actually local. Thus, replace the proposed smoothly 
varying vector field A (field) by a quite different but also smoothly varying vector field 
A (field, new) Then the two fields need agree only at the one point in question for them 
to give the same output Riemann (..., A, u, v) = 9l(u, v)A at that point. This one 
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knows from the fact that 8A , the quantity calculated, has an existence and value 
independent of the choice of >l (field) . This one can also verify by detailed calculation 
(exercise 11.2). Power: Although they cancel out in their response to any change 
of A with location, the several differentiations in the curvature operator respond 
directly to the “rate of change of geometry with location” (“geodesic deviation”). 
Prolongation: The closed curve need not be a quadrilateral. The curvature operator 
tells how a vector changes on parallel transport about small curves of arbitrary shape 
(Box 11.7). 


Exercise 11.5. COPLANARITY OF CLOSED CURVES EXERCISE 

Let f x and f 2 be the bivectors (see Box 11.7) for two small closed curves at the same event. 

Show that the curves are coplanar if and only if f x = af 2 for some number a. 


f Box 11.7 THE LAW FOR PARALLEL TRANSPORT ABOUT A CLOSED CURVE 


"\ 


A. Special Case 

Curve is closed quadrilateral formed by vector fields u and v . 

1. Law says (in component form) 

8A a + R a J8yS A^u y v 8 = 0. (1) 

2. On what characteristics of the closed curve does this depend? 

a. Notice that R a /3y8 = — R a p Sy (antisymmetry in last two indices; obvious in 
equation 11.12 for components; also obvious because reversing the direction 
the curve is traversed—i.e., interchanging u and v —should reverse sign of 8A ). 

b. Equation (1) contracts u ® v into these antisymmetric, last two indices. 
The symmetric part of u ® v must give zero. Only the antisymmetric part, 
uAv~u®v — v ® u can contribute: 

SA a + y R a p yS A&(u A v) yS = 0. (2) 

3. This antisymmetric part is a “bivector.” It is independent of the curve’s shape; 
it depends only on (a) the plane the curve lies in, and (b) the area enclosed by 
the curve. [Although without metric “area” is meaningless, “relative areas at an 
event in a given plane” have just as much meaning as “relative lengths at an 




Box 11.7 (continued) 


event along a given direction.” Two vectors at the same event lie on the same 
line if they are multiples of each other; their relative length in that case is their 
ratio. Similarly, two small closed curves at the same event lie in the same plane 
if their bivectors are multiples of each other (exercise 11.5); their relative area 
in that case is the ratio of their bivectors.] 


B. General Case 


Arbitrary but small closed curve. 

1. Break the curve down into a number 
of quadrilaterals, all lying in the same 
plane as the curve. 

2. Traverse each quadrilateral once in the 
same sense as the curve is to be tra¬ 
versed. Result: all interior edges get 
traversed twice in opposite directions 
(no net traversal); the outer edge (the 
curve itself) gets traversed once. 

3. Thus, 5A due to traversing curve is the sum of the 5A ’s from traversal of each 
quadrilateral: 

SA “ = - J 2 *%yS A > A ^for given quadrilateral)^ ■ 

quadrilaterals 

Define the bivector f for the curve as the sum of the bivectors for its component 
quadrilaterals: 

^ ^ ^quadrilateral 

quadrilaterals 



(add “areas”; keep plane the same). 

4. Then 

8A« + ±R^ ysA i 3 fy s = 8A a + R a msi A0/y* = 0. 


C. Warning 

This is valid only for closed curves of small compass: 8A doubles when the area 
doubles; but the error increases by a factor ~2 3/2 [54 oc Aa Ab in calculation of 
§11.4; but error cc(Aa) 2 Ab or Aa(Ab) 2 ]. 
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§11.5. FLATNESS IS EQUIVALENT TO 
ZERO RIEMANN CURVATURE 

To say that space or spacetime or any other manifold is flat is to say that there Flatness of a manifold 
exists a coordinate system { x “(f?)} in which all geodesics appear straight: defined 


x a (\) = a 0 + b a X. 


(11.16) 


(Example: Lorentz spacetime of special relativity, where test bodies move on such 
straight lines.) They can appear so if and only if the connection coefficients in the 
geodesic equation 

(Px& p g dx^ dx" (-V /II «<7\ 


expressed in the same coordinate system, all vanish: 


r 13 = 0 . 

± tiv 


(11.18) 


From the vanishing of these connection coefficients, it follows immediately (equation Flatness implies 
11.12) that all the components of the curvature tensor are zero: Riemann = 0 

= o. - (11.19) 

[Geometric restatement of (11.16) —(11.18) —(11.19): For all geodesics to be 
straight in a given coordinate system means that initially parallel geodesics preserve 
their separation; the geodesic deviation is zero; and therefore the curvature vanishes.] 

Is the converse true? Does zero Riemann curvature imply the existence of a 
coordinate system in which all geodesics appear straight? Yes, as one sees by the 
following construction. 

Transport a vector parallel to itself from to 2, and then back from 2 to Proof that Riemann = 0 
along a slightly different route. It returns to its starting point with no alteration in implies flatness 
magnitude or direction, because Riemann everywhere vanishes. Therefore parallel 
transport of a base vector e M from £P 0 to 2 yields at 2 a base vector that is 
independent, both in magnitude and in direction, of the route of transportation (for 
routes obtainable one from the other by any continuous sequence of deformations). 

As for 2, so for all points of the manifold; and as for the one base vector e^ so 
for a complete set of base vectors (ja = 0, 1,2,3): Parallel transport of a basis 
{e a (^ 0 )} yields everywhere a field of frames (“frame field”), each base vector of 
which suffers zero change (relative to the frame field) on parallel transport from 
any point to any nearby point: thus, 


- 0 ; 


( 11 . 20 ) 


v ,e„ (= ej = 0. 


( 11 . 21 ) 


With the vanishing of these individual derivatives, there also vanishes the commuta¬ 
tor of any two basis-vector fields: 


[ e „, e „] = V f e , - = 0-0 = 0 . 


( 11 . 22 ) 
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The gap in the quadrilateral of Figure 11.2 (there read “e^” for “i/,” “e/’ for ‘V”) 
closes up completely. Thereupon one can introduce coordinates x* 1 , each of which 
increases with a motion in the direction of the corresponding vector field; and with 
appropriate scaling of these coordinates, one can write 


8 

dx* 


(11.23) 


(see exercise 9.9). With this coordinate basis in hand, one can employ the formula 


= e,r^ a 


(11.24) 


Lorentz coordinates exist if 
and only if Riemann = 0 

Flatness does not imply 
Euclidean topology 


to calculate the connection coefficients. From the vanishing of the quantities on the 
left, one concludes that all the connection coefficients on the right (“bending of 
geodesics”) must be zero; so spacetime is indeed flat. 

Summary: Spacetime is flat—i.ethere exist “flat coordinates ” in which = 0 

everywhere and geodesics are straight lines, x a (A) = a* + b a \—if and only if 
Riemann = 0. 

Note: In the spacetime of Einstein, which has a metric, one can choose 
in the above argument to be orthonormal, e^ • e v — at ^ 0 . The resulting field 
of frames will then be orthonormal everywhere, and the resulting coordinate system 
will be Lorentz. Thus, in Einsteinian gravity the above summary can be rewritten: 
spacetime is flat ( there exists a Lorentz coordinate system) if and only if Riemann = 0. 

Warning: Flatness does not necessarily imply Euclidean topology. Take a sheet 
of paper. It is flat. Roll it up into a cylinder. It is still flat, intrinsically. The tracks 
of geodesics over it have not changed. Distances between neighboring points have 
not changed. Only the topology has changed, so far as an observer confined forever 
to the sheet is concerned. (The “extrinsic geometry”—the way the sheet is embedded 
in the surrounding three-dimensional space—has also changed; but an observer on 
the sheet knows nothing of this, and it is not the subject of the present chapter. 
See, instead §21.5.) 

Take this cylinder. Bend it around and glue its two ends together, without changing 
its flat intrinsic geometry. Doing so is impossible if the cylinder remains embedded 
in flat, three-dimensional Euclidean space; perfectly possible if it is embedded in 
a Euclidean space of 4 dimensions. However, embedding is unimportant to observers 
confined to the cylinder, since all they ever measure is intrinsic geometry; so all 
that matters to them is the topological identification of the two ends of the cylinder 
with each other. The result is topologically a torus; but the tracks of geodesics are 
still unchanged; the intrinsic geometry is flat; Riemann vanishes. 

By analogy, take flat Minkowskii spacetime. Pick some'Lorentz frame, and in it 
pick a cube 10 10 light years on each side (0 < x < 10 10 light years; similarly for 
y and z). Identify opposite faces of the cube so that a geodesic exiting across one 
face enters across the other. The result is topologically a three-torus: a “closed 
universe” with finite volume, with flat, Minkowskii geometry, and with a form that 
changes not at all as Lorentz time t passes (no expansion, no contraction). 
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§11.6. RIEMANN NORMAL COORDINATES 


In curved spacetime one can never find a coordinate system with r a /3y = 0 every¬ 
where. But one can always construct local inertial frames at a given event 9 0 ; and 
as viewed in such frames, free particles must move along straight lines, at least 
locally—which means r a /jy must vanish, at least locally. 

A very special and useful realization of such a local inertial frame is a Riemann- 
normal coordinate system. Pick an event 9 a and a set of basis vectors {e a (9 0 )} to 
be used there by an inertial observer. Fill spacetime, near9 0 , with geodesics radiating 
out from 9 0 like the quills of a hedgehog or porcupine. Each geodesic is determined 
by its tangent vector v at 9 0 \ and the general point on it can be denoted 


Riemann normal coordinates: 
a realization of local inertial 
frames 

Geometric construction of 
Riemann normal coordinates 


9 = vy 


t 


affine parameter; 

tells “where” on geodesic 



tangent vector at 9 0 ; 1 
tells “which geodesic”) 


(11.25) 


Actually, this gives more geodesics than are needed. One reaches the same point 
after parameter length if the initial tangent vector is 2v, as one reaches after 
\ if the tangent vector is v\ 

<?(A; V) = § (y A; 2iz) = <?(1; *v). 

Thus, by fixing A = 1 and varying v in all possible ways, one can reach every point 
in some neighborhood of 9 0 . This is the foundation for constructing Riemann normal 
coordinates. Choose an event 9. Find that tangent vector v at 9 () for which 9 = 
§ (1; v). Expand that v in terms of the chosen basis and give its components the 
names x a : 


9 = &(l;x a e a ). 


(11.26) 


The point 9 determines x a uniquely (if 9 is near enough to 9 0 that spacetime 
curvature has not caused geodesics to cross each other). Similarly, x a determines 
9 uniquely. Hence, x a can be chosen as the coordinates of 9 —its “Riemann-normal 
coordinates, based on the event 9 0 and basis {e a (£? 0 )}.” 

Equation (11.26) summarizes Riemann-normal coordinates concisely. Other 
equations, derived in exercise 11.9, summarize their powerful properties: 


Mathematical properties of 
Riemann normal coordinates 


e a (9 0 ) = (d/dx“),j (11-27) 

^% y (^o) = o; ( n - 28 ) 

Z’W^o) = - j ( Ra f)y, + Ra ypJ' ( 1L29 > 


If spacetime has a metric (as it does in actuality), and if the observer’s frame at 
9 0 has been chosen orthonormal (e a - e$ = t] a/3 ), then 
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Sap(f o) = Va/3- 

(11.30) 

SaP,P\) = 0, 

(11.31) 


(11.32) 

~ — —J 


^a/3ys(^o) — Sa&,iQyi^o) “ 

(11.32') 


Other mathematical 
realizations of a local inertial 
frame 

*new(^) = *§ld(^) + corrections of order (*g LD ) 3 . 

Moreover, only those the same to third order, 

*new(^) = *Sld(^) + corrections of order (xg LD ) 4 , 
will preserve the beautiful ties (11.29) and (11.32) to the Riemann curvature tensor. 


Here are components of the Jacobi curvature tensor (see exercise 11.7). 

Is this the only coordinate system that is locally inertial at (i.e., has r a j3y = 0 
there) and is tied to the basis vectors e a there (i.e., has d/dx a = e a there)? No. But 
all such coordinate systems (called "normal coordinates ”) will be the same to second 
order: 


EXERCISES Exercise 11.6. SYMMETRIES OF Riemann 

(To be discussed in Chapter 13). Show that Riemann has the following symmetries: 

^ a / 3 ys = (antisymmetric on last 2 indices) (11.33a) 

R a [py&) — 0 (vanishing of completely antisymmetric part) (11.33b) 

Exercise 11.7. GEODESIC DEVIATION MEASURES ALL 
CURVATURE COMPONENTS 

The equation of geodesic deviation, written up to now in the form 

▼ w 4- Riemann (..., i/, u, u) = 0 
or 

▼i , ▼„» 4- #(/i, u)u — 0, 

also lets itself be written in the Jacobi form V u V u n 4- /(i/, u)n = 0. Here j (i/, v), the “Jacobi 
curvature operator/* is defined by 

/(«, v)n = y [$(#», u)v + #(#», ir)u], (11.34) 

and is related to the “Jacobi curvature tensor” by 


Jacobi (..., n, u, v) = /(u, v)n, 


(11.35) 
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which implies 




avfi 


+ Rfl Bva)' 


(11.36) 


(a) Show that J\ afiy) = 0 follows from R\ py = R* alfiyy 

(b) Show that by studying geodesic deviation (allowing arbitrary u and n in ▼„/? + 
/(«/, u)n = 0) one can measure all components of Jacobi. 

(c) Show that Jacobi contains precisely the same information as Riemann. [Hint: show 
that 


= J (11.37) 

this plus equation (11.36) for J* raP proves “same information content”.] Hence, by studying 
geodesic deviation one can also measure all the components of Riemann. 

(d) Show that the symmetry of R \ tap , = 0 is essential in the equivalence between Jacobi 
and Riemann by exhibiting proposed values for for which R\„ am jt 0, 

and from which one would find J^ va ^ = 0. 

Exercise 11.8. GEODESIC DEVIATION IN GORY DETAIL 

Write out the equation of geodesic deviation in component form in a coordinate system. 
Expand all covariant derivatives (semicolon notation) in terms of ordinary (comma) deriva¬ 
tives and in terms of r *s to show all r and 3 terms explicitly. 

Exercise 11.9. RIEMANN NORMAL COORDINATES IN GENERAL 

Derive properties (11.27), (11.28), (11.29), (11.31), (11.32), and (11,32') of Riemann normal 
coordinates. Hint: Proceed as follows. 

(a) From definition (11.26), derive = e a . 

(b) Similarly, from definition (11.26), show that each of the curves x a = v a X (where the 
v a are constants) is a geodesic through ^ 0 , with affine parameter A. 

(c) Show that r a /3y (^P 0 ) = 0 by substituting x a = v a X into the geodesic equation. 

(d) Since the curves x a = v a X are geodesics for every choice of the parameters v a , they 
provide not only a geodesic tangent u = (3/3A) rt , but also several deviation vectors 
W (a) = (3/3i> a ) x . Compute the components of these vectors in the Riemann normal coordi¬ 
nate system, and substitute into the geodesic deviation equation as written in exercise 11.8. 

(e) Equate to zero the coefficients of the zeroth and first powers of A in the geodesic 
deviation equation of part (d), using 

+ o(r-), 

which is a Taylor series for r. In this way arrive at equation (11.29) for r a pyit in terms 
of the Riemann tensor. 

(f) From equations (11.28), (11.29), and (8.24) for the connection coefficients in terms 
of the metric, derive equations (11.31), (11.32), and (11.32'). 

Exercise 11.10. BIANCHI IDENTITIES 

Show that the Riemann curvature tensor satisfies the following “Bianchi identities” 

*%[*;«] =°. (H.38) 

The geometric meaning of these identities will be discussed in Chapter 15. [Hint: Perform 
the calculation at the origin of a Riemann normal coordinate system.] 
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11. GEODESIC DEVIATION AND SPACETIME CURVATURE 


Exercise 11.11. CURVATURE OPERATOR ACTS ON 1-FORMS 

Let #(i/, if) be the operator $(i/, if) = [V„. V„] — V[„. when acting on 1-forms a (or other 
tensors) as well as on tangent vectors. Show that 

<#(*/, if)a, iv> = — (a, #(i/, v)w >. 

Exercise 11.12. ROTATION GROUP: RIEMANN CURVATURE 

[Continuation of exercises 9.13, 9.14, and 10.17.] Calculate the components of the Riemann 
curvature tensor for the rotation group’s manifold 50(3); use the basis of generators {e a }. 
[Answer: 

Ra W = (H.39) 

where is the permutation symbol defined in equation (3.501): 

s«$ = (s« Y s^ - sv\). 

Note that this answer is independent of location in the group manifold.] 



CHAPTER 


NEWTONIAN GRAVITY IN THE 
LANGUAGE OF CURVED SPACETIME 


The longest period of time for which a modern painting has hung 
upside down in a public gallery unnoticed is 47 days. This 
occurred to Le Bateau by Matisse in the Museum of Modern Art, 
New York City . In this time 116,000 people 
had passed through the gallery. 

McWHIRTER AND McWHIRTER (1971) 




§12.1. NEWTONIAN GRAVITY IN BRIEF 


The equivalence principle is not unique to Einstein’s description of the facts of 
gravity. What is unique to Einstein is the combination of the equivalence principle 
and local Lorentz geometry. To return to the world of Newton, forget everything 
discovered in the last century about special relativity, light cones, the limiting speed 
of light, and proper time. Return to the “universal time” t of earlier centuries. In 
terms of that universal time, and of rectangular, “Galilean” space coordinates, 
Newtonian theory gives for the trajectories of neutral test particles 


d 2 x ’ 90 _ 

dt 2 dx> ~ ’ 


( 12 . 1 ) 


0 (sometimes denoted —U) = Newtonian potential. (12.2) 


Customarily one interprets these equations as describing the “curved paths” x’(t) 
along which test particles fall in Euclidean space (not spacetime). These curved paths 
include circular orbits about the Earth and the parabolic trajectory of a baseball. 
Cartan (1923, 1924) asks one to abandon this viewpoint. Instead, he says, regard 
these trajectories as geodesics fr(X), x J (A)] in curved spacetime. (This change of 
viewpoint was embodied in Figures B and C of Box 1.6.) Since the “affinely ticking” 


- ^ 

This chapter is entirely 
Track 2. Chapters 9-11 are 
necessary preparation for it. 

It is not needed for any 
later chapter, but it will be 
helpful in 

(1) Chapter 17 (Einstein field 
equations) and 

(2) Chapters 38 and 39 
(experimental tests and 
other theories of gravity). 

_ J 

Newtonian gravity: original 
formulation 


Newtonian gravity: 
translation into language of 
curved spacetime 
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12. NEWTONIAN GRAVITY IN LANGUAGE OF CURVED SPACETIME 


EXERCISES 


Newtonian clocks carried by test particles read universal time (or some multiple, 
A = at + b, thereof), the equation of motion (12.1) can be rewritten 


*1 = 0 , 

dA 2 


dbc[ _ _ 

dA 2 T dx’\dA t 


+ l^(4) ! 


0 . 


(12.3) 


By comparing with the geodesic equation 

d 2 x a /d\ 2 4- r a p y (dxP/d\)(dxy/d\) = 0, 

one can read off the values of the connection coefficients: 

ri Q0 = 00/0 x’; all other r a ^ y vanish. (12.4) 

And by inserting these into the standard equation (11.12) for the components of 
the Riemann tensor, one learns (exercise 12.1) 


R’oko = -R*wk = dx j dx k ; al1 other Ra py8 vanish. (12.5) 

Finally, the source equation for the Newtonian potential 

V 2 0==2 <P fjj = 4t7 P (12.6) 

i 

one can rewrite with the help of the "Ricci curvature tensor ” 

R a p = (contraction of Riemann) (12.7) 

in the geometric form (exercise 12.2) 

R 00 = 477p; all other R a p vanish. (12.8) 

Equation (12.4) for r a p y , equation (12.5) for R a p y8 , equation (12.8) for R a ^ plus 
the law of geodesic motion are the full content of Newtonian gravity, rewritten in 
geometric language. 

It is one thing to pass quickly through these component manipulations. It is quite 
another to understand fully, in abstract and pictorial terms, the meanings of these 
equations and the structure of Newtonian spacetime. To produce such understanding, 
and to compare Newtonian spacetime with Einsteinian spacetime, are the goals of 
this chapter, which is based on the work of Cartan (1923, 1924), Trautman (1965), 
and Misner (1969a). 


Exercise 12.1. RIEMANN CURVATURE OF NEWTONIAN SPACETIME 

Derive equation (12.5) for R a p y5 from equation (12.4) for 

Exercise 12.2. NEWTONIAN FIELD EQUATION 

Derive the geometric form (12.8) of the Newtonian field equation from (12.5) through (12.7). 
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§12.2. STRATIFICATION OF NEWTONIAN SPACETIME 

Galileo and Newton spoke of a flat, Euclidean “absolute space” and of an “absolute 
time,” two concepts distinct and unlinked. In absolute space Newtonian physics took 
place; and as it took place, absolute time marched on. No hint was there that space 
and time might be two aspects of a single entity, a curved “ spacetime” —until Einstein 
made the unification in relativity physics, and Cartan (1923) followed suit in New¬ 
tonian physics in order to provide clearer insight into Einstein’s ideas. 

How do the absolute space of Galileo and Newton, and their absolute time, fit 
into Cartan’s “Newtonian spacetime”? The key to the fit is stratification ; stratification 
produced by the universal time coordinate t . 

Regard t as a function (scalar field) defined once and for all in Newtonian space- 
time 

t = t(9). (12.9) 

Without it, spacetime could not be Newtonian, for “r” is every bit as intrinsic to 
Newtonian spacetime as the metric “gr” is to Lorentz spacetime. The layers of 
spacetime are the slices of constant t —the “space slices ” —each of which has an 
identical geometric structure: the old “absolute space.” 

Adopting Cartan’s viewpoint, ask what kind of geometry is induced onto each 
space slice by the surrounding geometry of spacetime. A given space slice is endowed, 
by the Galilean coordinates of §12.1, with basis vectors e j = d/dx’; and this basis 
has vanishing connection coefficients, r* kl = 0 [cf. equation (12.4)]. Consequently, 
the geometry of each space slice is completely flat . 

“Absolute space” is Euclidean in its geometry, according to the old viewpoint, 
and the Galilean coordinates are Cartesian. Translated into Cartan’s language, this 
says: not only is each space slice (t — constant) flat, and not only do its Galilean 
coordinates have vanishing connection coefficients, but also each space slice is en¬ 
dowed with a three-dimensional metric, and its Galilean coordinate basis is orthonormal, 

* e j = ( d/dx *) * (3/3*0 = (12.10) 

If the space slices are really so flat, where do curvature and geodesic deviation 
enter in? They are properties of spacetime. Parallel transport a vector around a closed 
curve lying entirely in a space slice; it will return to its starting point unchanged.. 
But transport it forward in time by At, northerly in space by Ax k , back in time by 
— At , and southerly by — Ax k to its starting point; it will return changed by 

i.e., 

SA° = 0, SA’ = - Ri 00k A°(At)(Ax k ) = ff —A ( \At)(Ax k ). (12.11) 
Geodesics of a space slice (Euclid’s straight lines) that are initially parallel remain 


The geometry of Newtonian 
spacetime: 

"Universal time" as a scalar 
field 


Space slices with Euclidean 
geometry 


Curvature acts in spacetime, 
not in space slices 
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12. NEWTONIAN GRAVITY IN LANGUAGE OF CURVED SPACETIME 


EXERCISE 


Galilean coordinates 


always parallel. But geodesics of spacetime (trajectories of freely falling particles) 
initially parallel get pried apart or pushed together by spacetime curvature, 

▼„ r u n + #(», u)u = 0 , 

or equivalently in Galilean coordinates: 

n° — dn°/dt = 0 initially ==►- n° = 0 always; 


+ - o 

dt 2 0jc fc 


(12.12a) 

(12.12b) 


(see Box 12.1 and exercise 12.3). 


Exercise 12.3. GEODESIC DEVIATION DERIVED 

Produce a third column for Box 11.4, one that carries out the “geometric analysis” in 
component notation using the Galilean connection coefficients (12.4) of Newtonian spacetime. 
Thereby achieve a deeper understanding of how the geometric analysis parallels the old 
Newtonian analysis. 


§12.3. GALILEAN COORDINATE SYSTEMS 

The Lorentz spacetime of special relativity has an existence and structure completely 
independent of any coordinate system. But a special property of its geometry (zero 
curvature) allows the introduction of a special class of coordinates (Lorentz coordi¬ 
nates), which cling to spacetime in a special way 

(c/dx a ) • (d/dx* 3 ) - ?j a/? everywhere. 

By studying these special coordinate systems and the relationships between them 
(Lorentz transformations), one learns much about the structure of spacetime itself 
(breakdown in simultaneity; Lorentz contraction; time dilatation; ...). 
defined Similarly for Newtonian spacetime. Special properties of its geometry (explored 
in abstract later; Box 12.4) permit the introduction of special coordinates (Galilean 
coordinates), which cling to spacetime in a special way 

= Hf)\ 

(d/dx>)-(d/dx k ) = 8 jk ; 

ri Q0 = $ ■ for some scalar field P y and all other r a p y vanish. 

To understand Newtonian spacetime more deeply, study the relations between these 
Galilean coordinate systems. 



Box 12.1 GEODESIC DEVIATION IN NEWTONIAN SPACETIME 


Coordinate system for calculation: Galilean space 
coordinates x’ and universal time coordinate t. 
General component form of equation: 


£>V 

dX 2 


+ R a 






Special conditions for this calculation: let the par¬ 
ticles’ clocks (affine parameters) all be normalized 
to read universal time, X = t. This means that the 
separation vector 

n a = (dx a /dn) x 

between geodesics has zero time component, 
n° = 0; i.e., in abstract language, 

(dt, n ) = t a n a — n° — 0; 

i.e., in geometric language, n lies in a space slice 
(surface of constant t). 

Evaluation of covariant derivative: 



Dn a _ dn a p a a dx* _ dn a 
DX dX dX ~ dX’ 


[0 unless ji = OJ-* unless is a space index ( n° = 0)] 

D 2 n a __ d(Dn a /dX) dn? dx* d(Dn a /dX) d 2 n a d 2 n a 

dX 2 - dX + dX dX - dx = lx^^ ~dF' 


I—[si 


since X = t] 


[0 unless = 0}—l L[0 unless ft is space index] 

Evaluation of tidal accelerations: 


K dx n dX~" 


R* 


dx& v dx 8 


since R* oko and R* 00k are only nonzero components. 

d 2 <P u 


_ __ d j dt k dt n ) 

**-d\ l dx ~ 0k0 dX dX ~ oko 


_ 


dx*dx k 


4 O unless y is space index] 

4for y a space index: 0 unless — <5 = 0] 

Resultant equation of geodesic deviation: 


d 2 n° 

dt 2 


~ 0 


d 2 n* 


d 2 4> 


( agrees with result n° = 0 always, which \ 
followed from choice X = t for all particles/ 

/agrees with Newton-type calculation^ 


- - fc _ 0 - vr---“ 

dx* dx k \m Box 11.4; see also exercise 12.3 / 
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12. NEWTONIAN GRAVITY IN LANGUAGE OF CURVED SPACETIME 


Point of principle: how can one write down the laws of gravity and properties 
of spacetime in Galilean coordinates first (§12.1), and only afterward (here) come 
to grip with the nature of the coordinate system and its nonuniqueness? Answer: 
(a quotation from §3.1, slightly modified): “Here and elsewhere in science, as 
emphasized not least by Henri Poincare, that view is out of date which used to say 
'Define your terms before you proceed/ All the laws and theories of physics, includ¬ 
ing Newton’s laws of gravity, have this deep and subtle character, that they both 
define the concepts they use (here Galilean coordinates) and make statements about 
these concepts.” 

The Newtonian laws of gravity, written in a Galilean coordinate system 

;c 0 = t, (3/3*0 * (d/9* fc ) = S jk 

make the statement ‘T J 00 = 4> . and all other r a ^ y = 0” about the geometry of 
spacetime. This statement in turn gives information about the relationships between 
different Galilean systems. Let one Galilean system (* a (^)} be given, and seek the 
most general coordinate transformation leading to another, (* a '(^)}. The following 
constraints exist: (1) x 0 ' = x° = t (both time coordinates must be universal time); 
(2) at fixed t (i.e., in a fixed space slice) both sets of space coordinates must be 
Euclidean, so they must be related by a rotation and a translation: 


x’" = A fk x k + a* 


^-[translation] 

1 —[rotation matrix, i.e., Ay t A wl — dy k >] 
x k = Ay k x*' — a k y with a k = Ay k a j '. 


(12.13a) 


(12.13b) 


The rotation and translation might, a priori, be different on different slices, Ay k = 
Ay k {t) and a J = a\t)\ but (3) they must be constrained by the required special form 
of the connection coefficients. Calculate the connection coefficients in the new 
coordinate system, given their form in the old. The result (exercise 12.4) is: 


n' 0 , k , = = Ay t A k ^ (produces “Coriolis forces”); 

d<P 

r* = 


O'O' - 3^7 + A j'k( A i'kX - ?*); 


(12.14) 


•4 


[“centrifugal forces’ 


all other f a vanish 


4“inertial forces”] 


(“Euclidean” index conventions; repeated space indices to be summed even if both 
are down; dot denotes time derivative). These have the standard Galilean form (12.4) 
if and only if 


Ay k — 0 , 

I" Newtonian potential in 
Lnew coordinate system 


<p' = <p — a k x k -b constant. 

] t \l Newtonian potential in' 
Lold coordinate system _ 


(12.15) 
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These results can be restated in words: any two Galilean coordinate systems are 
related by (1) a time-independent rotation of the space grid {same rotation on each 
space slice), and (2) a time-dependent translation of the space grid (translation 
possibly different on different slices) 

x* = A yk x k + a?{t). (12.16) 

[constant] t t_[time-dependent] 

The Newtonian potential is not a function defined in spacetime with existence 
independent of all coordinate systems. (There is no coordinate-free way to measure 
it.) Rather, it depends for its existence on a particular choice of Galilean coordinates; 
and if the choice is changed via equation (12.16), then <P is changed: 

<P' = <P - d k x k . (12.17) 

(By contrast, an existence independent of all coordinates is granted to the universal 
time tffP) and the covariant derivative V.) 

Were all the matter in the universe concentrated'in a finite region of space and 
surrounded by emptiness (“island universe”), then one could impose the global 
boundary condition * 


<P —0 as r = {x k x k ) 1/2 —oo. (12.18) 

This would single out a subclass of Galilean coordinates {“absolute” Galilean coor¬ 
dinates), with a unique, common Newtonian potential. The transformation from one 
absolute Galilean coordinate system to any other would be 

x r = A yk x k + a r 4- v r t (12.19) 

constant j 

rotation 

I constant I 
| displacement] 

(“Galilean transformation”). But , (1) by no local measurements could one ever 
distinguish these absolute Galilean coordinate systems from the broader class of 
Galilean systems (to distinguish, one must integrate the locally measurable quantity 
& j = ri Q0 out to infinity); and (2) astronomical data deny that the real universe 
is an island of matter surrounded by emptiness. 

It is instructive to compare Galilean coordinates and Newtonian spacetime as 
described above with Lorentz coordinates and the Minkowskii spacetime of special 
relativity, and with the general coordinates and Einstein spacetime of general rela¬ 
tivity; see Boxes 12.2 and 123. 


constant 

velocity 


Transformations linking 
Galilean coordinate systems 


Newtonian potential depends 
on choice of Galilean 
coordinate system 


Absolute Galilean coordinates 
defined 


Transformations linking 
absolute Galilean coordinate 
systems 


(continued on page 298) 
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Box 12.3 NEWTONIAN GRAVITY A LA CARTAN, AND E1NSTEIN1AN GRAVITY: 
COMPARISON AND CONTRAST 


Property 


Newton-Cartan 


Einstein 


Idea in brief (formulations of 
the equivalence principle of 
very different scope) 

Laws of motion of free parti¬ 
cles in a local, freely 
falling, nonrotating frame are 
identical to Newton’s laws of 
motion as expressed in a 
gravity-free Galilean frame 

Laws of physics in a local, 
freely falling, nonrotating 
frame are identical with the 
laws of physics as formulated 
in special relativity in a 

Lorentz frame 

Idea even more briefly stated 

Point mechanics simple in a 
local inertial frame 

Everything simple in a local 
inertial frame 

Consequence (tested to one 
part in 10 11 by Roll-Krotkov- 
Dicke experiment) 

Test particles of diverse 
composition started with same 
initial position and same 
initial velocity follow the 
same world line (‘‘definition 
of geodesic”) 

Test particles of diverse 
composition started with same 
initial position and same 
initial velocity follow the 
same world line (“definition 
of geodesic”) 

Another consequence 

In every local region, there 
exists a local frame (“freely 
falling frame”) in which all 
geodesics appear straight (all 

r %, = °) 

In every local region there 
exists a local frame (“freely 
falling frame”) in which all 
geodesics appear straight 
(all r\„ = 0) 

Consequence of way light rays 
travel in real physical world? 

Disregarded or evaded. All 
light rays have same velocity? 
Speed depend on motion of 
source? Speed depend on motion 
of observer? Possible to move 
fast enough to catch up with 
a light ray? No satisfactory 
position on any of these 
issues 

Spacetime always and every¬ 
where has local Lorentz 
character 

Summary of spacetime structure 

Stratified into spacelike 
slices; geometry in each slice 
Euclidean; each slice charac¬ 
terized by value of universal 
time (geodesic parameter); 
displacement of one slice with 
respect to another not spec¬ 
ified; no such thing as a 
spacetime interval 

No stratification. Well- 
defined interval between every 
event and ever)’ nearby event; 
spacetime has everywhere local 

Lorentz character, with one 
local frame (specific space 
and time axes) as good as 
another (other space and time 
axes); “homogeneous” rather 
than stratified 

This structure expressed in 
mathematical language 

rvs. yes; spacetime metric 
£«.■* no: 

r a u ;s have no independent 
existence; all derived from 


?d) 

/ ' , oo = tt ('= 

ex' 

pa - 1 (liil 

** 8 2\ dx» 


all other f 9 ^,, vanish 

, ^SfJLV \ 

djc*' CX& / 


(“metric theory of gravity”) 
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12. NEWTONIAN GRAVITY IN LANGUAGE OF CURVED SPACETIME 


EXERCISES 




Figure 1 2.1. 

The coordinate system carried by an orbital laboratory 
as it moves in a circular orbit about the Earth. 


Exercise 12.4. CONNECTION COEFFICIENTS FOR ROTATING, 

ACCELERATING COORDINATES 

Beginning with equation (12.4) for the connection coefficients of a Galilean coordinate system 
{ x a (^)}, derive the connection coefficients (12.14) of the coordinate system {x a '(^)} of 
equations (12.13). From this, verify that (12.15) are necessary and sufficient for { x a \^)} to 
be Galilean. 

Exercise 12.5. EINSTEIN S ELEVATOR 

Use the formalism of this chapter to discuss “Einstein’s elevator”—i.e., the equivalence of 
“gravity” to an acceleration of one’s reference frame. Which aspects of “gravity” are equiva¬ 
lent to an acceleration, and which are not? 

Exercise 12.6. GEODESIC DEVIATION ABOVE THE EARTH 

A manned orbital laboratory is put into a circular orbit about the Earth [radius of orbit = r 0 , 
angular velocity = w = ( M/r 0 s ) u 2 —why?]. An astronaut jetisons a bag of garbage and 
watches it move along its geodesic path. He observes its motion relative to (non-Galilean) 
space coordinates {jc j '(^)} which—see Figure 12.1—(1) are Euclidean at each moment of 
universal time [(9/9x/) • (d/dx k ‘) = 8 jk ], (2) have origin at the laboratory’s center, (3) have 
d/dx' pointing away from the Earth, (4) have d/dx' and d/d/ in the plane of orbit. Use 
the equation of geodesic deviation to calculate the motion of the garbage bag in this coordi¬ 
nate system. Verify the answer by examining the Keplerian orbits of laboratory and garbage. 
Hints: (1) Calculate R a 'p,y 8 , in this coordinate system by a trivial transformation of tensorial 
components. (2) Use equation (12.14) to calculate r a ^y at the center of the laboratory (i.e., 
on the fiducial geodesic). 


§1 2.4. GEOMETRIC, COORDINATE-FREE FORMULATION 
OF NEWTONIAN GRAVITY 

To restate Newton’s theory of gravity in coordinate-independent, geometric language 
is the principal goal of this chapter. It has been achieved, thus far, with extensive 
assistance from a special class of coordinate systems, the Galilean coordinates. To 
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climb out of Galilean coordinates and into completely coordinate-free language is 
straightforward in principle. One merely passes from index notation to abstract 
notation. 

Example: Restate in coordinate-free language the condition r° af3 = 0 of Galilean 
coordinates. 

Solution: Write r° afi = - < cfr, e a > ; the vanishing of this for all a means 
▼g dt = 0 for all /?, which in turn means V u dt — 0 for all u . In words: the gradient 
of universal time is covariantly constant. 

By this process one can construct a set of coordinate-free statements about New¬ 
tonian spacetime (Box 12.4) that are completely equivalent to the standard, non¬ 
geometric version of Newton's gravitation theory. From standard Newtonian theory, 
one can deduce these geometric statements (exercise 12.7); from these geometric 
statements, regarded as axioms, one can deduce standard Newtonian theory (exercise 
12 . 8 ). 


Exercise 12.7. FROM NEWTON TO CARTAN 

From the standard axioms of Newtonian theory (last part of Box 12.4) derive the geometric 
axioms (first part of Box 12.4). Suggested procedure: Verify each of the geometric axioms 
by a calculation in the Galilean coordinate system. Make free use of the calculations and 
results in §12.1. 

Exercise 12.8. FROM CARTAN TO NEWTON 

From the geometric axioms of Newtonian theory (first part of Box 12.4) derive the standard 
axioms (last part of Box 12.4). Suggested procedure: (1) Pick three orthonormal, spatial basis 
vectors (e ; - with fiy • e k = 5^.) at some event Parallel transport each of them by arbitrary 
routes to all other events in spacetime. 

(2) Use the condition $(i/, n)e^ = 0 for all u and u [axiom (3)] and an argument like 
that in §11.5 to conclude: (a) the resultant vector fields e j are independent of the arbitrary 
transport routes, (b) Ve # = 0 for the resultant fields, and (c) [ee k ] = 0. 

(3) Pick an arbitrary “time line”, which passes through each space slice (slice of constant 
t) once and only once. Parametrize it by t and select its tangent vector as the basis vector 
e 0 at each event along it. Parallel transport each of these e 0 ’s throughout its respective space 
slice by arbitrary routes. 

(4) From axiom (4) conclude that the resultant field is independent of the transport routes; 
also show that the above construction process guarantees V y e 0 = V 0 e. = 0. 

(5) Show that [e a , e^] = 0 for all pairs of the four basis-vector fields, and conclude from 
this that there exists a coordinate system (“Galilean coordinates”) in which e a - d/dx a (see 
§11.5 and exercise 9.9). 

(6) Show that in this coordinate system e ; • = 8 }k everywhere (space coordinates are 

Euclidean), and the only nonzero components of the connection coefficient are F ; 00 ; here 
axioms (6) and (2) will be helpful. 

(7) From the self-adjoint property of the Jacobi curvature operator (axiom 7) show that 

^ ; 0 k 0 = s h° w that in terms of the connection coefficients this reads r j 00k = r k 00j : 

and from this conclude that there exists a potential & such that /"%, = <P 

(8) Show that the geometric field equation (axiom 5) reduces to Poisson’s equation 
V 2 0 = 4 7T P . 

(9) Show that the geodesic equation for free fall (axiom 8) reduces to the Newtonian 
equation of motion d 2 x’/dt 2 4- 0 j = 0. 


Coordinate-free, geometric 
axioms for Newton's theory 
of gravity 


EXERCISES 


(continued on page 302) 
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Box 12.4 NEWTONIAN GRAVITY: GEOMETRIC FORMULATION 
CONTRASTED WITH STANDARD FORMULATION 


Geometric Formulation 


Newton’s theory of gravity and the properties of Newtonian spacetime can be derived 
from the following axioms. (For derivation see exercise 12.8.) 


(1) There exists a function t called “universal time”, and a symmetric covariant 
derivative V (with associated geodesics, parallel transport law, curvature opera¬ 
tor, etc.). 

(2) The 1-form dt is covariantly constant; i.e., 

V w dt = 0 for all u. 


[Consequence: if w is a spatial vector field (i.e., w lies everywhere in a surface of 
constant t\ i.e. {dt, w) = 0 everywhere), then is also spatial for every u , 


(dt, V„w> = V„ (dt, w) - <V„ dt, w) = 0.] 


[0 always]J 


I—[0 always] 


(3) Spatial vectors are unchanged by parallel transport around infinitesimal closed 
curves; i.e., 


$(u, ri)w = 0 if w is spatial, for every u and n. 

(4) All vectors are unchanged by parallel transport around infinitesimal, spatial, 
closed curves; i.e., 


${v, w) = 0 for every spatial v and w. 

(5) The Ricci curvature tensor, R a/S = has the form 

Ricci — 4Tip dt & dt , 
where p is the density of mass. 

(6) There exists a metric defined on spatial vectors only, which is compatible 
with the covariant derivative in this sense: for any spatial w and v , and for 
any u whatsoever, 

V u (w v) - (V u w)‘ v + w(V„v). 

[Note: axioms (1), (2), and (3) guarantee that such a spatial metric can exist; 
see exercise 12.9.] 
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(7) The Jacobi curvature operator j(u, /?), defined for any vectors w, n , p by 

jf(u,n)p = y[$(p,/»)t# + #(p, *#)#»], 

is “self-adjoint” when operating on spatial vectors; i.e., 

v[jf(u,n)w] = w • [/(«#, n)v] for all spatial v, w ; 

and for any £/, n. 

(8) “Ideal rods” measure the lengths that are calculated with the spatial metric; 
“ideal clocks” measure universal time t (or some multiple thereof); and “freely 
falling particles” move along geodesics of V. [Note: this can be regarded as a 
definition of “ideal rods,” “ideal clocks,” and “freely falling particles.” A more 
complete theory (e.g., general relativity; see §16.4) would predict in advance 
whether a given physical rod or clock is ideal, and whether a given real particle 
is freely falling.] 

Note: For an alternative but equivalent set of axioms, see pp. 106-107 of Trautman 
(1965). 


Standard Formulation 


The following standard axioms are equivalent to the above. 


(1) There exist a universal time f, a set of Cartesian space coordinates x> (called 
“Galilean coordinates”), and a Newtonian gravitational potential <P . 

(2) The density of mass p generates the Newtonian potential by Poisson’s equation, 


V 2 <P = 


d 2 <P 

dx j 


= 4v7p. 


(3) The equation of motion for a freely falling particle is 


drx* 
dr 2 



= 0 . 


(4) “Ideal rods” measure the Galilean coordinate lengths; “ideal clocks” measure 
universal time. 
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12. NEWTONIAN GRAVITY IN LANGUAGE OF CURVED SPACETIME 


Exercise 12.9. SPATIAL METRIC ALLOWED BY OTHER AXIOMS 

Show that the geometric axioms (1), (2), and (3) of Box 12.4 permit one to introduce a spatial 
metric satisfying axiom (6). Hint: Pick an arbitrary spatial basis {e,} at some event. Define 
it to be orthonormal. e i *e k = 8 }k . Extend this basis through all spacetime by the method 
used in (1) of exercise 12.8. Define • e k = 8 jk everywhere in spacetime for this basis. Then 
prove that the resulting metric satisfies the compatibility condition of axiom (6). 

Exercise 12.10. SPACETIME METRIC FORBIDDEN BY OTHER AXIOMS 

Show that in Newtonian spacetime it is impossible to construct a nondegenerate spacetime 
metric g, defined on all vectors, that is compatible with the covariant derivative in the sense 
that 


▼„£r(n. P) = g(\n, p) + g(n, V u p). (12.20) 

Note: to prove this requires mastery of the material in Chapter 8 or 13; so study either 8 
or 13 before tackling it. Hint: Assume that such a g exists. Show, by the methods of exercise 
12.8, that in a Galilean coordinate system the spatial components g jk are independent of 
position in spacetime. Then use this and the form of R *^ y5 in Galilean coordinates to prove 
R mo and — R 0ik0 are not identical, a result that conflicts with the symmetries of the Riemann 
tensor [eq. (8.45)] in a manifold with compatible metric and covariant derivative. 


§12.5. THE GEOMETRIC VIEW OF PHYSICS: 
A CRITIQUE 


The principle of general 
covariance has no forcible 
content 


Twentieth-century viewpoint 
judges a theory by simplicity 
of its geometric formulation 


Einstein's theory of gravity is 
simple; Newton's is complex 


An important digression is in order. 

“Every physical quantity must be describable by a (coordinate-free) geometric 
object, and the laws of physics must all be expressible as geometric relationships 
between these geometric objects.” This view of physics, sometimes known as the 
“principle of general covariance/ 3 pervades twentieth-century thinking. But does it 
have any forcible content? No, not at all, according to one viewpoint that dates back 
to Kretschmann (1917). Any physical theory originally written in a special coordinate 
system can be recast in geometric, coordinate-free language. Newtonian theory is 
a good example, with its equivalent geometric and standard formulations (Box 12.4). 
Hence, as a sieve for separating viable theories from nonviable theories, the principle 
of general covariance is useless. 

But another viewpoint is cogent. It constructs a powerful sieve in the form of a 
slightly altered and slightly more nebulous principle: “Nature likes theories that are 
simple when stated in coordinate-free, geometric language.”* According to this 
principle, Nature must love general relativity, and it must hate Newtonian theory. 
Of all theories ever conceived by physicists, general relativity has the simplest, most 
elegant geometric foundation [three axioms: ( 1 ) there is a metric; ( 2 ) the metric is 
governed by the Einstein field equation G = 877 T; (3) all special relativistic laws 
of physics are valid in local Lorentz frames of metric]. By contrast, what diabolically 


* Admittedly, this principle is anthropomorphic: twentieth-century physicists like such theories and 
even find them effective in correlating observational data. Therefore, Nature must like them too! 
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clever physicist would ever foist on man a theory with such-a complicated geometric 
foundation as Newtonian theory? 

Of course, from the nineteenth-century viewpoint, the roles are reversed. It judged 
simplicity of theories by examining their coordinate formulations. In Galilean 
coordinates, Newtonian theory is beautifully simple. Expressed as differential equa¬ 
tions for the metric coefficients in a specific coordinate system, Einstein’s field 
equations (ten of them now!) are horrendously complex. 

The geometric, twentieth-century view prevails because it accords best with exper¬ 
imental data (see Chapters 38-40). In Chapter 17 it will be applied ruthlessly to 
make Einstein’s field equation seem compelling. 



CHAPTER 


RIEMANNIAN GEOMETRY: 

METRIC AS FOUNDATION OF ALL 

Philosophy is written in this great book (by which / mean the 
universe) which stands always open to our view, but it cannot 
be understood unless one first learns how to comprehend the 
language and interpret the symbols in which it is written, and its 
symbols are triangles, circlas, and other geometric figures, 
without which it is not humanly possible to comprehend even 
one word of it; without these one wanders in a dark labyrinth . 

GALILEO GALILEI (1623) 




This chapter is entirely Track 2. 
Chapters 9-11 are necessary 
preparation for it. It will be 
needed as preparation for 

(1) the second half of 
Chapter 14 (calculation 
of curvature), and 

(2) the details, but not the 
message, of Chapter 15 
(Bianchi identities). 

§13.6 (proper reference 
frame) will be useful 
throughout the applications of 
gravitation theory (Chapters 
18-40). 


Constraints imposed on 
spacetime by special relativity 


§13.1. NEW FEATURES IMPOSED ON GEOMETRY BY 
LOCAL VALIDITY OF SPECIAL RELATIVITY 

Freely falling particles (geodesics) define and probe the structure of spacetime. This 
spacetime is curved. Gravitation is a manifestation of its curvature. So far, so good, 
in terms of Newton’s theory of gravity as translated into geometric language by 
Cartan. What is absolutely unacceptable, however, is the further consequence of 
the Cartan-Newton viewpoint (Chapter 12): stratification of spacetime into slidable 
slices, with no meaning for the spacetime separation between an event in one slice 
and an event in another. 

Of all the foundations of physics, none is more firmly established than special 
relativity; and of all the lessons of special relativity none stand out with greater 
force than these. (1) Spacetime, far from being stratified, is homogeneous and 
isotropic throughout any region small enough (“local region”) that gravitational 
tide-producing effects (“spacetime curvatures”) are negligible. (2) No local experi¬ 
ment whatsoever can distinguish one local inertial frame from another. (3) The speed 
of light is the same in every local inertial frame. (4) It is not possible to give 
frame-independent meaning to the separation in time (“no Newtonian stratifica- 
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tion”). (5) Between every event and every nearby event there exists a frame- 
independent, coordinate-independent spacetime interval (“Riemannian geometry”). 
(6) Spacetime is always and everywhere locally Lorentz in character (“local Lorentz 
character of this Riemannian geometry”). 

What mathematics gives all these physical properties? A metric: a metric that is 
locally Lorentz (§§13.2 and 13.6). All else then follows. In particular, the metric 
destroys the stratified structure of Newtonian spacetime, as well as its gravitational 
potential and universal time coordinate. But not destroyed are the deepest features 
of Newtonian gravity: (1) the equivalence principle (as embodied in geodesic de¬ 
scription of free-fall motion, §§13.3 and 13.4); and (2) spacetime curvature (as 
measured by tidal effects, §13.5). 

The skyscraper of vectors, forms, tensors (Chapter 9), geodesics, parallel transport, 
covariant derivative (Chapter 10), and curvature (Chapter 11) has rested on crum¬ 
bling foundations—Newtonian physics and a geodesic law based on Newtonian 
physics. But with metric now on the scene, the whole skyscraper can be transferred 
to new foundations without a crack appearing. Only one change is necessary: the 
geodesic law must be selected in a new, relativistic way; a way based on metric 
(§§13.3 and 13.4). Resting on metric foundations, spacetime curvature acquires 
additional and stronger properties (the skyscraper is redecorated and extended), 
which are studied in §13.5 and in Chapters 14 and 15, and which lead almost 
inexorably to Einstein’s field equation. 


§13.2. METRIC 

A spacetime metric; a curved spacetime metric; a locally Lorentz, curved spacetime 
metric. This is the foundation of spacetime geometry in the real, physical world. 
Therefore take a moment to recall what “metric” is in three contrasting languages. 

In the language of elementary geometry, “metric” is a table giving the interval 
between every event and every other event (Box 13.1 and Figure 13.1). In the 
language of coordinates, “metric” is a set of ten functions of position, g^(.x a ), such 
that the expression 

As 2 = -Jr 2 = g llv {x a )Ax^Ax v (13.1) 

gives the interval between any event Jt a and any nearby event x a -f Ax a . In the 
language of abstract differential geometry, metric is a bilinear machine, 
g =“(. to produce a number [“scalar product g(u , v) = (u * ir)”] out of two 

tangent vectors, u and v. 

The link between the abstract, machine viewpoint and the concrete coordinate 
viewpoint is readily exhibited. Let the tangent vector 

<f = Ax a e a = Ax a (d/dx a ) 

represent the displacement between two neighboring events. The abstract viewpoint 
gives 

As- = f ■ f = g(Ax l 'e v ) = Ax* Ax r g(e a . e r ) 

U oniinued on page MO) 


Metric: the instrument which 
imposes the constraints 


Metric described in three 
languages 
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Box 13.1 METRIC DISTILLED FROM DISTANCES 


Raw Data on Distances 

Let the shape of the earth be described as in Figure 13.1, by giving distances between 
some of the principal identifiable points: buoys, ships, icebergs, lighthouses, peaks, 
and flags: points to a total of n = 2 X 10'. The total number of distances to be 
given is n(n — l)/2 = 2 x 10 14 . With 200 distances per page of printout, this means 


First 

Second 

Distance 

First 

point 

point 

(Nautical miles) 

point 

9,316,434 

14,117,103 

1410.316 

9,316,434 

9,316,434 

14.117,104 

1812.717 

9,316,43^ 

9,316,434 

^ *3 1 r A-y a 

14,117,105 

1629.291 

9.3’ 


10 12 pages weighing 6 g each, or 6 x 10 6 metric tons of data. With 6 tons per truck 
this means 10 6 truckloads of data; or with one truck passing by every 5 seconds, 
5 x 10 6 seconds or 2 months of night and day traffic to get in the data. 



First Distillation: Distances to Nearby Points Only 


Get distances between faraway points by adding distances covered on the elementary 
short legs of the trip. Boil down the table of distances to give only the distance 
between each point and the hundred nearest points. Now have 100 n = 2 X 10 9 
distances, or 2 x 10 9 /200 = 10 7 pages of data, or 60 tons of records, or 10 truckloads. 

Second Distillation: Distances Between Nearby 
Points in Terms of Metric 

Idealize the surface of the earth as smooth. Then in any sufficiently limited region 
the geometry is Euclidean. This circumstance has a happy consequence. It is enough 
to know a few distances between the nearby points to be able to determine all the 
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2 


distances between the nearby points. Locate point 2 so that (102) is a right triangle; 
thus, (12) 2 = (10) 2 + (20) 2 . Consider a point 3 close to 0. Define 

x(3) = (13) - (10) 

and 

V(3) = (23) - (20). 

Then the distance (03) does not have to be supplied independently; it can be 
calculated from the formula* 

(03 ) 2 = [x(3)] 2 + [/(3)] 2 . 

Similarly for a point 4 and its distance (04) from the local origin 0. Similarly for 
the distance (mri) between any two points m and n that are close to 0: 

(mn) 2 — [x(m) - x(n)] 2 + [y(m) — y(n)] 2 . 

Thus it is only needful to have the distance (bn) (from point 1) and (2m) (from 
point 2) for each point m close to 0 (m = 3, 4,..., N -j- 2) to be able to work out 


*If the distance (03) is given arbitrarily, the resulting four-vertex figure will burst out of the plane. 
Regarded as a tetrahedron in a three-dimensional Euclidean space, it has a volume given by the lormula 
of Niccolo Fontana Tartaglia (1500-1557). generalized today (Blumenthai 1953) to 


/ volume of \ 


0 1 

1 

1 

1 

/i-dimensional 

i /(-l) ft+1 \ 1/4 - 1 ’ 

1 0 

(01)- 

(02)- . 

. (0;,)- 

simplex 
spanned by 

H 2- ) „■ 

1 (10)- 

0 

(12)- . 

. (1/iV- 

\ in + 1) points j 


1 (n0)- 

(fll)- 

(nl) 1 . 

0 


which reduces for three points to the standard textbook formula of Hero of Alexandria (a.d. 62 to 
A.D. 150). 


area = — (01)][.? — (02>][.v — (12)]} 1 '-. 


2s = (01) + (02) + (12). 


for the area of a triangle. Conversely, if the four points are to remain in two-dimensional Euclidean 
space, the tetrahedron must collapse to zero volume, This requirement supplies one condition on the 
one distance (03). It simplifies the discussion of this condition to take (03) small and (102) to be a right 
triangle, as above. However, the general principle is independent of such approximations, and follows 
direetlv from the extended Hero-Tartaglia formula. It is enough in a locally Euclidean or Lorentz space 

of n dimensions to have laid down (n -f l) fiducial points 0. 1.2. n. and to know the distance of 

everv other point j, k ,. . . from these fiducial points, in order to be able to calculate the distance of 
these points j. k. . . . from one another ("distances between nearby points in terms of coordinates”; metric 
as distillation of distance data). 
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its distance from every point n close to 0. The prescription to determine the 
N(N — l)/2 distances between these N nearby points can be reexpressed to advan¬ 
tage in these words: (1) each point has two coordinates, a - and^y; and (2) the distance 
is given in terms of these coordinates by the standard Euclidean metric; thus 

(As) 2 = (Ax) 2 + (Ay) 2 , 

Having gone this far on the basis of “distance geometry” (for more on which, see 
Robb 1914 and 1936), one can generalize from a small region (Euclidean) to a large 
region (not Euclidean). Introduce any arbitrary smooth pair of every- 
where-independent curvilinear coordinates x k , and express distance, not only in the 
immediate neighborhood of the point 0, but also in the immediate neighborhood 
of every point of the surface (except places where one has to go to another coordinate 
patch; at least two patches needed for 2-sphere) in terms of the formula 

ds 2 = g jk dx* dx k . 

Thus out of the table of distances between nearby points one has distilled now five 
numbers per point (two coordinates, a 1 , a 2 , and three metric coefficients, g n , g 12 — 
g 2V and g 22 ), down by a factor of 100/5 = 20 from what one had before (now 3 
tons of data, or half a truckload). 


Box 13.1 (continued) 


Third Distillation; Metric Coefficients Expressed as Analytical 
Functions of Coordinates 

Instead of giving the three metric coefficients at each of the 2 x 10 7 points of the 
surface, give them as functions of the two coordinates a 1 , a 2 , in terms of a power 
series or an expansion in spherical harmonics or otherwise with some modest number, 
say 100, of adjustable coefficients. Then the information about the geometry itself 
(as distinct from the coordinates of the 2 x 10 7 points located on that geometry) 
is caught up in these three hundred coefficients, a single page of printout. Goodbye 
to any truck! In brief, metric provides a shorthand way of giving the distance between 
every point and every other point—but its role, its justification and its meaning lies 
in these distances and only in these many distances. 
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Figure 13.1. 

Distances determine geometry. Upper left: Sufficiently great tidal forces, applied to the earth with tailored 
timing, have deformed it to the shape of a tear drop. Lower left: This tear drop is approximated by 
^ polyhedron built out of triangles (“skeleton geometry”). The approximation can be made arbitrarily 
good by making the number of triangles sufficiently great and the size of each sufficiently small. Upper 
right: The geometry in each triangle is Euclidean: giving the three edge lengths fixes all the features 
of the figure, including the indicated angle. Lower right: The triangles that belong to a given vertex, 
laid out on a fiat surface, fail to meet. The deficit angle measures the amount of curvature concentrated 


angle 


Mauritius 

► Polygon laid out 
fiat fails to 
close 

• Gap measures 
curvature 
concentrated 
at Cairo 


at that vertex on the tear-drop earth. The sum of these deficit angles for all vertices of the tear drop 
equals 4 n. This “Gauss-Bonnet theorem” is valid for any figure with the topology of the 2-sphere; for 
the simplest figure of all, a tetrahedron, four vertices with a deficit angle at each of 180° are needed—3 
triangles x 60° per triangle available = 180° deficit. In brief, the shape of the tear drop, in the given 
skeleton-geometry approximation, is determined by its 50 visible edge lengths plus, say, 32 more edge 
lengths hidden behind the figure, or a total of 82 edge lengths, and by nothing more (“distances determine 
geometrvLV“Metric” letls the distance between every point and every nearby point. If volcanic action 
raises Rejkjavik, the distances between that Icelandic capital and nearby points increase accordingly; 
distances again reveal shape. Conversely, that there is not a great bump on the earth in the vicinity 
of Iceland, and that the earth does not now have a tear-drop shape, can be unambiguously established 
by analyzing the pattern of distances from point to point in a sufficiently well-distributed network of 
points, with no call for any observations other than measurements of distance. 
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Covariant components of 
metric 


"Line element" compared 
with "metric as bilinear 
machine" 


Metric produces a 
correspondence between 
1-forms and tangent vectors 


Lowering indices 


for the interval between those events; comparison with the coordinate viewpoint 
[equation (13.1)] reveals 

gw = gie^ev) = (13.2) 

(standard equation for calculating components of a tensor). 

Just as modern differential geometry replaces the old style “differential” df by 
the “differential form” df (Box 2.3, page 63), so it also replaces the old-style “line 
element” 

ds 2 = g^ v dx* dx v = (“interval between x a and -f dx a ”) (13.3) 

by the bilinear machine (“metric tensor”) 

g = ds 2 = dx* ® dx*. (13.4) 

The output g ((, f) of this machine, for given displacement-vector input, is identical 
to the old-style interval. Hence, ds 2 = g^ v dx* ® dx v represents the interval of an 
unspecified displacement; and the act of inserting f into the slots of ds 2 is the act 
of making explicit the interval g(( , () = g^, Ax* Ax 1 ’ of an explicit displacement. 

In curved spacetime with metric, just as in flat spacetime with metric (§2.5), a 
particular 1-form u corresponds to any given tangent vector u : 

u is defined by “<&, v) = g(u , v) for all v ” (13.5) 

(“representation of the same physical quantity in the two alternative versions of 
vector and 1-form”; “corresponding representations” as (J)-tensor and as (J)-tensor). 
Example: the 1-form u corresponding to a basis vector u = e a has components 


[ standard way 1_J 

to compute J 

[definition (13.5)}— 1 


M /s = <". ®/?> = g(u, e 0 ) = g(e a , e fi ) = g a/3 ; 


1 —[equation (13.2)] 

1 [by u = ej 


thus 


g a p ujfi is the 1-form e a corresponding to e a . (13.6) 

Also as in flat spacetime (§3.2), a tensor can accept either a vector or a 1-form into 
any given slot 

S(u , a, v) = S(w, cr, v). (13.7) 

Equivalently, in component language, the indices of a tensor can be lowered with 
the covariant components of the metric 


s a*y 7 s (®«>®y) = S(e«> e y) = S{g atl w^, w*, e y ) = g ail S^ r (13.8) 
^definition of S a & y ] [by equation (13.6)] 
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The basis vectors { e a } can be chosen arbitrarily at each event. Therefore the 
corresponding components g a/3 of the metric are quite arbitrary (though symmetric: 
gad = £/?«)• But the mixed components g a p are not arbitrary. In particular, equations 
(13.5) and (13.7) imply 

g(u, v) = g{u , v) = (u, v). (13.9) 

Therefore one concludes that the metric tensor in mixed representation is identical 
with the unit matrix: 


S% = e d) = **> = S<y (13.10) 

This feature of the metric in turn fixes the contravariant components of the metric: 

r%/2 = g* fi = 5%; (13.11) 

l-[“lowering an index” of g afi ] 

i.e., 


Ilg a/3 il is the matrix inverse of ilg a/3 ll. (13.12) 

This reciprocity enables one to undo the lowering of tensor indices (i.e., raise indices) 
with g a &\ 

s* y = KS'fiy = g^SvaS^y = g^S/y. (13.13) 

The last two paragraphs may be summarized in brief: 

0 ) g\ = 

(2) ll^ll = ll^ll-i; 

(3) tensor indices are lowered with g a/} ; 

(4) tensor indices are raised with g afi . 

In this formalism of metric and index shuffling, a big question demands attention: 
how can one tell whether the metric is locally Lorentz rather than locally Euclidean 
or locally something else? Of course, one criterion (necessary; not sufficient!) is 
dimensionality—a locally Lorentz spacetime must have four dimensions. (Recall the 
method of §1.2 to determine dimensionality.) Confine attention, then, to four¬ 
dimensional manifolds. What else must one demand? One must demand that at 
every event 9 there exist an orthonormal frame (orthonormal set of basis vectors 
{©i}) in which the components of the metric have their flat-spacetime form 

gat 5 = ©5 • ets = Vaf) = diagonal (-1,1,1,1). (13.14) 

To test for this is straightforward (exercise 13.1). (1) Search for a timelike vector 
u (u • u < 0). If none exist, spacetime is not locally Lorentz. If one is found, then 
(2) examine all non-zero vectors v perpendicular to u. If they are all spacelike 
(v • v > 0), then spacetime is locally Lorentz. Otherwise it is not. 


Mixed and contravariant 
components of metric 


Raising indices 


Metric must be locally 
Lorentz 
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EXERCISES 


Exercise 13.1. TEST WHETHER SPACETIME IS LOCAL LORENTZ 

Prove that the above two-step procedure for testing whether spacetime is locally Lorentz 
is valid: i.e.. prove that if the procedure says “yes," then there exists an orthonormal basis 
Safi — 'Oafi at event in question; if it says “no," then no such basis exists. 

Exercise 13.2. PRACTICE WITH METRIC 

A four-dimensional manifold with coordinates v, r, 0. <p has line element (old-style notation) 
ds 2 =-(!-- 2 M/r) dv 2 + 2 dv dr + r 2 (dd 2 + sin-0 d$ 2 ), 
corresponding to metric (new-style notation) 
ds 2 = —(1 — 2 M/r) dv dv + dv ® dr + dr ® dv + r 2 (d8 <8> d8 + sin 2 0 d$ <8> </<>), 
where M is a constant. 

(a) Find the “covariant” components g af3 and “contravariant” components g afi of the metric 
in this coordinate system. [Answer: g vv = -(1 - 2 M/r), g vr = g n = 1, g ee = r 2 , g = r 2 
sin 2 0; all other g a/} vanish; g vr = g™ = 1, g" = (1 - 2 M/r), g ee = r~ 2 , g** = r ~ 2 sin~ 2 d, 
all other g afi vanish.] 

(b) Define a scalar field t by 

t=v - r - 2M\n[(r/2M) - 1]. 

What are the covariant and contravariant components (u a and u a ) of the 1-form u = dP. 
What is the squared length u 2 = u • u, of the corresponding vector? Show that u is timelike 
in the region /* > 2M. [Answer: u v = 1 , u r = -1/(1 - 2 M/r), u e - u 0 = 0; u v = - 1/(1 - 
2 M/r), u r = 0, u 6 = u* = 0; u 2 = - 1/(1 - 2M/r).] 

(c) Find the most general non-zero vector w orthogonal to u in the region r > 2M , and 
show that it is spacelike. Thereby conclude that spacetime is locally Lorentz in the region 
r > 2 M. [Answer: Since w u = w a u a — — w v /{\ — 2M/r ), w v must vanish, but uy, w 9 , 
are arbitrary, and w 2 = (1 - 2M/r)w 2 q- r~ 2 w e 2 q- r~ 2 sin _2 0w 0 2 > 0.] 

(d) Let r, r, 0, <p be new coordinates for spacetime. Find the line element in this coordinate 
system. [Answer: This is the “Schwarzschild" line element 

ds 2 = — (1 — dt 2 H- -—■ — ■■■■—h r 2 d6 2 q- r 2 sin 2 0 d<p 2 ,] 

\ r / 1 — 2 M/r 

(e) Find an orthonormal basis, for which g = ?] a/J in the region r > 2M. [Answer: 

e 6 = 0 — 2 M/r)~ 1/2 d/dt , e- = (1 — 2 M/r) 1/2 d/dr, e$ = r _1 0/30, = (r sin 0) _1 d/d<p.] 


§13.3. CONCORD BETWEEN GEODESICS OF 

CURVED SPACETIME GEOMETRY AND STRAIGHT 
LINES OF LOCAL LORENTZ GEOMETRY 

More could be said about the mathematical machinery and physical implications 
of “metric,” but an issue of greater urgency presses for attention. What has metric 
(or spacetime interval) to do with geodesic (or world line of test particle)? Answer: 
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Two mathematical objects (“straight line in a local Lorentz frame” and “geodesic 
of the over-all global curved spacetime geometry”) equal to the same physical object 
(“world line of test particle”) must be equal to each other (“condition of consist¬ 
ency”). As a first method to spell out this consistency requirement, examine the two 
mathematical representations of the world line of a test particle in the neighborhood 
of a given event *P 0 . The local-Lorentz representation says: 

“Pick a local Lorentz frame at [As spelled out in exercise 13.3, such a local 
Lorentz frame is the closest thing there is to a global Lorentz frame at #> 0 ; i.e., 
it is a coordinate system in which 

o) = Va /3 (flat-spacetime metric), (13.15a) 

gafi,y(? o) = 0, (13.15b) 

gafi, ysC&o) 7” 0 except in special cases, such as flat space.] (13.15c) 

The world line in that frame has zero acceleration, 


d 2 x a /dr 2 = 0 at £? 0 (“straight-line equation”), (13.16) 

where t is proper time as measured by the particle’s clock.” 

The geodesic representation says 

“In the local Lorentz frame, as in any coordinate frame, the world line satisfies 
the geodesic equation 

d 2 x a /dr 2 + r a p y (dxP/dT)(dx y /dr) = 0 (13.17) 

(t is an affine parameter because it is time as measured by the test particle’s clock).” 
Consistency of the two representations for any and every choice of test particle (any 
and every choice of dx a /dr at ^ 0 ) demands 

y(^o) = 0 in any local Lorentz frame [coordinate 

system satisfying equations (13.15) at ^ 0 ]; (13.18) 

i.e., it demands that every focal Lorentz frame is a local inertial frame. (On local 
inertial frames see §11.6.) In such a frame, all local effects of “gravitation” disappear. 
That is the physical shorthand for (13.18). 

One does not have to speak in the language of a specific coordinate system when 
one demands identity between the geodesic (derived from the r a (iy ) and the straight 
line of the local Lorentz geometry (g^). The local Lorentz specialization of coordi¬ 
nates may be the most immediate way to see the physics (“no local effects of 
gravitation”), but it is not the right way to formulate the basic mathematical re¬ 
quirement in its full generality and power. The right way is to demand 


Vg = 0 (“compatibility of g and V”). (13.19) 

Stated in the language of an arbitrary coordinate system, this requirement reads 


Safi ; 


d Safi 


;Y ■ 


dx y 




Local-Lorentz description of 
straight lines 


Geodesic description of 
straight lines 


Condition of consistency: 
r a 0 y = 0 in local Lorentz 
frame 


Consistency reformulated: 
Vg = 0 . 


(13.19') 
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That this covariant requirement is fulfilled in every coordinate system follows from 
its validity in one coordinate system: a local Lorentz frame. (The first term in this 
equation, and the last two terms, are separately required to vanish in the local 
Lorentz frame at point & 0 —and required to vanish by the physics.) From Vg = 0, 
one can derive both the abstract chain rule 

▼„ (v w) = (V u v) • w + v-(V u w) (13.20) 

r a p y expressed in terms of (Exercise 13.4) and the following equations for the connection coefficients in any 
metnc frame in terms of (1) the metric coefficients, g a(S — e a • and (2) the covariant 

commutation coefficients 

c af3y = <■«/&ty = <» M . K- y ( 13 - 21 ) 

of that frame: 


ra Hy = S a ' lF ^y (definition of r^ y ), 

F iif}y = ~2 "*■ C fi/3y S/J-Yji d" c iiy/3 ~ Sfiy.n ~ c fiy^ 

= J (gufi,y + gny,p - gfiy.n) in an y coordinate frame. 


(13.22) 


(13.23) 


(See Exercise 13.4). 

Equations (13.23) are the connection coefficients required to make the geodesics of 
curved spacetime coincide with the straight lines of the local Lorentz geometry. And 
they are fixed uniquely; no other choice of connection coefficients will do the job! 

Summary: in curved spacetime with a local Lorentz metric, the following seemingly 
different statements are actually equivalent: (1) the geodesics of curved spacetime 
coincide with the straight lines of the local Lorentz geometry; (2) every local Lorentz 
frame [coordinates with g a ^ 0 ) = g a p y y (f? 0 ) = 0] is a local inertial frame 

[^V(^ 0 ) = the metric and covariant derivative satisfy the compatibility 

condition Vg = 0; (4) the covariant derivative obeys the chain rule (13.20); (5) the 
connection coefficients are determined by the metric in the manner of equations 
(13.23). A sixth equivalent statement, derived in the next section, says (6) the 
geodesics of curved spacetime coincide with world lines of extremal proper time. 


EXERCISES Exercise 13.3. MATHEMATICAL REPRESENTATION OF LOCAL 

LORENTZ FRAME 

By definition, a local Lorentz frame at a given event is the closest thing there to a global 
Lorentz frame. Thus, it should be a coordinate system with an d with as many 

derivatives of g^ v as possible vanishing at & 0 . Prove that there exist coordinates in which 
gnp(& o) = V and W*o) = °» hut that 0 ) cannot vanish in general. Hence, such 

coordinates are the mathematical representation of a local Lorentz frame. [Hint: Let {x a (^)) 
be an arbitrary but specific coordinate system, and {x M (^)} be a local Lorentz frame, both 
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with origins at & 0 . Expand the coordinate transformation between the two in powers of x> 1 

x a ' = x* + j N a ^x" + 1 P a ^x»x' , x° + ...; 

and use the transformation matrix L a \ = dx a '/dx» to get $„„(!? 0 ). 0 ), and g^ pc {9 0 ) 

in terms of and its derivatives and the constants M a ^ N a flv , P a flvp . Show that whatever 
g a ' 0 > may be (so long as it is nonsingular, so g a '&' exists!), one can choose the 16 constants 
M a fi to make g^ y = (ten conditions); one can choose the 4 X 10 = 40 constants N y 

to make the 10 X 4 = 40 g fJLVtP (9 {0 ) vanish; but one cannot in general choose the 4 X 20 = 
80 pa n»P t0 make the 10 x 10 = 100 g^ v pa vanish.] 

Exercise 13.4. CONSEQUENCES OF COMPATIBILITY BETWEEN g AND ▼ 

(a) From the condition of compatibility Vg = 0, derive the chain rule (13.20). 

(b) From the condition of compatibility Vg = 0 and definitions (13.21) and (13.22), derive 
equation (13.23) for the connection coefficients. [Answer: See exercise 8.15, p. 216.] 


§13.4. GEODESICS AS WORLD LINES OF 
EXTREMAL PROPER TIME 

In a local Lorentz frame, it is easy to distinguish a world line that is straight from 
one that is not. Position the Lorentz frame and so orient it that the starting point 
of the world line, d, lies at the origin and the end point, lies at x = 0, y — 0, 
z = 0, t = T. As an example of a nonstraight world line, consider passage at uniform 
velocity from £ to point 9 with coordinates Q T\ 0,0, ±R) and from there again with 
uniform velocity to point $. The lapse of proper time from start to finish (“length 
of world line”) is 

r = (T 2 — R 2 ) 1/2 . 

Thus the lapse of proper time is diminished from its straight-line value, and dimin¬ 
ished moreover for any choice of R whatsoever, except for the zero or straight-line 
value R = 0. As for this simple nonstraight curve, so also for any other nonstraight 
curve: the lapse of proper time between 3 and 9 is less than the straight-line lapse 
(Exercise 6.3). Thus, in flat spacetime, extremal length of world line is an indicator 
of straightness. 

Any local region of the curved spacetime of the real, physical world is Lorentz 
in character. In this local Lorentz geometry, it is easy to set up Lorentz coordinates 
and carry out the extremal-length analysis just sketched to distinguish between a 
straight fine and a nonstraight line: 

dr — 

a maximum for straight line \ 
as compared to any variant of 1. (13.24) 

the straight line J 



f (-t? dx* dx p ) 1/2 
V 



t 

♦ 



In flat spacetime, straight 
lines have extremal length 

Extremal length in curved 
spacetime 
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Proof that curves of extremal 
length are geodesics 


Such a test for straightness can be carried out separately in each local Lorentz region 
along the world line, or, with greater efficiency, it can be carried out over many 
local Lorentz regions simultaneously, i.e., over a region with endpoints d and if) 
so widely separated that no single Lorentz frame can possibly contain them both. 
To carry out the analysis, one must abandon local Lorentz coordinates. Therefore 
introduce a general curvilinear coordinate system and find 


= f dr = f (-&»• dx^dx") 1 ' 2 


/an extremum for timelike world line that 1 
is straight in each local Lorentz frame 
along its path, as compared to any “nearby” 
\ variant of this world line , 


(13.25) 


In the real world, the path of extremal r, being straight in every local Lorentz frame, 
must be a geodesic of spacetime. 

Notice that the word “maximum” in equation (13.24) has been replaced by 
“extremum” in the statement (13.25). When S and are widely separated, they 
may be connected by several different geodesics with differing lapses of proper 
time (Figure 13.2). Each timelike geodesic extremizes r with respect to nearby de¬ 
formations of itself, but the extremum need not be a maximum. When several 
distinct geodesics connect two events, the typical one is not a local maximum (“moun¬ 
tain peak”) but a saddle point (“mountain pass”) in such a diagram as Figure 13.2 
or 13.3. 

Concord between locally straight lines (lines of extremal r) and geodesics of curved 
spacetime demands that timelike geodesics have extremal proper length. If so, then 
any curve x ti (X) between d (where X = 0) and © (where X = 1) that extremizes r 
should satisfy the geodesic equation. To test for an extremal by comparing times, 
pick a curve suspected to be a geodesic, and deform it slightly but arbitrarily: 


original curve, x* = a^{X)\ 
deformed curve, x^ = a fl (X) + Sa fi (X). 


Along either curve the lapse of proper time is 




dxv dx v \ 1/2 
dX dX ) 


dX. 


(13.26) 


(13.27) 


At fixed X the metric coefficient g„„[x a (A)] differs from one curve to the other by 




(13.28) 


and the components dx”/dX of the tangent vector differ by 


(£) 


' dx" \ _ d(a p + Sa p ) da” _ d 

~~d\ ~d\ ~ ~d\ { ’ 8aV) ' 


( 13 . 29 ) 
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These changes in g^ and dx"/dX, at fixed A, produce corresponding changes in the 
lapse of proper time in equation (1327): 


i ( ~g^(da>"/dX)d(Sa")/dX - y ( g^ „Sa a )(da^/dX){da"/dX) 

St = f - - - 

J oj l-g ys (dav/d\)(da*/d\))^ J 

Integrate the first term by parts. Strike out the end-point terms, because both paths 
must pass through a and $ (Sa* = 0 at A = 0 and A = 1). Thus find 

„ r x=1 T da y da*~\ 1/2 

Sr= Lo A(A)SaC r^ix^x] dx • (13 - 30 > 

Here the/ ff (“force terms”) in the integrand are abbreviations for the four expressions 


1 

d 

da" 

Sav ~dX 

1 3*„r 

2 dx” 

da a da" 

dX dX 

da y da 8 

L ^ Yi dx dx. 

1/2 dx | 

da y da h 1 1/2 

. SyS dX dX J 

[— £y* 

o» 


(13.31) 


An extremum is achieved, and the first-order change 8r vanishes for every first-order 
deformation 8a a (X) from an optimal path x a = when the four quantities f a 

that multiply the 8a a all vanish. Thus one arrives at the four conditions 


/.(A) - 0 (13.32) 

for the determination of an extremal world line. (An alternative viewpoint on the 
extremization is spelled out in Figure 13.3.) 

Sufficient these four equations are, but independent they are not, by reason of a 
“bead argument” (automatic vanishing of 8r for any set of changes that merely slide 
points, like beads, along an existing world line). The operation of mere “sliding of 
beads” implies the trivial change 

8a°(X) = h(X)^-, (13.33) 


where h(X) is an arbitrary function of position along the world line (“more sliding 
here than there”). Already knowing that this operation cannot change r, one is 
guaranteed that the integrand in (13.30) must vanish when one inserts (13.33) for 
8a a \ and must vanish, moreover, whatever choice is made for the arbitrary “magni¬ 
tude of slide” factor / 7 (A). This requirement implies and demands that the scalar 
product f a da°/dX must automatically vanish; or, otherwise stated, 

fo^r = 0 . (13.34) 


The argument applies, and this equation holds, whether one is or is not dealing 
with an optimal world line. An equation of this type, valid whether or not the world 
line is an allowable track for a free test particle (track of extremal lapse of proper 
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time), is known as an identity. Equation (13.34), an important identity in the realm 
of spacetime geodesics, is an appropriate forerunner for the Bianchi identities of 
Chapter 15: the most important identities in the realm of spacetime curvature. 

The freedom that exists to “slide A-values along the world line” can be exploited 
to replace the arbitrary parameter X by the physically more interesting parameter 
of proper time itself, 

^ T dcfi da* 1 1/2 ^ 

d 7 = [- s »-d\i\\ (,3 - 35) 


Figure 13.2. 

Star oscillating back and forth through the plane of a disc galaxy, as an example of a situation where 
two events (I and $ can be connected by more than one geodesic. Upper left: The galaxy seen edge-on, 
showing (dashed line) the path of the star in question, referred to a local frame partaking of and comoving 
with the general revolution of the nearby “disc stars.” Upper right: The effective potential sensed by 
the star, according to Newtonian gravitation theory, is like that experienced by a ball which rolls down 
one inclined plane and up another (“free fall toward galactic plane” with acceleration g = \ in the units 
used here). The three central frames: Possible and impossible world lines for the star connecting two 
given events d (plane of galaxy at t = 0) and $ (plane of galaxy at t = 2). Right: Throw star up from 
the galactic plane with enough velocity so that it just gets back to the plane at t = 2. Left: Throw it 
up with half the velocity and it will come back in half the time (very contrary to behavior of a simple 
harmonic oscillation, but in accord with galaxy’s v-shaped potential!), thus being able to make two 
excursions in the allotted time between d and Center: A conceivable world line (conceivable with 
rocket propulsion!) but not a geodesic. Bottom: Comparison of these and any other paths that allow 
themselves to be approximated in the form 


z — a x sin (777/2) + a 2 sin (2 t77/2). 


Here the two adjustable parameters, a x and a 2 , provide the coordinates in a two-dimensional “function 
space” (approximation to the infinite-dimensional function space required to depict all conceivable world 
lines connecting^ and s8; note comparison in right center frame between one-term Fourier approximation 
and exact, parabolic law of free fall; similarly in left center frame, where the two curves agree too closely 
to be shown separate on the diagram). Details: In the context of general relativity, take an arbitrary 
world line that connects d and $, evaluate lapse of proper time, repeat for other world lines, and say 
that a given world line represents a possible motion (“geodesic”) when for it the proper time is an 
extremum with respect to all nearby world lines. In the Newtonian approximation, the difference between 
the lapse of proper time and the lapse (t$ — t#) of coordinate time is all that comes to attention, in the 
form of the “action integral” (on a “per-unit-mass basis”) 


K kinetic\ _ /potential 
energy/ \ energy 


■-/; 


)]* 


(maximum, or other extremum, in the proper time implies minimum, or corresponding other extremum, 
in the action I). The integration gives 

I = (tt 2 ^/8) _ (4|flj|/7r) q- (m 2 a\/2) 

for \a 2 \ < \ Jfljl (one-excursion motions), and for |tf 2 |> \ l fl il (two-excursion motions), 

/ = (7r 2 flf/8) + {ir 2 al/2) — (4\a 2 \/ir) — (al/ir\a 2 \). 

The one-excursion motion minimizes the action (maximizes the lapse of proper time). The two-excursion 
motion extremizes the action but does not minimize it (“saddle point”; “mountain pass” in the topogra¬ 
phy). Choquard (1955) gives other examples of problems of mechanics where there is more than one 
extremum. Morse (1934) and Morse and Cairns (1969) give a theorem connecting the number of saddles 
of various types with the numbers of maxima and minima (“critical-point theorem of the calculus of 
variations in the large”). 
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*2 



Figure 13.3. 

Extremizing lapse of proper time by suitable choice of world line. Left: Spacetime; and world line 
F that extremizes the lapse of proper time r from d to # compared to other world lines. The specific 
world lines depicted in the diagram happen to be distinguished from fiducial world line G by two “Fourier 
amplitudes” a x and a 2 : 

= a x sin (ttA) + a 2 sin (2 ttA), 

where the arbitrary scaling of A, and its zero, are so adjusted that \(d) = 0, A($) = 1. 

Right: “Path space.” The coordinates in this space are the Fourier amplitudes a x and a 2 . Only these 
two amplitudes (“two dimensions”) are shown out of what in principle are infinitely many amplitudes 
(“infinite-dimensional path space”) required to represent the general timelike world line connecting (I 
and f0. Any given contour curve runs through all those points (in path space) for which the corresponding 
world lines (in spacetime) rack up the indicated lapse of proper time t. Foregoing description is classical; 
according to quantum mechanics, all the timelike world lines connecting d and $ occur with the same 
probability amplitude (“principle of democracy of histories”) with the only difference from one to another 
being the phase of this complex probability amplitude exp (- irm/h) (m = mass of particle, h = quantum 
of angular momentum). In the sum over these probability amplitudes, however, destructive interference 
wipes out the contributions from all those histories which differ too much from the optimal or classical 
history (“Fresnel wave zone”; “Feynman’s principle of sum over histories”; see Feynman and Hibbs, 
1965). Capitalizing on this wave-mechanical background to show how the machinery of the physical 
world works, Box 25.3 spells out the Hamilton-Jacobi method (“short-wavelength limit of quantum 
mechanics ) for determining geodesics, a method considerably more convenient for most applications 
than the usual “second-order differential equations for geodesics” (equation 10.27). 


Focus on a specific world line, A), with all deformations of it gone from 

view; one may replace a^(A) by x^(A) everywhere. Then the differential equations 
(13.32) for an extremal world line reduce to 


d 2 x v 


(^Suv . ^Sofl 


| dx^ dx v 

V dx* dx- 

dx a ) 

’ dr dr 


= 0 . 


(13.36) 
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As an aside, note that the identity (13.34) now follows by one differentiation (with 
respect to r) of the equation 

dx° dx\ x= 0 (13.37) 

dr dr 


Thus the identity is to be interpreted as saying that 4-velocity and 4-acceleration 
are orthogonal for any world line, extremal or not. Now return to (13.36), raise an 
index with g l3a , and thereby bring the equation for a straight line of local Lorentz 
geometry into the form 


+ — ==o. 

dr 2 S 2 V 9x" dx" dx° J dr dr 


(13.38) 


Compare with the standard form of the equation for a geodesic in “premetric 
geometry,” 

d 2 x p r s dx^d£__ o (13.39) 

d \ 2 d\ d\ 


Conclude that the geodesics of the premetric geometry will agree with the straight 
lines of the local Lorentz geometry if and only if two conditions are satisfied: (1) 
the 40 connection coefficients r 13 ^ that define geodesics, covariant derivatives, and 
parallel transport must be given in terms of the 10 metric coefficients g^ v ( Einstein 
gravitation potentials”) by the equations (13.22) and (13.23) previously derived; and 
(2) the geodesic parameter \ must agree with the proper time r up to an arbitrary 
normalization of zero point and an arbitrary but constant scale factor, thus 


\ = ar + b. 


(Nothing in the formalism has any resemblance whatsoever to the universal time 
t of Newton “flowing everywhere uniformly”; rather, there is a separate proper time 
r for each geodesic). See Box 13.3 for another variational principle, which gives in 
one step both the extremal world line and the right parametrization on that line. 

With this step, one has completed the transfer of the ideas of curved-space 
geometry from a foundation based on geodesics to a foundation based on metric. 
The resulting geometry always and everywhere anchors itself to the principle of “local 
Lorentz character,” as the geometry of Newton-Cartan never did and never could. 


Exercise 13.5. ONCE TIMELIKE, ALWAYS TIMELIKE 

Show that a geodesic of spacetime which is timelike at one event is everywhere timelike. 
Similarly, show that a geodesic initially spacelike is everywhere spacelike, and a geodesic 
initially null is everywhere null. [Hint: This is the easiest exercise in the book!] 

(continued on page 324 ) 


EXERCISES 
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Box 13.2 "GEODESIC" VERSUS "EXTREMAL WORLD LINE" 


Once the connection coefficients T’" have been 
expressed in terms of Einstein’s gravitational po¬ 
tentials g Ml , by the equations (13.22) and (13.23), 
as they are now and hereafter will be in this book 
(“Riemannian or metric geometry”), it is permis¬ 
sible and appropriate to subsume under the one 
word “geodesic” two previously distinct ideas: (1) 
a parametrized world line that satisfies the geo¬ 
desic equation 

d 2 x a . p a dx ** dx v _ q 
d \ 2 ^ dX d\ ~ ’ 

and (2) a world line that extremizes the proper 
time (or, if spacelike, a curve that extremizes the 
proper distance) between two events d and f9. The 
one possible source of confusion is this, that (1) 


presupposes a properly parametrized curve (as was 
essential, for example, in the Schild’s ladder con¬ 
struction employed for parallel transport in Chap¬ 
ter 10), whereas (2) cares only about the course 
of the world line through spacetime, being in¬ 
different to what parametrization is used or 
whether any parametrization at all is introduced. 
This is not to deny the possibility of “marking in 
afterward” along the extremal curve the most nat¬ 
ural and easily evaluated of all parameters, the 
proper time itself, whereupon the extremal curve 
of (2) satisfies the geodesic equation of (1). Am¬ 
biguity is avoided by insisting on proper para¬ 
metrization: henceforth the word “curve” means 
a parametrized curve, the word “geodesic” means 
a properly parametrized geodesic. 


j 


\ 

Box 13.3 "DYNAMIC" VARIATIONAL PRINCIPLE FOR GEODESICS 

If the principle of extremal length 

c* r dx* dx'- I 1 '* 

,= J, [-^sr-dTi ^ = “ remum <» 

is indifferent to choice of parametrization [“^A” canceling out in (1)] and if the 
geodesic equation finds the proper parametrization a matter of concern, it is appro¬ 
priate to search for another extremal principle that yields in one package both the 
right curve and the right parameter. By analogy with elementary mechanics, one 
expects that an equation of motion [the geodesic equation 

d 2 x fi /d \ 2 + (dx a /d\)(dxP/d\) = 0 ] 

whose leading term has the form “jc” can be derived from a Lagrangian with leading 
term “^x 2 ” (“kinetic energy”; “dynamic” term). The simplest coordinate invariant 
generalization of \x 2 is 




(dx> i /dX)(dx l '/dX). 
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Thus one is led to try, in place of the “geometric” principle of extremal length, a 
new “dynamic” extremal principle: 


dx> 1 dx v 
dX dX 


1 f* 

= 2 l « 

r ® / dx ° \ 

= J L^x a ,-jj-JdA = extremum 


(2) 


(replacement of square root in previous variational principle by first power). The 
condition for an extremum, here as before [equations (13.30) to (13.32)] is annulment 
of the so-called Euler-Lagrange “functional derivative” 


( 3 ) 


or, written out in full detail, 
d 2 x” 1 


go 


dX ‘ 2 


SI _ 

. /coefficient of Sx a in\ 

Sx a ~ 

\ the integrand of Si) 

dL 

d 

0 L 

CD 

X 

Q 

1 

dX s( 

dx°y 


\ 

. dx ) 

( d gov 

1 ^gofi 

dg^Adx^ dx” 

\dx^ 

dx" 

dx° ) dX dX 


= 0 : 


or, after multiplication by the reciprocal metric, 


d 2 x a 


r+ gOO _ 

dX 2 + 8 2 


1 (*hL 

\ dx* 


d gop 


| dx^ 

dx” 

dx” 

dx a ) 

' dX 

dX 


= 0; 


which translates into the geodesic equation 


d 2 x a , ra dx^ 1 dx" n 

+ 1 „„—r-7T- — 0. 


dX 2 


dX dX 


( 4 ) 


( 5 ) 


( 6 ) 


Thus, the new “dynamic” expression (2) is indeed extremal for geodesic curves— 
and, by contrast with proper length, (1), it is extremal when and only when the 
geodesic is affinely parametrized. [Its “Euler-Lagrange equations” (6) remain satisfied 
only under parameter changes X Dew = aX old + b, which keep the parameter affine; 
by contrast, the Euler-Lagrange equations (13.31) and (13.32) for the “principle of 
extremal length” (1) remain satisfied for any change of parameter whatsoever.] 




J 
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Symmetries of Riemann in 
absence of metric 


New symmetries imposed by 
metric 


Exercise 13.6. SPACELIKE GEODESICS HAVE EXTREMAL LENGTH 

Show that any spacelike curve linking two events d and iff is a geodesic if and only if it 
extremizes the proper length 


S = f (g„r dx “ dx‘) l/2 . 

J a 

[Hint: This is almost as easy as exercise 13.5 if one has already proved the analogous theorem 
for timelike geodesics.] 

Exercise 13.7. METRIC TENSOR MEASURED BY LIGHT SIGNALS AND 
FREE PARTICLES [Kuchar] 

(a) Instead of parametrizing a timelike geodesic by the proper time r, parametrize it by an 
arbitrary parameter p, 

t = F(n). 

Write the geodesic equation in the y-parametrization. 

(b) Use now the coordinate time / as a parameter. Throw out a cloud of free particles 
with different “velocities” y* = dx i /dt and observe their “accelerations” a { = d 2 x i /dt 2 . 
Discuss what combinations of the components of the affine connection r\ x one can measure 
in this way. (Assume that no standard clocks measuring t are available!) 

(c) Show that one can measure the conformal metric i.e., the ratios of the components 
of the metric tensor g lK to a given component (say, g 00 ) 

Sik ^ — ( Soo) 1 > 

using only the light signals moving along the null geodesics g lK dx l dx K = 0. 

(d) Combine now the results of (b) and (c). Assume that f l KX is generated by the metric 
tensor by (13.22), (13.23), in the coordinate frame x l . Show that one can determine A 
everywhere, if one prescribes it at one event (equivalent to fixing the unit of time). 


§13.5. METRIC-INDUCED PROPERTIES OF RIEMANN 

In Newtonian spacetime, in the real, physical spacetime of Einstein—indeed, in any 
manifold with covariant derivative—the Riemann curvature tensor has these sym¬ 
metries (exercise 11 . 6 ): 

R a p y & = ^ a 0 [ Y 5 ] (antisymmetry on last two indices) (13.40) 

R a [p y s] =0 (vanishing of completely antisymmetric part). (13.41) 

In addition, it satisfies a differential identity (exercise 11.10): 

R a p [y8 - e] = 0 (“Bianchi identity”) (13.42) 

(see Chapter 15 for geometric significance). 

When metric is brought onto the scene, whether in Einstein spacetime or elsewhere, 
it impresses on Riemann the additional symmetry (exercise 13.8) 
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Keys = R iafi]ys (antisymmetry on first two indices). (13.43) 

This, together with (13.40) and (13.41), forms a complete set of symmetries for 
Riemann-, other symmetries that follow from these (exercise 13.10) are 

Keys = R y sa 0 (symmetry under pair exchange), (13.44) 

and 

R la 0 ys] = 0 (vanishing of completely antisymmetric part). (13.45) 

These symmetries reduce the number of independent components of Riemann from 
4x4x4x4 = 256 to 20 (exercise 13.9). 

With metric present, one can construct a variety of new curvature tensors from 
Riemann. Some that will play important roles later are as follows. 

(1) The double dual of Riemann, = * Riemann* (analog of Maxwell = 

* Faraday), which has components 

= jt^KS*\ W = “ { K^sKr (13.46) 


(exercise 13.11). 

(2) The Einstein curvature tensor , which is symmetric (exercise 13.11) 


G es = G t0 . (13.47) 

(3) The Ricci curvature tensor, which is symmetric, and the curvature scalar 

**,=**»„, K S = K P ; R = R%; (13.48) 

which are related to the Einstein tensor by (exercise 13.12) 

RP s = Gl i t +±R 8 f> s . (13.49) 

(4) The Weyl conformal tensor (exercise 13.13) 

C a \s = Ral3 ys ~ 2+ J S^ySV^R. (13.50) 

The Bianchi identity (13.42) takes a particularly simple form when rewritten in 
terms of the double dual *£?: 

-G a py S ;& = 0 (“Bianchi identity”) (13.51) 

(exercise 13.11); and it has the obvious consequence 

= 0 (“contracted Bianchi identity”). (13.52) 


Chapter 15 will be devoted to the deep geometric significance of these Bianchi 
identities. 


The curvature tensor G 


Einstein tensor 

Ricci tensor 
Curvature scalar 


Weyl conformal tensor 


Bianchi identities 
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EXERCISES 


Exercise 13.8. RIEMANN ANTISYMMETRIC IN FIRST 
TWO INDICES 

(a) Derive the antisymmetry condition (13.43). [Hint: Prove by abstract calculations that any 
vector fields s, u, v, w satisfy 0 = $?(u, ir)(s • w) = s ■ [tf(u, v)w) + w[‘:A(u, ir)s|. Then 
from this infer (13.43).] 

(b) Explain in geometric terms the meaning of this antisymmetry. 


Exercise 13.9. NUMBER OF INDEPENDENT COMPONENTS 
OF RIEMANN 


(a) In the absence of metric, a complete set of symmetry conditions for Riemann is R* fiyt = 
^ a /8[ys] ancl ^"[/jys] = 0. Show that in four-dimensional spacetime these reduce the number 
of independent components from 4x4x4x4 = 256 to4x4x6-4x4 = 96- 
16 = 80. 


(b) Show that in a manifold of n dimensions without metric, the number of independent 
components is 


n 3 (n-\) n 2 (n-l)(n-2) n 2 (n 2 - 1) 

- 2 -6-= “S--• (13-53) 

(c) In the presence of metric, a complete set of symmetries is R a/3yS = R [a/3nys] , and 

= 0. Show that in four-dimensional spacetime, these reduce the number of inde¬ 
pendent components to 6 x 6 — 4 x 4 = 36 - 16 = 20. 

(d) Show that in a manifold of n dimensions with metric, the number of independent 
components is 

I”«(« — 1) I 2 n 2 (n — l)(n — 2) n 2 (n 2 - 1) 

L 2 J-6-= 12 ~ ( 13 ' 54 > 

Exercise 13.10. RIEMANN SYMMETRIC IN EXCHANGE OF PAIRS; 

COMPLETELY ANTISYMMETRIC PART VANISHES 

From the complete set of symmetries in the presence of a metric, R a0yS = A [a(8][yi] and 
= °- derive: (a) symmetry under pair exchange, R afiyS = R ySa0! and (b)°vanishing 
of completely antisymmetric part, « MyS] = 0. Then (c) show that the following form a 
complete set of symmetries: 

RafiyS = ^[a/3][yS] = R[a/3y&] = 0* (13.55) 

Exercise 13.11. DOUBLE DUAL OF RIEMANN ; EINSTEIN 

(a) Show that « = * Riemann * contains precisely the same amount of information as 
(l ^ ?45^] n, ^ and satisfies precisely the same set of symmetries [(13.40), (13.41), (13.43) to 

(b) From the symmetries of «, show that Einstein [defined in (13.47)] is symmetric 
= 0 ). 

(c) Show that the Bianchi identities (13.42), when written in terms of -G , take the form 
(13.51) (“vanishing divergence,” ▼ * S =0). 

(d) By contracting the Bianchi identities ▼ • G = 0, show that G = Einstein has vanishing 
divergence [equation (13.52)]. 

Exercise 13.12. RICCI AND EINSTEIN RELATED 

(a) From the symmetries of Riemann , show that Ricci is symmetric = 0). 

(b) Show that Ricci is related to Einstein by equation (13.49). 
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Exercise 13.13. THE WEYL CONFORMAL TENSOR 

(a) Show that the Weyl conformal tensor (13.50) possesses the same symmetries [(13.40), 
(13.41), (13.43) to (13.45)] as the Riemann tensor. 

(b) Show that the Weyl tensor is completely “trace-free”; i.e., that 


contraction of C a/3y5 on any pair of slots vanishes. (13.56) 


Thus, C a/3yS can be regarded as the trace-free part of Riemann , and R afi can be regarded 
as the trace of Riemann . Riemann is determined entirely by its trace-free part C a/9y5 and 
its trace R afi [see equation (13.50), and recall R — 

(c) Show that in spacetime the Weyl tensor has 10 independent components. 

(d) Show that in an rt-dimensional manifold the number of independent components of 
Weyl [defined by a modification of (13.50) that maintains (13.56)] is 


n 2 (n 2 - 1) 
12 


n(n + 1) 
2 


for n > 3, 


0 for n < 3. 


(13.57) 


Thus, in manifolds of 1, 2, or 3 dimensions, the Weyl tensor is identically zero, and the 
Ricci tensor completely determines the Riemann tensor. 


§13.6. THE PROPER REFERENCE FRAME OF 
AN ACCELERATED OBSERVER 


A physicist performing an experiment in a jet airplane (e.g., an infrared astronomy 
experiment) may use several different coordinate systems at once. But a coordinate 
system of special utility is one at rest relative to all the apparatus bolted into the 
floor and walls of the airplane cabin. This “proper reference frame” has a rectangular 
“x, y, £” grid attached to the walls of the cabin, and one or more clocks at rest 
in the grid. That this proper reference frame is accelerated relative to the local 
Lorentz frames, the physicist knows from his own failure to float freely in the cabin, 
or, with greater precision, from accelerometer measurements. That his proper refer¬ 
ence frame is rotating relative to local Lorentz frames he knows from the Coriolis 
forces he feels, or, with greater precision, from the rotation of inertial-guidance 
gyroscopes relative to the cabin walls. 

Exercise 6.8 gave a mathematical treatment of such an accelerated, rotating, but 
locally orthonormal reference frame in flat spacetime. This section does the same 
in curved spacetime. In the immediate vicinity of the spatial grid’s origin x> = 0 
(region of spatial extent so small that curvature effects are negligible), no aspect 
of the coordinate system can possibly reveal whether spacetime is curved or flat. 
Hence, all the details of exercise 6.8 must remain valid in curved spacetime. Never¬ 
theless, it is instructive to rediscuss those details, and some new ones, using the 
powerful mathematics of the last few chapters. 

Begin by making more precise the coordinate grid to be used. The following is 
perhaps the most natural way to set up the grid. 

(1) Let t be proper time as measured by the accelerated observer’s clock (clock at 
center of airplane cabin in above example). Let ‘f = !? 0 (t) be the observer’s world 
line, as shown in Figure 13.4,a. 


Proper reference frame 
described physicalfy 


Six-step construction of 
coordinate grid for proper 
frame 
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Figure 13:4. 

The proper reference frame of an accelerated observer. Diagram (a) shows the observer’s orthonormal 
tetrad x e d } being transported along his world line $> 0 (t) [transport law (13.60)]. Diagram (b) shows 
geodesics bristling out perpendicularly from an arbitrary event $> 0 (4) on the observer’s world line. Each 
geodesic is specified uniquely by (1) the proper time r at which it originates, and (2) the direction (unit 
tangent vector n = d/ds = n’e, along which it emanates). A given event on the geodesic is specified 
by r, n, and proper distance s from the geodesic’s emanation point; hence the notation 

& = ^[t, n, 5 ] 

for the given event. The observer’s proper reference frame attributes to this given event the coordinates 

n, j]) = T, 

x } X&[t, n, j]) = snK 

(2) The observer carries with himself an orthonormal tetrad {e-} (Figure 13.4,a), 
with 

e o — u = dVJdr ~ (4-velocity of observer) (13.58) 

(«6 points along observer’s “time direction”), and with 



(orthonormality). 


e a-*0 = Vafi 


(13.59) 
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(3) The tetrad changes from point to point along the observer’s world line, relative 
to parallel transport: 


= ~Oe a , (13.60) 

Qliv = a fi u v _ ui L a v + ujatf**** (13.61) 

= “generator of infinitesimal Lorentz transformation.” 

This transport law has the same form in curved spacetime as in fiat (§6.5 and 
exercise 6.8) because curvature can only be felt over finite distances, not over 
the infinitesimal distance involved in the “first time-rate of change of a vector” 
(equivalence principle). As in exercise 6.8, 


a = ▼„!# = (4-acceleration of observer), 


UJ — 


angular velocity of rotation of spatial 
basis vectors e- relative to Fermi- 
Walker-transported vectors, i.e., 

\relative to inertial-guidance gyroscopes, 


u * a ~ u ' w = 0. 


(13.62) 


If uj were zero, the observer would be Fermi-Walker-transporting his tetrad 
(gyroscope-type transport). If both a and w were zero, he would be freely falling 
(geodesic motion) and would be parallel-transporting his tetrad, = 0. 

(4) The observer constructs his proper reference frame (local coordinate system) in 
a manner analogous to the Riemann-normal construction of §11.6. From each 
event 9 0 (t) on his world line, he sends out purely spatial geodesics (geodesics 
orthogonal to u = d9Jdrr\ with affine parameter equal to proper length, 

(13.63) 

proper length along 
geodesic from starting 
point; tells “where” 
on geodesic 


(See Figure 13.4,b.) The tangent vector has unit length, because the chosen affine 
parameter is proper length: 


proper time; tells ‘ 
“starting point” of 
geodesic 


9 = ^[t, ii, s], 

_It L 


tangent vector to 
geodesic at starting 
point; tells “which” 
geodesic 


n = (3<?/0s) s = o ; n = (dx^/ds) along geodesic, 



(13.64) 


(5) Each event near the observer’s world line is intersected by precisely one of the 
geodesics S[t, n, s]. [Far away, this is not true; the geodesics may cross, either 
because of the observer’s acceleration, as in Figure 6.3, or because of the curva¬ 
ture of spacetime (“geodesic deviation”).] 



Transport law for observer's 
tetrad 
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( 6 ) Pick an event near the observer’s world line. The geodesic through it originated 
on the observer’s world line at a specific time t, had original direction n = 

and needed to extend a distance 5 before reaching i j > . Hence, the four numbers 

(**\ x \ x*, jt^) = (t, sn 2 , sn (13.65) 

are a natural way of identifying the event <?. These are the coordinates of 
in the observer’s proper reference frame. 

(7) Restated more abstractly, 


x 0 (<f [t, /?, s]) = r, 

*V[T, /», 5]) = 5/7> = = 5/7 * ej. 


(13.65') 


In flat spacetime this construction process and the resulting coordinates x a (*P) are 
identical to the process and resulting coordinates £ a '(^) of exercise 6 . 8 . 

For use in calculations one wants not only the coordinate system, but also its metric 
coefficients and connection coefficients. Fortunately, and are needed only 
along the observer’s world line, where they are especially simple. Only a foolish 
observer would try to use his own proper reference frame far from his world line, 
where its grid ceases to be orthonormal and its geodesic grid lines may even cross! 
(See §6.3.) 

All along the observer’s world line ^ 0 (t), the basis vectors of his coordinate grid 
are identical (by construction) to his orthonormal tetrad 


3/Sx® = (13.66) 

and therefore its metric coefficients are 


Connection coefficients along 
observer's world line 


gefi = V afi all along <? 0 (r). (13.67) 

Some of the connection coefficients are determined by the transport law (13.60) 
for the observer’s orthonormal tetrad: 


v */ e i = V 6 e 5 = 


a0 


Thus 

Since O has 


= -Oet = -e 

r^a() = —8*a a11 alon g W- (13.68) 

the form (13.61) and the observer’s 4-velocity and 4-acceleration have 


components 1/5 — — 1 , uj = 0 , a§ = 0 in the observer’s own proper frame, these 
connection coefficients are 


— ^000 “** 

~ “ I'm = ;66 — + 7^66 = 


all along ^ 0 (t). (13.69a) 
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The remaining connection coefficients can be read from the geodesic equation for 
the geodesics S[r, n, s] that emanate from the observer’s world line. According to 
equation (13.65), the coordinate representation of each such geodesic is 


x & ($) = r = constant, x’(s) = n’s ; 

* 

hence, d 2 x a /ds 2 = 0 all along the geodesic, and the geodesic equation reads 

0 = _|_ pa j-a n J n k 

ds 2 ^ h ds ds * * 


This equation is satisfied on the observer’s world line for all spatial geodesics (all 
n } ) if and only if 

fa ik = faifc = 0 all along $> 0 (r). (13.69b) 


The values (13.69) of the connection coefficients determine uniquely the partial 
derivatives of the metric coefficients [see equation (13.19')]: 


Safi ,6 - °> 

<?66J 2#*, 


8jk t l = 0, 
gohfc = " c 6ftl"' 


all along 0 > o (r); 


(13.70) 


and these, plus the orthonormality condition ga^P 0 (r)] = y\ afii imply that the line 
element near the observer’s world line is 


ds 2 = —(1 + 2 a*xf)dx^ 2 — 2 (c^|X^cj^) dx® dx } 

+ Sfidx’dx^ + 0(\x*\ 2 ) dx a dx&. (13.71) 

Several features of this line element deserve notice, as follows. 

(1) On the observer’s world line f? 0 (r)—i.e., x* = 0 —ds 2 = y\ a ^ dx a dx&. 

(2) The observer’s acceleration shows up in a correction term to # 55 , 

$g66 = —2a*x, (13.72a) 

which is proportional to distance along the acceleration direction. For the flat-space¬ 
time derivation of this correction term, see § 6 . 6 . 

(3) The observer’s rotation relative to inertial-guidance gyroscopes shows up in 
a correction term to gp p which can be rewritten in 3 -vector notation 

8goj e } = -* X = +<o X x. (13.72b) 

(4) These first-order corrections to the line element are unaffected by spacetime 
curvature and contain no information about curvature. Only at second order, 
0( |x J '| 2 ), will curvature begin to show up. 

(5) In the special case of zero acceleration and zero rotation (a = oj = 0), the 
observer's proper reference frame reduces to a local Lorentz frame ( g ^ = y\ a ^ 
r a £y = 0) all along his geodesic world line! By contrast, the local Lorentz coordinate 


Metric of proper reference 
frame, and its physical 
interpretation 




' J ' J * 13 - WEMANNIAN GEOMETRY: METRIC AS FOUNDATION OF AU 

systems constructed earlier in the book (“general” local Lorentz coordinates of §8.6, 
Riemann normal coordinates” of § 11.6) are local Lorentz only at a single event. 
In the case of zero rotation and zero acceleration, one can derive the following 
expression for the metric, accurate to second order in \x*\: 

ds 2 = (- \ - R^^x^dt 2 - {^R^^x^dtdx* 

/ i a (13.73) 

+ ~ y R t!fc xt x m J dx l dx J + 0( jx*| 3 ) dx« dx & 

[sec, e.g., Manasse and Misner (1963)]. Here are the components of the 

Riemann tensor along the world line x? = 0. Such coordinates are called “Fermi 
Normal Coordinates.” 


EXERCISES 


Exarcis* 13.14. INERTIAL AND CORIOLIS FORCES 

An accelerated observer studies the path of a freely falling particle as it passes through the 
origin of his proper reference frame. If 


v = (dx'/dx 6 )*) (13.74) 

is the particle’s ordinary velocity, show that its ordinary acceleration relative to the observer’s 
proper reference frame is 


dx& 

[inertial acceleration! 


tt = —a -2«x v + 2 (a • v)r. 

J i w, 


(13.75) 


[Coriolis acceleration] 


relativistic correction 
to inertial acceleration 


Here a is the observer’s own 4-acceleration, and « is the angular velocity with which his 
spatial basis vectors m -are rotating [see equations (13.62)]. [Hint: Use the geodesic equation 
at the point x‘ — 0 of the particle’s trajectory. Note: This result was derived in flat spacetime 
in exercise 6.8 using a different method.] 


Exercise 13.15. ROTATION GROUP: METRIC 

(Continuation of exercises 9.13, 9.14, 10.17 and 11.12). Show that for the manifold SO( 3) 
of the rotation group, there exists a metric g that is compatible with the covariant derivative 
T. Prove existence by exhibiting the metric components explicitly in the noncoordinate basis 
of generators {•„}. [Answer: 


Safi = (13.76) 

Restated in words: If one postulates that: (1) the manifold of the rotation group is locally 
Euclidean; (2) the generators of infinitesimal rotations {•„} are orthonormal, •„••„ = <5 ■ 
and (3) {•„} obey the standard rotation-group commutation relations * S 

-« a/Sr * r ; (13.77) 

then the resulting geodesics of SO( 3) agree with the geodesics chosen in exercise 10.17.] 




§14.1. CURVATURE AS A TOOL FOR 
UNDERSTANDING PHYSICS 

Elementary physics sometimes allows one to shortcircuit any systematized calculation 
of curvature (frequency of oscillation of test particle; tide-producing acceleration 
near a center of attraction; curvature of a closed 3-sphere model universe; effect 
of parallel transport on gyroscope or vector; see Figures 1.1, 1.10, and 1.12, and 
Boxes 1.6 and 1.7); but on other occasions a calculation of curvature is the quickest 
way into the physics. This chapter is designed for such occasions. It describes three 
ways to calculate curvature and gives the components of the Einstein curvature tensor 
for a plane gravitational wave (Box 14.4, equation 5), for the Friedmann geometry 
of the universe (Box 14.5), and for Schwarzschild geometry, both static (exercise 
14.13) and dynamic (exercise 14.16). These and other calculations of curvature 
elsewhere are indexed under “curvature tensors.” 

It is enough to look at an expression for a 4-geometry as complicated as 

ds 2 = —(x/3 1/2 L + //12L 2 r 3I/2 ( f {- Z /Lf~ in dfi 

+ (*/ 3 1/2 L + f/\2L 2 f+* in { f + ( _ z/I) -i t 3— dx 2 

+ 2/31/2 (- z/L)-^ df 

+ (x/3 in L + y 2 /\2l?f +31/8 ^ J +2/31/2 ( —z/L)~ 2 ~ 3 ~ 1/2 X 

x (rfe7l)* 2 


This chapter is entirely Track 2. 

Chapter 4 (differential forms) 
and Chapter 10, 11, and 13 
(differential geometry) are 
necessary preparation for 
§§14.5-14.6. 

This chapter is needed as 
preparation for Chapter 15 
(Bianchi identities). 

It will be helpful in many 
applications of gravitation 
theory (Chapters 23-40). 
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Situations in which one must 
compute curvature 


"Standard procedure" for 
computing curvature 


Methods of displaying 
curvature formulas 


Computation of curvature 
using a computer 


EXERCISES 


[Harrison (1959)] to realize that one might understand the physical situation better 
if one knew what the curvature is; similarly with any other complicated expressions 
for metrics that arise from solving Einstein’s equations or that appear undigested 
in the literature. In any such case, the appropriate method often is: curvature first, 
understanding second. 

Curvature is the simplest local measure of geometric properties (see Box 14.1). 
Curvature is therefore a good first step toward a more comprehensive picture of 
the spacetime in question. 

One sometimes has an expression for a spacetime metric first, and then makes 
calculations of curvature to understand it. But more often one makes calculations 
of curvature, subject to specified conditions of symmetry in space and time, as an 
aid in arriving at an expression for a physically interesting metric (stars, Chapters 
23 to 26; model cosmologies, Chapters 27 to 30; collapse and black holes, Chapters 
31 to 34; and gravitational waves, Chapters 35 to 37). 

The basic “standard procedure for computing curvature” is illustrated in Box 14.2. 
Two formulas in Box 14.2, derived previously, are used in succession. The first 
(equations 1 and 2) has the form r ~~ gdg and provides the The other 

(equation 3) has the form R ~~ dT + T 2 and gives the curvature components R 

After the curvature components have been computed, there are helpful ways to 
present the results. (1) Form the Ricci tensor R^ v = R a fxa y an d the scalar curvature 
R = R%. (2) Form other invariants such as R^ a(i R a \ v . (3) Form components 
in a judiciously chosen orthonormal frame iu a = dx& 9 and (4) display R [ ^\^] 
as a 6 x 6 matrix ^(in four dimensions; a 3 x 3 matrix in three dimensions) where 
[fip] = [01], [02], [03], [23], [51], [12] labels the rows and [afi] labels the columns 
(exercises 14.14 and 14.15). (5) Last, but by far the most important for general 
relativity, form the Einstein tensor G^ as described in §14.2. 

The method of computation outlined above and described in more detail in Box 
14.2 is used wherever it is quicker to employ a standard method than to learn or 
invent a better method. The standard method is always preferable for the student 
in a short course where physical insight has higher priority than technical facility. 
It is, however, a dull method, better suited to computers than to people. Fvetuthe 
algebra can be handled by a computer (see Box 14.3). 


Exercise 14.1. CURVATURE OF A TWO-DIMENSIONAL HYPERBOLOID 

Compute the curvature of the hyperboloid t 2 - * 2 - f = T 2 = const in 2 + 1 Minkowski 
spacetime with ds 3 2 = — dt 2 + dx 2 + dy 2 . First show that intervals within this two-dimen¬ 
sional surface can be expressed in the form ds 2 = T 2 (da 2 + sinh 2 a d^> 2 ) by a suitable choice 
of coordinates a, <£, on the hyperboloid. __ 

Exercise 14.2. RIEMANNIAN CURVATURE EXPRESSIBLE IN TERMS OF 
RICCI CURVATURE IN TWO AND THREE DIMENSIONS 

In two dimensions, there is only one independent curvature component, R 1212 . Evidently 
the single scalar quantity R must carry the same information. The two-dimensional identity 
Rfivap = i Rigfiaivp — gtipgra) established by noting that it is the only tensor formula giving 

(<continued on page 343) 
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Box 14.1 PERSPECTIVES ON CURVATURE 

1. Historical point of departure: a curved line on 
a plane. There is no way to define the curvature 
of a line by measurements confined to (“intrinsic 
to”) the line itself One needs, for example, the 
azimuthal bearing 6 of the tangent vector relative 
to a fixed direction in the plane, as a function of 
proper distance 5 measured along the curve; thus, 

6 — 0(s). Then curvature k and its reciprocal, the 
radius of curvature p, are given by k(s) = 

1/pCs) = d6{s)/ds. Alternatively, one can examine 
departure, y , measured normally off from the tan¬ 
gent line as a function of distance x measured 
along that tangent line; then k = 1/p = d 2 y/dx 2 , 

2. This concept was later extended to a curved surface embedded in flat (Euclidean) 
3-space. Departure, z, of the smooth curved surface from the flat surface tangent 
to it at a given point is described in the neighborhood of that point by the quadratic 
expression 

z = — ax 2 -f bxy -f — cy 2 . 

Rotation of the axes by an appropriate angle a, 

x = | cos a + 7] sin a, 
y = — £ sin a + r] cos a, 



reduces this expression to 


z = + jK 2 'n 2 > 

with 

k i = i/pi 

and 

k 2 = l/p 2 

representing the two “principal curvatures” of the surface. 

3. Gauss (1827) conceived the idea of defining curvature by measurements confined 
entirely to the surface ( society of ants”). From a given point on the surface, 
proceed on a geodesic on the surface for a proper distance c measured entirely within 
the surface. Repeat, starting at the original point but proceeding in other directions. 
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Box 14.1 (continued) 


Obtain an infinity of points. They define a “circle”. Determine its proper circum¬ 
ference, again by measurements confined entirely to the surface. Using the metric 
corresponding to the embedding viewpoint 

ds 2 = dz 2 + d£ 2 + dq 2 (Euclidean 3-space) 

( metric intrinsic^ 
to the curved 1 , 
2-geometry / 

one can calculate the result of such an “intrinsic measurement.” One calculates that 
the circumference differs from the Euclidean value, 2irc, by a fractional correction 
that is proportional to the square of <; specifically, 


Lim — 

t —>0 



circumference 
2 ire 


) 


= KlK 2 = 


1 

P1P2 



Note especially the first equality sign. Gauss did not conceal the elation he felt on 
discovering that something defined by measurements entirely within the surface 
agrees with the product of two quantities, Kj and k 2 , that individually demand for 
their definition measurements extrinsic to the surface. 


4. The contrast between “extrinsic” and “intrin¬ 
sic” curvature is summarized in the terms, 

(extrinsic curvature) = k = (k 1 + K 2 )(cm _1 ), 

( intrinsic or Gaussian\ 

curvature / = K i lc 2 ( an 2 ) 

(the latter being identical with half the scalar cur¬ 
vature invariant, R, of the 2-geometry). Draw a 
3:4:5 triangle on a flat piece of paper; then curl 
up the paper. The Euclidean 2-geometry intrinsic 
to the piece of paper is preserved by this bending. 
The Gaussian curvature intrinsic to the surface 
remains unaltered; it keeps the Euclidean value 
of zero (k 2 , non-zero; k 1; zero; product, k,k 2 = 
zero). However, the extrinsic curvature is changed 
from Kj + k 2 = 0 to a non-zero value, k, + 
k 2 54 0 . 








5. The curvature dealt with in this chapter is curvature intrinsic to spacetime; that 
is, curvature defined without any use of, and repelling every thought of, any em¬ 
bedding in any hypothetical higher-dimensional flat manifold (concept of Riemann, 
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Clifford, and Einstein that geometry is a dynamic participant in physics, not some 
God-given perfection above the battles of matter and energy). 

6. The curvature of the geometry of spacetime imposes curvature on any spacelike 
slice (3-geometry; “initial-value hypersurface”) through that spacetime (see “relations 
of Gauss and Codazzi” in Chapter 21, on the initial-value problem of geometro- 
dynamics). 

7. Rotation of a vector transported parallel to 

itself around a closed loop provides a definition ^ 

of curvature as useful in four and three as in two f 

dimensions. (In a curved two-dimensional geome- __ 

try, at a point there is only one plane. Conse- 

quently only one number is required to describe / ilflfoP^^L 

the Gaussian curvature there. In three and four / 

dimensions, there are more independent planes / 

through a point and therefore more numbers are / 

required to describe the curvature.) In the dia- k 

gram, start with a vector at position 1 (North Pole). \ 

Transport it parallel to itself (position 2,3,...) \ ? ' 

around a 90 o -90°-90° spherical triangle. It arrives \ 

back at the starting point (position 4) turned N. 

through 90°: --— 

( ( angle tumed\ 

_ through / ( w / 2) 1 

curvature/ /area circum-\ ~ (1/8X4 tia 2 ) ~ a 2 
'navigated / 

(positive; sense of rotation same as sense of cir¬ 
cumnavigation). 

8. Still staying for simplicity with a curved two- 
dimensional manifold, describe the curvature of 
the 2-surface as a 2-form (“box-like structure”) 
defined over the entire surface. The number of 
boxes enclosed by any given route gives im¬ 
mediately the angle in radians (or tenths or hun¬ 
dredths of a radian, etc., depending on chosen 
fineness of subdivision) turned through by a vector 
carried parallel to itself around that route. The 
contribution of a given box is counted as positive 
or negative depending on whether the sense of the 
arrow marked on it (see magnified view) agrees 
or disagrees with the sense of circumnavigation of 
the route. 
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Box 14.1 (continued) 



9. Curvature 2-form for the illustrated surface of rotational symmetry (“pith hel¬ 
met”) with metric ds 2 = do 2 + r 2 (p) d<t> 2 is 

1 d 2 r 

curvature =- r^do A rd<b (1) 

r do * 

(positive on crown of helmet, negative around brim, as indicated by sense of arrows 
in the “boxes of the 2-form” shown at left). “Meaning” of r is illustrated by imbed¬ 
ding the surface in Euclidean 3-space, a convenience for visualization; but more 
important is the idea of a 2-geometry defined by measurements intrinsic to it, with 
no embedding. 


10. How lengths (“metric”) determine curvature in quantitative detail is shown 
nowhere more clearly than in this two-dimensional example, a model for “what is 
going on behind the scene” in the mathematical calculations done in this chapter 
with 1-forms and 2-forms in four-dimensional spacetime. 

a. Net rotation in going around element of surface d$®dd is 8 — 8 (no turn of 
vector to left or to right in its transport along a meridian dd or $$). 

b. Rotation of vector in going from d to relative to coordinate system (directions 
of meridians), is 


(angle 8) = 


arc 

length 


r(o + do) A<j> — r(o) Aj> 



do 



CURVATURE AS A TOOL FOR UNDERSTANDING PHYSICS 


339 


c. Rotation of vector in going from d to SB is similarly 

(angle 5) = Aj>. 

' 'a + Ja 

d. Thus net rotation is: 

e. Expressed as a form, this gives immediately equation (1). 

f. Ideas and calculations are more complicated in four dimensions, primarily 
because one has to deal with different choices for the orientation of the surface 
to be studied at the point in question. 

11. Translation of these geometric ideas into the language of forms is most immediate 
when one stays with this example of two dimensions. A sample vector A' = (A 1 , A 2 ) 
carried around the boundary of an element of surface comes back to its starting 
point slightly changed in direction: 



A\AK 


( 2 ) 


a. To be more specific, it is convenient to adopt as the basis 1-forms cu 1 = da 
and w 2 = r d<p, and have A 1 as the component of A along the direction of 
increasing a, A 2 as the component of A along the direction of increasing <j>. 
The matrix is a rotation matrix, which produces a change in direction but 
no change in length (zero diagonal components); thus here 


ll# J ;ll = 


o a \ 

-Ah 0 


(3) 


In this example, A. ^ evidently represents the angle through which the vector 
A turns on transport parallel to itself around the element of surface. 

b. So far the rotation is “indefinite” because the size of the element of surface 
has not yet been specified. It is most conveniently conceived as an elementary 
parallelogram, defined by two vectors (“bivector”). Thus A*j, or, specifically, 
the one element that counts, A\ (the “angle of rotation”), has to be envisaged 
as a mathematical object (“2-form”) endowed with two slots, into which these 
two vectors are inserted to get a definite number (angle in radians). In the 
example of the pith helmet, one has, from equation (1) 


a\ = 


1 d 2 r 
r do 2 


w 1 A u> 2 . 


(4) 


Thus the in the text are called “curvature 2-forms.” 
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Bo* 14.1 (continued) 

c. The text tells one how to read out of such expressions the. components of the 
Riemann curvature tensor; for example here, 

— — Rl &\ = (-1 /rXcPr/do 2 ) (coefficients of uj' 1 A au 2 or u/ 2 A a/ 5 ). 

d. Generalizing to four dimensions, one understands by R a /3/1P the factor that one 
has to multiply by three numbers to obtain a fourth. The number obtained 
is the change (with reversed sign) that takes place in the ath component of 
a vector when that vector is transported parallel to itself around a closed path, 
defined, for example, by a parallelogram built from two vectors u and v. The 
factors that multiply R a Pll , are (1) the component of the vector .4 in the /?th 
direction and (2,3) the pv component of the extension of the parallelogram, 
(u^v" — iAv"). Thus 

SA a = - tAu"). 


Bo* 14.2 STRAIGHTFORWARD CURVATURE COMPUTATION 
(Illustrated (or a Globa) 

The elementary and universally applicable method for computing the components 
R>l “nf 3 of the Riemann curvature tensor starts from the metric components g v in 
a coordinate basis, and proceeds by the following scheme: 

„ R-dr + rr 


The formulas required for these three steps are 


"* 2 \ dx* + dx* dx‘ )’ 


= g*r^. 


R \ap = 


. + r* r p » 

' pa 1 *0 


r* 

P0 1 per 


The metric of the two-dimensional surface of a sphere of radius a is 

ds 2 = (P’idd 2 + sin 2 0 d<p 2 ). ( 4 ) 

To compute the curvature by the standard method, use the formula for ds 2 as a 
table of g k} values. It shows that g„ = a 2 , g H = 0, g„ = a 2 sin 2 *. Compute the six 
possible different f fkl = f jlk (there will be 40 in four dimensions) from formula 
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(1). Thus 


Raise the first index: 


r »H> = - a2sin 9 cos e = -r^ 9 , 
^ 888 ~ ~ 0, 

R 88 $ ~ r '$#8 ~ 0 * 

ra H> = -sin (9 cos (9, 
r *i* - cotfl, 

r “«8 = r%* = o = r*„ = r v 


( 5 ) 


( 6 ) 


Choose a suitable curvature component (one that is not automatically zero by reason 
of the elementary symmetry R^ afi = R lM) , nor previously computed in another 
form, as by R 9 „ al3 = R^^). In this two-dimensional example, there is only one 
choice (compared to 21 such computations in four dimensions); it is 


pe _ ^*00 

0*0 - —T ~ d<j) 


+ r\ e n^ - r\^r\ 9 


a r< 


de 


- ~ o + o - r^r\ e 


— sin 2 # — cos 2 0 + sin 0 cos 0 cot 0; 


so 


or 


R \ a <t, = sin 2 0 


R‘*.* = A. 


(7) 

( 8 ) 

(9) 

( 10 ) 

(H) 

More generally one writes o/“ = L a p dx To transform the curvature tensor to 
orthonormal components in this simple but illuminating example of a diagonal 
metric requires a single normalization factor for each index on a tensor. Thus 
v e = av e , v* = a sin 0 v*, v B = a~ l v B , v^ = (a sin 0)~ 1 u 0 . Similarly, from R* 
sin-0 one finds the components of the curvature tensor, 


Contraction gives the components of the Ricci tensor, 

R \ = R \ = ^ 2 ’ R \ = 0 , 
and further contraction gives the curvature scalar 

R = 2/a 2 . 

A convenient orthonormal frame in this manifold is 

w e — add, w* = a sin 0 d<j>. 


000 — 


— 1 — 
<t>0<> - a 2 “ 


00 ’ 


in the orthonormal frame. 


( 12 ) 
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Box 14.3 ANALYTICAL CALCULATIONS ON A COMPUTEFt 


Research in gravitation physics and general rela¬ 
tivity is sometimes beset by long calculations, re¬ 
quiring meticulous care, of such quantities as the 
Einstein and Riemann curvature tensors for a 
given metric, or the divergence of a given stress- 
energy tensor, or the Newman-Penrose tetrad 
equations under given algebraic assumptions. 
Such calculations are sufficiently straightforward 
and deductive in logical structure that they can be 
handled by a computer. Since 1966, computers 
have been generally taking over such tasks. 

There are several computer languages in which 
the investigator can program his analytic calcula¬ 
tions. The computer expert may wish to work in 
a machine-oriented language such as LISP [see, 
e.g., the work of Fletcher (1966) and of Hearn 
(1970)]. However, most appliers of relativity will 
prefer user-oriented languages such as REDUCE 
[created by Hearn (1970) and available for the 
IBM 360 and 370, and the PDP 10 computers], 
ALAM [created by DTnverno (1969) and available 
on Atlas computers], CAMAL [created by Barton, 
Bourne, and Fitch (1970) and available on Atlas 
computers], and FORMAC [created by Tobey et 
al (1967) and available on IBM 7090, 7094, 360, 
and 370]. For a review of activity in this area, see 
Barton and Fitch (1971). Here we describe only 
FORMAC. It is the most widely available and 
widely used of the languages; but it is probably 
not the most powerful [see, e.g., D’Invemo (1969)]. 
FORMAC is to analytic work what the earliest and 
most primitive versions of FORTRAN were to 
numerical work. 

FORMAC manipulates algebraic expressions 
involving: numerical constants, such as 1/3; sym¬ 
bolic constants, such as x or u ; specific elementary 
functions, such as sin (u) or exp (x); and symbolic 
functions of several variables, such as /(x, u) or 
g(u). For example, it can add ax 4* bx 2 to 2x 4- 
(3 + b)x 2 and get (a 4 * 2)x 4* (3 4 * 2 b)x 2 ; it can 
take the partial derivative of x 2 uf(x , u) 4* cos (x) 
with respect to x and get 

2xw/(x, u) 4 * x 2 u 0/(x, u)/dx — sin (x). 

It can do any algebraic or differential-calculus 


computation that a human can do—but without 
making mistakes! Unfortunately, it cannot inte¬ 
grate analytically; integration requires inductive 
logic rather than deductive logic. 

PL/1 is a language that can be used simulta¬ 
neously with FORMAC or independently of it. 
PL/1 manipulates strings of characters—e.g., 
“Z/l x29— + /.” It knows symbolic logic; it can 
tell whether two strings are identical; it can insert 
new characters into a string or remove old ones; 
but it does not know the rules of algebra or differ¬ 
ential calculus. Thus, its primary use is as an ad¬ 
junct to FORMAC (though from the viewpoint of 
the computer system FORMAC is an adjunct of 
PL/1). 

FORMAC programs for evaluating Einstein’s 
tensor in terms of given metric components and 
for doing other calculations are available from 
many past users [see, e.g., Fletcher, Clemens, 
Matzner, Thome, and Zimmerman (1967); Ernst 
(1968); Harrison (1970)]. However, programming 
in FORMAC is sufficiently simple that one ordi¬ 
narily does not have difficulty creating one’s own 
program to do a given task. If a difficulty does 
arise, it may be because the analytic computation 
exhausts the core of the computer. It is easy to 
create an expression too large to fit in the core of 
any existing computer by several differentiations 
of an expression half a page long! 

Users of FORMAC, confronted by core¬ 
exhaustion, have devised several ways to solve 
their problems. One is to remove unneeded parts 
of the program and of the FORMAC system from 
the core. Routines called PURGE and KILL have 
been developed for this purpose by Clemens and 
Matzner (1967). Another is to create the answer 
to a given calculation in manageable-sized pieces 
and output those pieces from the computer’s core 
onto its disk. One must then add all the pieces 
together—a task that is impossible using 
FORMAC alone, or even FORMAC plus PL/1, 
but a task that James Hartle has solved [see 
Hartle and Thome (1974)] by using a combination 
of FORMAC, PL/1, and IBM data-manipulation 
routines called SORT. 
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^ as a li near function of R, constructed from R and the metric alone, and with the correct 
contracted value = R. Establish a corresponding three-dimensional identity expressing 
R HkL in terms of the Ricci tensor R jk and the metric. 

Exercise 14.3. CURVATURE OF 3-SPHERE IN ORTHONORMAL FRAME 

Compute the curvature tensor for a 3-sphere 

ds 2 = a 2 [d\ 2 + sin 2 x(d0 2 + sin 2 0 dtp 2 )] (14.2) 

or for a 3-hyperboloid 

ds 2 = a 2 [dx 2 + sinh 2 x(d8 2 + sin 2 0 dtp 2 )]. ( 14 . 3 ) 

Convert the coordinate-based components R' jkl to a corresponding orthonormal basis, R'w 
Display R'\i = R 1 ®^ as a 3 x 3 matrix with appropriately labeled rows and columns. 
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The distribution of matter in space does not immediately tell all details of the local 
curvature of space, according to Einstein. The stress-energy tensor provides infor¬ 
mation only about a certain combination of components of the Riemann curvature 
tensor, the combination that makes up the Einstein tensor. Chapter 13 described 
two equivalent ways to calculate the Einstein tensor: (1) by successive contractions 
of the Riemann tensor 


R = g^R 


fJLV 9 


<v = R w ~tSu V R 


(14.4) 


Three ways to compute the 
Einstein tensor from the 
Riemann tensor 


[equations (13.48) and (13.49)]; (2) by forming the dual of the Riemann tensor and 
then contracting: 


fV 8 = (***V s 
<V = 


f A /?!/**! fpays 

|pa[ e 

— S P<7y8 n R^ V I 

afijiv ** fper |’ 


(14.5a) 

(14.5b) 


[equations (13.46) and (13.47)]. A third method, usually superior to either of these, 
is discovered by combining equations (14.5a,b): 


<V = G*, = -»•»,„* (14.6) 

[Note: in any frame, orthonormal or not, the permutation tensor 8 8pa n has compo¬ 
nents ^ r 


gSpcr _ o 5per _ 

U 0 fil> — O 01LV ~ 


+ 1 if 8po is an even permutation of 
— 1 if Spo is an odd permutation of ftpp, 
. 0 otherwise; 
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to see this, simply evaluate using definition (3.50h) and using the components 

(8.10) oft a 0 M , and « p ® r4 .] Equation (14.6) for the Einstein tensor, written out explic¬ 
itly, reads 

G °0 = ~{R n n + R 23 23 + * 31 3i)> 

G\ = -(R 02 02 + * 03 03 + R 2 \z\ 

(14.7) 

G°1 = + R°\ 3 > 

G 1 2 = 7? 10 20 4 * ^ 13 23 , 

and every other component is given by a similar formula, obtainable by obvious 
permutations of indices. 


Standard method of 
computing curvature is 
wasteful 


Ways to avoid "waste": 


(1) geodesic Lagrangian 
method 


(2) method of curvature 
2-forms 


§14.3. MORE EFFICIENT COMPUTATION 

If the answer to a problem or the result of a computation is not simple, then there 
is no simple way to obtain it. But when a long computation gives a short answer, 
then one looks for a better method. Many of the best-known applications of general 
relativity present one with metric forms in which many of the components g^ 
and R^ Pa0 are zero; for them the standard computation of the curvature (Box 14.2) 
involves much “wasted” effort. One computes many that turn out to be zero. 
One checks off many terms in a sum like — that are zero, or cancel with 

others to give zero. Two alternative procedures are available to eliminate some of 
this “waste.” The “geodesic Lagrangian” method provides an economical way to 
tabulate the rfl a0- The method of “curvature 2-forms” reorganizes the description 
from beginning to end, and computes both the connection and the curvature. 

The geodesic Lagrangian method is only a moderate improvement over the 
standard method, but it also demands only a modest investment in the calculus of 
variations, an investment that pays off in any case in other contexts in the world 
of mathematics and physics. In contrast, the method of curvature 2-forms is efficient, 
but demands a heavier investment in the mathematics of 1 -forms and 2 -forms than 
anyone would normally find needful for any introductory survey of relativity. Anyone 
facing several days’ work at computing curvatures, however, would do well to learn 
the algorithm of the curvature 2 -forms. 


§14.4. THE GEODESIC LAGRANGIAN METHOD 

One normally thinks that the connection coefficients T > a/3 must be known before 
one can write the geodesic equation 


X* + r» a0 x a xP = 0. (14.8) 

(Here and below dots denote derivative with respect to the affine parameter, X.) 
However, the argument can be reversed. Once the geodesic equations have been 
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written down, the connection coefficients can be read out of them. For instance, 
—on the 2-sphere as treated in Box 14.2, the geodesic equations are 

__ 9 — sin 6 cos 0 <t> 2 = 0, (14.90) 

+ 2 cot 0 <t>9 = 0. — (14.9<J>) 

The first equation here shows that = -sin 0 cos 0; the second equation shows 

that r*^ = r* 9</> = cot 0; and the absence of any further terms shows that all other 
r i ik are zero. 

The first essential principle is thus dear: an explicit writing out of the geodesic 
equation is equivalent to a tabulation of all the connection coeffitients /’m 
T he second principle says more: one can write out the geodesic equation without 
ever having computed the r» aj3 . In order to arrive at the equations for a geodesic 
(see Box 13.3), one need only recall that a geodesic is a parametrized curve that 
^ extr ernizes^th^mcgraT^ * ^ — 


7 = y fgt'i'x’dX 


(14.10) 


in the sense 


SI = 0. 

In practical applications of this variational principle, the first sfepistorewrite 
equation (14.10) in the simplest possible form, inserting the specific values of g v for 
the problem at hand. If one’s interest-attaches to the geodesics themselves, one"can 
recognize many constantsOfmotion even without carrying out any variations (see 

_Chapter 25^on'geodesic motion in Schwarzschild geometry, especially §25.2 on 

conservation laws and constants of motion). For the purpose of computing the r* 
one proceeds to vary each coordinate in turn, obtaining four equations. Next these 
equations are rearranged so that their leading terms are Jc". In this form they must 
be precisely the geodesic equations (14.8). Consequently, the are immediately 
available as the coeffi cient s in thesejour^equations. FoTthe final step in computing 
by-thns-metirod, one returns to the standard method and to formulas of 
the type R~~dT + rr, treated in the standard way (Box 14.2); and as the need 
arises for each r in turn, one scans the geodesic equation to find it. The procedure 
is best understood by following an example: Box 14.4 provides one. 


Geodesic Lagrangian method 
in 4 steps: 

(1) write / in simple form 


(2) vary / to get geodesic 
equation 


(3) read off r a fiy 

(4) compute R a fiyS etc. by 
standard method 


Exercise 14.4. EINSTEIN EQUATIONS FOR THE CLOSED FRIEDMANN 
UNIVERSE CALCULATED BY USING THE GEODESIC 
LAGRANGIAN METHOD 

The line element of interest here is (see Chapter 27) 


EXERCISE 


d?2 _ ~dt 2 + cf(t) [dft + S j n 2 x (<$2 + s j n 20 ^2)| 

(continued on page 348) 
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— 

Box 14.4 GEODESIC LAGRANGIAN METHOD SHORTENS SOME 
CURVATURE COMPUTATIONS 

Aim: Compute the curvature for the line element 

ds 2 = L 2 {e 2 & dx 2 -f e~ 2 & dy 2 ) — 2 dudv (1) 

where L and ft are functions of u only. [This metric is discussed as an example of 
a gravitational wave in §§35.9-35.12.] 

Method: Obtain the r* a p from the geodesic equations as inferred from the varia¬ 
tional principle (14.10), then compute R — dT 4- r 2 as in Box 14.2. 

Step 1. State the variational integral. For the met ri c amd er^onskieration^equation 
(14.10) requires 81 = D for 

I = J L 2 (^x 2 + e^y 2 ) — ui>j d\. (2) 

A world line that extremizes this integral is a geodesic. 

Step 2: Vary the coordinates of the world line, one at a time, in their dependence 

on X. First vary x(X\ keeping fixed the functions ^(X), u(X), and v(X). Then 

81 = J (U^x) Sx dX = — J (L 2 ^x)‘ Sx dX. 

The requirement that SI = 0 for this variation (among others) gives _ 

0 = (LV^jc)- = L 2 ^x + xu^-iL 2 ^). 

ou 

Varying y, w, u, in- the same way gives 

0 = (L 2 e~ 20 y) * = L 2 e~ 2 Py + yu (L 2 e ~ 2 &), 

0 = v + | ** (L 2 e*f>) +^rfi~ (L z e ~ 2f> ), 

2 du 2 au 

0 = a. 

Step 3: Rearrange to get x M leading terms. If this step is not straightforward, this 
method will not save time, and the technique of either Box 14.2 or Box 14.5 will 
be more suitable. In the example here, one quickly writes, using a prime for 3/3 u. 


0 = x + 2(Zr 1 Z/ + P')xu, (3*) 

0 = y + 2(L~'L'- P')yu, ( 3 /) 

0 = v + (LV^XIT 1 !/ + P')x 2 + (LV-^XZr 1 !/ - P')f, (3i>) 

0 = u. (3m) 
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Step 3': Interpret these equations as a tabulation of 7’a Equations (3) are the 
standard equations for a geodesic, 

x* + r\ fi x a x^ = 0 . 

Therefore it is enough to scan them to find the value of any desired r. For instance 
r x yu must appear in the coefficient (r* yu + r x uy ) = 2r x yu of the yu term in the 
equation for x . But no yu term appears in equation (3 jc). Therefore r x yu is zero 
in this example. Note that equations (3) are simple, in the sense that they contain 
few terms; therefore most of the r* a p must be zero. For instance, it follows from 
equation (3«) that all ten r u B are zero. The only non-zero r *s are r* — T x = 
( LL ' + P') from equation (3.x), r\ u = r\ y = (L _1 L' - p r ) from equation (3 y), 
and r v xx and r v yy from equation (3v). 

Step 4: Compute each R 11 ^, etc. There is little rehef from routine in systematically 
applying equation (3) from Box 14.2. One must list 21 components R* taf} that are 
not related by any of the symmetries R^ = R ajiiiP = -R^ 0a , and compute each. 
In the example here, one notes that r\ p = 0 implies R\ py = -R vaPy = 0. There¬ 
fore 15 of the list of 21 vanish at one swat. The list then is: 


Kapy = ~R\py = 0, 

R u XUX = -R\ ux = -(f v XI r + r v xx r* xu 

_ /r \ 1 . f\n 


= -(IV*) ( 


7 ~ + P" + 2-j 

J-i L 


-p' + p' 2 ), 


D _ _ D V _ Cj 

£K uxxy xxy — "j 

R u XV u = -~R\yu = 0. (4) 

R uyuy = RV yuy = ~{ RV yy)' + RV yy RV yu 

= -(L 2 e~ 2p )^~- — P" - 2-j-P' + £' 2 ), 

f? _ _ DV _ f) 

lx uyxy A yxy — 

R xy X y = (L 2 e^)R x yxy = 0 . 

One can now calculate the Einstein tensor via equation (14.7). In the example here, 
however, it is equally simple to form first the Ricci tensor by the straightforward 
contraction R^^p. Only g = * and [i = y give any contribution, because no super¬ 
script index can be a w, and no subscript a v . Thus one finds 

R uu = -2 [L-i£" + p’% 

all other R aQ =0, ^ 


R = 0. 


From this last result, it follows that here the desired Einstein tensor is identical with 
the Ricci tensor. 
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Concepts needed for method 
of curvature 2-forms 


(a) Set up the variational integral (14.10) for a geodesic in this metric, then successively 
var y X* and to obtain, after some rearrangement, four equations 0 = i ’ + • .0 = 
X + • * *» etc. displaying the s in the form of equation (14.8). 

(b) Use this display as a table of s to compute R‘ Xfir and of which only R< xtx 

and R x e xe are non-zero (consequence of the complete equivalence of all directions tangent 
to the \G<f> sphere). 

(c) Convert to an orthonormal frame with w t = <//, tu* = a = ?, = ?, and list 

R tx tx anc * Explain why all other components are known by symmetry in terms of 

these two. 

(d) Calculate, using equations (14.7), all independent components of the Einstein ten¬ 
sor G^;. [Answer: See Box 14.5.] 


§14.5. CURVATURE 2-FORMS 

In electrodynamics the abstract notation 

F = dA 

saves space compared to the explicit notation 

_ _ dA 1 dA 3 

31 ~ 0* 3 0 a : 1 ’ 

F _ dA 2 dA l 

12 0JC 1 0JC 2 ’ 

..., etc. (six equations); 

there is no reason to shun similar economies in dealing with the dynamics of 
geometry. Cartan introduced the decisive ideas, seen above, of differential forms 
(where a simple object replaces a listing of four components; thus, a = 
and of the exterior derivative d . He went on (1928,1946) to package the 21 compo¬ 
nents of the curvature tensor into six curvature 2-forms, 


Regarded purely as notation, these 2-forms automatically produce a profit. They 
cut down the weight of paper work required to list one’s answer after one has it. 
They also provide a route into deeper insight on “curvature as a geometric object,” 
although that is not the objective of immediate concern in this chapter. 

Cartan’s exterior derivative d automatically effects many cancelations in the 
calculation of curvature. It often cancels terms before they ever need to be evaluated. 

Extension of Cartan’s calculus from electromagnetism and other applications 
(Chapter 4) to the analysis of curvature (this chapter) requires two minor additions 
to the armament of forms and exterior derivative: (1) the idea of a vector-valued 
(or tensor-valued) exterior differential form; and (2) a corresponding generalization 
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of the exterior derivative d. This section uses both these tools in deriving the key 
formulas (14.18), (14.25), (14.31), and (14.32). Once derived, however, these formulas 
demand no more than the standard exterior derivative for all applications and for 
all calculations of curvature (§14.6 and Box 14.5). 

The extended exterior derivative leads to nothing new in the first two contexts 
to which one applies it: a scalar function (“ 0 -form”) and a vector field (“vector-valued 
0-form ). Thus, take any function f. Its derivative in an unspecified direction is a 
1 -form; or, to make a new distinction that will soon become meaningful a “scalar- 
valued 1-form.” Specify the direction in which differentiation is to occur (“fill in 
the slot in the 1-form”). Thereby obtain the ordinary derivative as it applies to a 
function 

(df,u) = d u f. (14.11) 

Next t ’ take an y vect or field v. Its covariant derivative in an unspecified direction 
is a “vector-valued 1-form.” Specify the direction u in which differentiation is to 
occur (“fill in the slot in the 1-form”). Thereby obtain the covariant derivative 

<dv,u) = y u v. (14.12a) 

This object too is not new; it is the covariant derivative of the vector v taken in 
the direction of the vector u. When one abstracts away from any special choice of 
the direction of differentiation u, one finds an expression that one has encountered 
before, though not under its new name of “vector-valued 1-form.” This expression 
measures the covariant derivative of the vector v in an unspecified direction (“slot 
for direction not yet filled in”). From a look at (14.12a), one sees that this extended 
exterior derivative is applied to v, without reference to u, is 

dv = (14.12b) 

Similarly, for any “tensor-valued 0-form” [i.e. (g) tensor] S, dS = VS. 

Before proceeding further with the exterior (soon to be marked as “antisymmetric”) 
differentiation of tensors, write down a formula (see exercise 14.5) for the exterior 
(antisymmetric) derivative of a product of forms: 

d(a A p) = (da) A p + (- l)»a A dp, (14.13a) 

where a is a p-form and p is a ^-form. 

Now extend the exterior derivative from elementary forms to the exterior product 
of a tensor-valued />-form S with any ordinary < 7 -form, p, thus, 

d(S A p) = dS A p + (-l)PS A dp. (14.13b) 

This equation can be regarded as a general definition of the extended exterior 

derivative. For example, if S is a-tensor-valued 2-form, S = «,„</*? a dx» 

then equation (14.13b) says 

dS = d[(e a o l3 S a f> mt )(dxV A </**)] = d(e a e p S a ^ yil ) A (dx* A dx»). 


Extended exterior derivative 

(1) acting on a scalar 

(2) acting on a vector 


(3) defined in general 
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Definition of w ti v 


As another example, use (14.13b) to calculate d(ua), where u is a vector-valued 
0 -form (vector) and a is a scalar-valued 1-form (1-form): 

d{ua) — (du) Aa + t/ da. 

If one were following the practice of earlier chapters, one would have written u ® a 
where ua appears here, u ® da instead of u da , and e a ® ep instead of e a ep. 
However, to avoid overcomplication in the notation, all such tensor product symbols 
are omitted here and hereafter. 

Equations (14.12) and (14.13) do more than define the (extended) exterior deriva¬ 
tive d and provide a way to use it in computations. They also allow one to define 
and calculate the antisymmetrized second derivatives, e.g., d 2 v. The relation 

d 2 v = <9lv 

where v is a vector will then introduce the “operator-valued” or “(J)-tensor valued” 
curvature 2-form The notation of the extended exterior derivative puts a new 
look on the old apparatus of base vectors and parallel transport, and opens a way 
to calculate the curvature 2-form 

Let the vector field v be expanded in terms of some field of basis vectors 
thus 

v = e^v^. 

Then the exterior derivative of this vector is 

dv = de^v* 1 + du*. 

Expand the typical vector-valued 1-form de in the form 

de M = e v w\. (14.14) 

Here the “components” uj v A in the expansion of de are 1-forms. Recall from 
equation (10.13) that the typical uu v A is related to the connection coefficients by 

u)\ = r\ x cv x . (14.15) 

Therefore the expansion of the “vector” (really, “vector-valued 1-form”) is 

dv — e^dv 11 + cu fi v v v ). (14.16) 

Now differentiate once again to find 

d 2 v = de a A ( dv a + w a p v 9 ) 

+ e li {d 2 v ti -f duj fi v v v — cu fi v A dv v ) 

— e tl (cu fl a A dv a -f iu fi a A uj a v v v 
-f d 2 v^ -f dcu fi v v v — cu fi a A dv a ). 

The simplifications made here use (1) the equation (14.14), for a second time; and 
(2) the product rule (14.13a), which introduced the minus sign in the last term, ready 
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to cancel the first term. Now consider the term d 2 v». Recall that any given compo¬ 
nent, for example, v 3 , is an ordinary scalar function of position (as contrasted to 
v or e 3 or e 3 v 3 ). Therefore the standard exterior derivative (Chapter 4) as applied 
to a scalar function is all that d can mean in d 2 v^. But for the standard exterior 
derivative applied twice, one has automatically d^ = 0 (Box 4.1, B; Box 4.4). This 
circumstance reduces the expansion for d 2 v to the form 


d 2 v = efi* v v\ (14.17) 

where the are abbreviations for the curvature 2-forms 

^ + uj* a A tv a v . (14.18) 

Ordinarily, equation (14.18) surpasses in efficiency every other known method for 
calculating the curvature 2-forms. 

The remarkable form of equation (14.17) deserves comment. On the left appear 
two d’s, reminders that one has twice differentiated the vector field v. But on the 
right, as the result of the differentiation, one has only the vector field v at the point 
in question, undifferentiated. How v varies from place to place enters not one whit 
in the answer. All that matters is how the geometry varies from place to place. Here 
is curvature coming into evidence. It comes into evidence free of any special features 
of the vector field v, because the operation d 2 is an antisymmetrized covariant 
derivative [compare equation (11.8) for this antisymmetrized covariant derivative 
in the previously developed abstract language, and see Boxes 11.2 and 11.6 for what 
is going on behind the scene expressed in the form of pictures]. In brief, the result 
of operating on v twice with d is an algebraic linear operation on v, thus, 

d 2 v — Glv. (14.19) 

Here $ is an abbreviation for the “(J)-tensor valued 2-form,” 

S? = ® ui*€l* r . (14.20) 

If d is a derivative with a “slot in it” in which to insert the vector saying in what 
direction the differentiation is to proceed, then the d 2 w of d 2 w = $.w has two slots 
and calls for two vectors, say, u and v. These two vectors define the plane in which 
the antisymmetrized exterior derivative of (14.19) is to be evaluated (change in w 
upon going around the elementary route defined by u and v and coming back to 
its starting point; Boxes 11.6 and 11.7). To spell out explicitly this insertion of vectors 
into slots, return first to a simpler context, and see the exterior derivative of a 1-form 
(itself a 2-form) “evaluated” for a bivector u A v (“count of honeycomblike cells 
of the 2-form over the parallelogram-shaped domain defined by the two vectors 
u and v”), and see the result of the evaluation (exercise 14.6) expressed as a com¬ 
mutator, 


Curvature 2-forms SI 1 **: 

(1) in terms of d 2 v 

(2) in terms of a>** 


Tensor-valued curvature 
2-form SL 


(da, u A v> = 9 u <cr, v) - 3„<a, u > - (a, [u, v]>. 


(14.21) 
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This result generalizes itself to a tensor-valued 1-form S of any rank in an obvious 
way; thus, 

<</S, u A v} = T„<S, v> - ▼„<$, i/> - <S, [i#, v]). (14.22) 

Apply this result to the vector-valued 1-form S = €fw. Recall the expression for 
a directional derivative, {dw,u) = V^iv. Thus find the result 

{d 2 w, u A v) = ▼„ y v w - Y„ ▼„ w - T M nr 


Relation of curvature 2-form 
&L to curvature operator 


where $(c/, v) is the curvature operator defined already in Chapter 11 [equation 
(11.8)]. The conclusion is simple: the (J)-tensor-valued 2-form $ of (14.19), evaluated 
on the bivector (“parallelogram”) u A v, is identical with the curvature operator 
$(«/, v) introduced previously; thus 


<#, u A v} — <31 (c/, v). (14.24) 

Now go from the language of abstract operators to a language that begins to make 
components show up. Substitute on the left the expression (14,20) and on the right 
the value of the curvature operator from (11.11); and rewrite (14.24) in the form 


if A v) = ® iv v R /l paL pU a vP. 


Relation of 6L to components 

of Riemann 


Compare and conclude that the typical individual curvature 2-form is given by the 
formula 

A (14.25) 


(sum over a, /?, restricted to a < /?; so each index pair occurs only once). 

Equation (14.25) provides the promised packaging of 21 curvature components 
into six curvature 2-forms; and equation (14.18) provides the quick means to calcu¬ 
late these curvature 2-forms. It is not necessary to take the key calculational equations 
(14.18) on faith, or to master the extended exterior derivative to prove or use them. 
Not one mention of any d do they make except the standard exterior d of Chapter 
4. These key equations, moreover, can be verified in detail (exercise 14.8) by working 
in a coordinate frame. One adopts basis 1-forms cu a = dx a . One goes on to use 
= r* vX dx x from equation (14.15). In this way one obtains the “standard 
formula for the curvature” [equation (11.12) and equation (3) of Box 14.2] by 
standard methods. 

In summary, the calculus of forms and exterior derivatives reduces the 


calculation to the 


r* 


a(S 


R* 


vafi 




computation. Now look at the other link in the chain that leads from metric to 
curvature. It used to be 




r* 


a/3* 
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It now reduces to the calculation of “connection 1-forms”; thus 

g*, — i ► u>\- 

Two principles master this first step in the curvature computation: (1) the sy mm etry 
of the covariant derivative; and (2) its compatibility with the metric. Condition (1), 
symmetry, appears in hidden guise in the principle 

d 2 ? = 0. (1426) 

Here the notation “9 for point” comes straight out of Cartan. He thought of a vector 
as defined by the movement of one point to another point infinitesimally close to 
it. To write d3> was therefore to take the “derivative of a point” [make a construction 
with a point deleted (tail of vector) and “point reinserted nearby” (tip of vector)]. 
The direction of the derivative d in df is indefinite. In other words, df contains 
a “slot.” Only when one inserts into this slot a definite vector v does df give a 
definite answer for Cartan’s vector. What is that vector that df then gives? It is 
v itself. “The movement that is v tells the point 9 to reproduce the movement that 
is v”; or in concrete notation, 

(dP, v> = if. (1427) 

Put the content of this equation into more formalistic terms. The quantity df is 
a (J)-tensor 

d& = e^ut*. ( 1428 ) 

It is distinguished from the generic (J)-tensor 

T = ej\ui” 

by the special value of its components 

= s v 

In this sense it deserves the name of “unit tensor.” Insert this tensor in place of 
S into equation (14.22) and obtain the result 

{d 2 9, u A v> = Y„ v - \u - [*#, v] = 0. (14.29) 

The zero on the right is a restatement of equation (10.2a) or of “the closing of the 
vector diagram” in the picture called “symmetry of covariant differentiation” in Box 
10.2. The vanishing of the righthand side for arbitrary u and v demands the vanish¬ 
ing of d 2 9 on the left; and conversely, the vanishing of d 2 9 demands the symmetry 
of the covariant derivative. The other principle basic to the forthcoming computa¬ 
tions is “compatibility of covariant derivative with metric,” as expressed in the form 

d(u • v) = (du) • if + u • (dv). (14.30) 

It is essential here to ascribe to the metric (the “dot”) a vanishing covariant deriva- 
tive; thus 


Symmetry of covariant 
derivative: 

(1) expressed as d 2 & = 0 


</(•) = 0 . 
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(2) expressed as 
duj^ + a/% A w v = 0 


Compatibility of g and Y 
expressed as 

rfsv = «v + 


Method of curvature 2-forms 
in 4 steps: 

(1) select metric and frame 

(2) calculate connection 
1-forms w ll v 


Capitalize on the symmetry and compatibility of the covariant derivative by using 
basis vectors (and where appropriate the basis 1-forms dual to these basis vectors) 
in equations (14.26) and (14.30). Thus from 

. d<3> = e^cu* 1 


compute 

0 = d 2c Jp = de^ A (v* -f e^dcu* 1 
= e/a/% A uj v + tfo/%, 

and conclude that the coefficient of e M must vanish; or 

0 = devf* -f a/% A uj v (“symmetry”). (14.31a) 

Next, into (14.30) in place of the general u and v insert the specific and e„, 
respectively, and find 

d S»v = “V + w rn (“compatibility”), (14.31b) 

where 

= Sm wCL v = r v .va uja - (14.31c) 

In equations (14.31) one has the connection between metric and connection forms 
expressed in the most compact way. 


§14.6. COMPUTATION OF CURVATURE USING 
EXTERIOR DIFFERENTIAL FORMS 

The use of differential forms for the computation of curvature is iUustrated in Box 
14.5. This section outlines the method. There are three main steps: compute a/%; 
compute $%; and compute G%. More particularly, first select a metric and a frame. 
Thereby fix the basis forms u)* = L%, dx a and the metric components g^ in 
ds 2 = ® cv v . Then determine the connection forms a/%, and determine them 

uniquely, as solutions of the equations 

0 = dev^ -f o/% A cu v , (14.31a) 

dg^ - cv^ + «V (14.31b) 

The “guess and check” method of finding a solution to these equations (described 
and illustrated in Box 14.5) is often quick and easy. [Exercise (14.7) shows that a 
solution always exists by showing that the Christoffel formula (14.36) is the unique 
solution in coordinate frames.] It is usually most convenient to use an orthonormal 
frame with g^ v = t ) fiv (or some other simple frame where g^ = const, e.g., a null 
frame). Then dg^ — 0 and equation (14.31b) shows that uj^ — — w vti . Therefore 
there are only six ev ^ for which to solve in four dimensions. 


(continued on page 358 ) 
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Box 14.5 CURVATURE COMPUTED USING EXTERIOR DIFFERENTIAL FORMS 
(METRIC FOR FRIEDMANN COSMOLOGY) 

The Friedmann metric 

ds 2 — — dt 2 -f a 2 (t)[dx 2 + sin 2 x(dd 2 + sin 2 8 d<j> 2 )] 

(Box 27.1) represents a spacetime where each constant-/ hypersurface is a three- 
dimensional hypersphere of proper circumference 2ira{t). An orthonormal basis is 
easily found in this spacetime; thus, 

ds 2 = — {(v 1 ) 2 + (ujx ) 2 -f (ujO ) 2 -f (u>*) 2 , 

where 

a/* — dt, 
iv* = a dx, 

u) 6 — a sin x dO, (1) 

tv * — a sin x sin 6 d<j>. 


A. Connection Computation 

Equation (14.31b) gives, since dg = di = 0, just 

«V = -«v (2) 

so there are only six 1-forms cv^ to be found. Turn to the second basic equation 
(14.31a). The game now is to guess a solution (because this is so often quicker than 
using systematic methods) to the equations 0 = dev* -f u) ti v A uj v in which the uj v 
and thus also du)* are known, and cu fi v are unknown. The solution cu^ v is known 
to be unique; so guessing (if it leads to any answer) can only give the right answer. 
Proceed from the simplest such equation. From cv* = dt, compute 

dev 1 = 0. 

Compare this with dtv 1 = — ut^ A tv* or (since a/* t - = — cv # = 0, byiv^ — —cu Vfi ) 

dev * = —<v\ A cv k = 0. 

This equation could be satisfied by having iv\ oc cv k , or in more complicated ways 
with cancelations among different terms, or more simply by cv\ = 0. Proceed, not 
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Box 14.5 ( continued) 


looking for trouble, until some non-zero uj^ v is required. From uj x = a find 


dot x — a dt A dx 

- 0 a/a))cu t A cu x = -(d/a)cu x A wK 


Compare this with 


dcu x — — uj x A 
\ a 


= —u)*i A a>‘ — ujk-q A lu b — ujx^ A a/A 

Guess that a/* t * = ( a/a)ujx from the first term; and hope the other terms vanish. 
(Note that this allows a/‘* A a>* = A lu* = Aai* = 0 in the </a>‘ 

equation.) Look at a/ ® = a sin x dQ, and write 

diu e = (a/ajw* A w s + a -1 cotx ui x A 
= — cu e { A w { - a/®- A cu* — A w*. 

Guess, consistent with previously written equations, that 

w e { = w l h = (a/a)w\ 


(o'*- = —ujx§ — a 1 cot x cu s . 


Finally from 


div* = ( a/a)u j‘ A w* + a -1 cot x A w* 

+ (a sin x) -1 cot 6 uj s A ui$ 

= -a/*- A cu £ - A ujx — A u>\ 

deduce values of uj llA-, and uj^§. These are not inconsistent with previous 
assumptions that terms like a A uj + vanish (in the duj e equation); so one has 
in fact solved duj ** = — u> fi v A uj v for a set of connection forms c as follows: 

uj k i ~ uj\ — ( a/a)uj k , 

— — uj x $ = a~ l cot x 
= cos x 

= -a;^ = a -1 cot x (3) 

= cos x sin 8 d<t>, 

UJ*§ — — (a sin x)" 1 cot 8 uj* 

— cos 8 d(j>. 

Of course, if these hit-or-miss methods of finding uj^ v do not work easily in some 
problem, one may simply use equations (14.32) and (14.33). 
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B. Curvature Computation 


The curvature computation is a straightforward substitution of uj* v from equations 
(3) above into equation (14.34), which is 

91* v = div* p + ui* a A w a v . 

This equation is short enough that one can write out the sum 

+ u>‘$ A + a >A 

in contrast to the ten terms in the corresponding R = dT + T 2 equation [equation 
(3) of Box 14.2], Warning!: From = {a/d)w\ do not compute dw l ^ — . 
{a/a)‘ w * A u> x . Missing is the term (a/a)dcuK Instead write cu‘- = ( a/a)w* = 
a dx, and then find dio = ddt A d\ = ( a/a)u >* A aA. With elementary care, 
then, in correctly substituting from (3) for the cu* p in the formula for 9L* V , one finds 

= ( a/a)cu‘ A wx, 
and 

= (] q. a 2 )a~ 2 cux /\ 

This completes the computation of the R* va p, since in this isotopic model universe, 
all space directions in the orthonormal frame cu* are algebraically equivalent. One 
can therefore write 

0L* k = (a/a)w i A cu k , 

(4) 

9l k t = a 2 (1 + a 2 )u) k A cu 1 , 

for the complete list of 9L* P . Specific components, such as 

Ri iii = Ri xie = °> etc -> 

or 

R'eU = 0, R 9 m = a~ 2 (l + d 2 ), 
are easily read out of this display of 91* v . 

C. Contraction 

From equations (14.7), find 


G h = + 3<r 2 (l + a 2 ), (5 a) 

G { * = G u = G‘* = 0 = = G 9 * = G**, (5b) 

G** = G u = G** = -{2a~ l a + a~ 2 ( 1 + d 2 )}, (5c) 

and 

R=-G% = 6[cr l S + a -2 (l + d 2 )]. (6) 

_.____ J 
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If guessing is not easy, there is a systematic way to solve equations (14.31) in 
an orthonormal frame or in any other frame in which dg^ v — 0. Compute the du/ li 
and arrange them in the format 

dw a = A w". (14.32) 

In this way display the 24 “commutation coefficients” c^. These quantities enter 
into the formula 


(3) calculate curvature 
2-forms 


"V = y ( c „,« + ~ c„ ail )uJ a (14.33) 

to provide the six w (exercise 14.12). 

Once the are known, one computes the curvature forms ‘A v - V (again only six 
in four dimensions, since = -5by use of the formula 


(4) calculate components of 
curvature tensors 


= duj\ + w\ A cu a p . (14.34) 

Out of this tabulation, one reads the individual components of the curvature tensor 
by using the identification scheme 


A uj0. (14.35) 

The Einstein tensor G is computed by scanning the A 111 ' display to find the 
appropriate R»' alt components for use in formulas (14.7). 


EXERCISES 


Exercise 14.5. EXTERIOR DERIVATIVE OF A PRODUCT OF FORMS 

Establish equation (14.13a) by working up recursively from forms of lower order to forms 
of higher order. [Hints: Recall from equation (4.27) that for a p -form 

a — 0l \ix y ..n p \dx 11 ' A • • • A dx*p, 
the exterior derivative is defined by 

da = d'V-V dxH A dxH /\ ... /\ dx n „. 


Applied to the product a A /} of two 1-forms, this formula gives 

d(a A 0) = </[(<*„ rfx x ) A (/^rf*")] 

= d[(a K fi ll )(dx x A dx“)] 

0(a x /?J 

= — - / dx K A dx x A dx * 
dx K 

= {j^ dxK A dxX ) A dx “ ~ dxX) A (^ dxK A dxM ) 

- (da) A 0 - a A dp. 

Extend the reasoning to forms of higher order.] 
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Exercise 14.6. RELATIONSHIP BETWEEN EXTERIOR DERIVATIVE 
AND COMMUTATOR 

Establish formula (14.21) by showing (a) that the righthand side is an algebraic linear function 
of u and an algebraic linear function of v , and (b) that the equation holds when u and 
v are coordinate basis vectors u — d/dx k , v = d/dx l . 

Exercise 14.7. CHRISTOFFEL FORMULA DERIVED FROM CONNECTION FORMS 

In a coordinate frame w* = dx», show that equation (14.31a) requires and 

that, with this symmetry established, equation (14.31b) gives an expression for 0g AJ ,/0;c a which 
can be solved to give the Christoffel formula 

Exercise 14.8. RIEMANN-CHRISTOFFEL CURVATURE FORMULA 
RELATED TO CURVATURE FORMS 

Substitute uj ik v = x dx x into equation (14.18), and from the result read out, according 
to equation (14.25), the classical formula (3) of Box 14.2 for the components 

Exercise 14.9. MATRIX NOTATION FOR REVIEW OF CARTAN 
STRUCTURE EQUATIONS 

Let e = (e v .. ., e n ) be a row matrix whose entries are the basis vectors, and let to be a 
column of basis 1-forms w fi . Similarly let Q = lla/%11 and <51 = III® *,11 be square matrices 
with 1-form and 2-form entries. This gives a compact notation in which de = e v w v and 
d°J = read 


de — eQ and dty = eu, 


(14.37) 


respectively. 

(a) From equations (14.37) and d 2 6* — 0, derive equation (14.31a) in the form 

0 = du + Q A co. (14.38) 

[Solution: d 2 # = de A to + e cfco = e{Q A to + tfto).] 

(b) Compute d 2 e as motivation for definition (14.18), which reads 

<51 = dQ + 12 A Q. (14.39) 

(c) From </ 2 oo = 0, deduce 6L A to = 0 and then decompress the notation to get the 
antisymmetry relation i? M [a/3y] = 0. 

(d) Compute d<5l from equation (14.39), and relate it to the Bianchi identity R M p [ a p . y] = 0. 

(e) Let v = be a column of functions; so v = ev = e is a vector field. Compute, 
in compact notation, dv and d 2 v to show d 2 v = ei'Hv (which is equation 14.17). 

Exercise 14.10. TRANSFORMATION RULES FOR CONNECTION FORMS 
IN COMPACT NOTATION 

Using the notation of the previous exercise, write e ' = eA in place of e^ — e V A V ,, and 
similarly to' = A~ l u, to represent a change of frame. Show that d& = eto = e'u'. Substitute 
e' = eA in de' — e'S?' to deduce the transformation law 


Q' = A-'ttA + A - 1 dA. (14.40) 

Rewrite this in decompressed notation for coordinate frames with A v , = dx v /cx fi ' as a 
formula of the form r* = (?). 
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Exercise 14.11. SPACE IS FLAT IF THE CURVATURE VANISHES (see §11.5) 

If coordinates exist in which all straight lines (d^/dX 2 = 0) are geodesics, then one says 
the space is flat. Evidently all and R vanish in this case, by equation (14.8) and 
equation (3) in Box 14.2. Show conversely that, if $ = 0, then such coordinates exist. Use 
the results of the previous problem to find differential equations for a transformation A to 
a basis e' where X?' = 0. What are the conditions for complete integrability of these equations? 
[Note that df K = /^(x,/) is completely integrable if d 2 f K = 0 modulo the original equations.] 
Why will the basis forms cu M in this new frame be coordinate differentials cu M = dx ^? 

Exercise 14.12. SYSTEMATIC COMPUTATION OF CONNECTION FORMS 
IN ORTHONORMAL FRAMES 

Deduce equation (14.32) by applying equation (14.21) to basis vectors, using equations (8.14) 
to define Then show that, in an orthonormal frame (or any frames with g = const), 
equation (14.33) provides a solution of equations (14.31), which define cu tl l> . [Compare also 
equation (8.24b).] 

Exercise 14.13. SCHWARZSCHILD CURVATURE FORMS 

Use the obvious orthonormal frame cu* = e 0 dt y cu* = e A dr y cu 9 = r dO , cu 0 = r sin 6 d<t> 
for the Schwarzschild metric 

ds 2 = — e 20 dt 2 + e 2A dr 2 A- r 2 (d6 2 + sin 2 0 d(p 2 ) y (14.41) 

in which 0 and A are functions of r only; and compute the curvature forms $*** and the 
Einstein tensor G\ by^ the methods of Box 14.5. [Answer: = Ecu 1 A cu\ = Ecu* A 

a/*, = Ecu 1 A cu 0 y = Feu 9 A cu * = Feu 0 A cu\ = Feu* A cu 9 y with 


E = —e~ 2A (<P" + &' 2 — &'A% 

E = - — e ~ 2A 0\ 
r 

F = — e~ 2A A'\ 
r 


(14.42) 


and then 


Gl= -(F+2F), 

Gl = — (F + 2£), 

gI = g\= -(E+E+F), 

Gi=Gl=Gl = 0 = Gi = Gl=GU 


(14.43) 


Exercise 14.14. MATRIX DISPLAY OF THE RIEMANN-TENSOR COMPONENTS 

Use the symmetries of the Riemann tensor to justify displaying its components in an ortho- 
normal frame in the form 


= 


V i 

02 
_ 03 

p 

E i 
» 

I 

H 

23 

1 


31 

-H T j 

F 

12 

! 

/ 


(14.44) 




§14.6. METHOD OF EXTERIOR DIFFERENTIAL FORMS 


361 


where the rows are labeled by index pairs fiv = 01, 02, etc., as shown; and the columns afi, 
similarly. Here E, F and H are each 3x3 matrices with (why?) 

E = E T , F = F T trace H = 0, (14.45) 

where E T means the transpose of E . 


Exercise 14.15. RIEMANN MATRIX WITH VANISHING 
EINSTEIN TENSOR 

Show that the empty-space Einstein equations — 0 allow the matrix in equation (14.44) 
to be simplified to the form 




a/3 




(14.46) 


where now, in addition to the equality E = F that this form implies, the further conditions 

trace E = 0, H = H T (14.47) 


hold. 


Exercise 14.16. COMPUTATION OF CURVATURE FOR A PULSATING 
OR COLLAPSING STAR 

Spherically symmetric motions of self-gravitating bodies are discussed in Chapters 26 and 
32. A metric form often adopted in this situation is 


ds 2 = - e 2 * dT 2 + e 2 - 4 dR 2 + r\d6 2 + sin 2 <9 d<$> 2 ), (14.48) 


where now 0, A , and r are each functions of the two coordinates R and T Compute the 
curvature 2-forms and the Einstein tensor for this metric, using the methods of Box 14.5. 
In the guessing of the cu* v , most of the terms will already be evident from the corresponding 
calculation in exercise 14.13. [Answer, in the obvious orthonormal frame cu^ = e* dT, cu k = 
e A dR, cu 6 = r dd , cu* — r sin 6 d$: 


Ecu* A cu k , 

— Ecu f A cu 6 + Hcu k A cu 6 , 
61*^ — Eu j* A cu* + Hcu k A cu 
= Fu)* A cu 


= Fcu k A cu* — Hcu f A cu 6 , 
( Ji k ^ = Fcu k A cu* — Hcu f A cu*. 


which, in the matrix display of exercise 14.14, gives 



r. 

i 

E . !. 

E ! . -H 

A 

TR 

TO 

Tj> 



.IF 


§6 


1 • 

. H ! . F 

• j 



\ . 

-H . 1 . 

F / 

R0 


(14.49) 


(14.50) 
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14. CALCULATION OF CURVATURE 


Here 


E=e-™{A + 
E = e- 2 *(r - 

r 

F=±(\-r' 

F = - e- 2#, ryi ■ 
r 

The Einstein tensor is 

c ff 

Qtk 

G f s 

G S e 

G k s 


A 2 — A0) — e~ 2A (<P" + <P' 2 — 4>'A '), 
_ f4>) — L e -2A r ’Q^ 

■' — f<P' — r'A), 

2 e -2A + j.2 e ~2<P ^ 

f — e _2/1 (/-'vl' - /•"). 
r 


-G t f 

= F+2F, 

II 

2 H, 

G\ = 

o, 

G\ = 

— (2 E+F), 

G% = 

-(E+E+F), 

II 

<-& 

<< 

G*i = 0.] 


(14.51) 


(14.52) 


Exercise 14.17. BIANCHI IDENTITY IN dtft. = 0 FORM 

Define the Riemann tensor as a bivector-valued 2-form, 

® = |e/e,r, (14.53) 

and evaluate d$ to make it manifest that d9L = 0. Use 

- a/*, A oA°, (14.54) 

which is derived easily in an orthonormal frame (adequate for proving dft = 0), or (as a 
test of skill) in a general frame where = < & ,l (s g av and (why?) t/g"" = -g‘‘“(dg 0/J )g /J '’. 
[Afore: only wedge products between forms (not those between vectors) count in fixing signs 
in the product rule (14.13) for d.] 


Exercise 14.18. LOCAL CONSERVATION OF ENERGY AND MOMENTUM: 
d*T = 0 MEANS T = 0 

Let the duality operator *, as defined for exterior differential forms in Box 4.1, act on the 
forms, but not on the contravariant vectors, which appear when the stress-energy tensor T 
or the Einstein tensor G is written as a mixed (}) tensor: 

T = 

or 

G = e ll G» v w v . 

(a) Give an expression for *T (or *G) expanded in terms of basis vectors and forms. 



§14.6. METHOD OF EXTERIOR DIFFERENTIAL FORMS 
(b) Show that 
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where d 3 Z v — c v|a/3y| uj a A ujP A iu y [see Box 5.4 and equations (8.10)]. 
(c) Compute d*T using the generalized exterior derivative d; find that 


d*T — e. y y/\g\uj Q A cu l A cv 2 A cu 3 . 



CHAPTER 


r 


BIAIMCHI IDENTITIES AND 

THE BOUNDARY OF A BOUNDARY 




-\ 

This chapter is entirely Track 2. 

As preparation, one needs to 
have covered (1) Chapter 4 
(differential forms} and (2} 

Chapter 14 (computation of 
curvature}. 

In reading it, one will be 
helped by Chapters 9-11 
and 13. 

It is not needed as 
preparation for any later 
chapter, but it will be helpful 
in Chapter 17 (Einstein field 
equations). 

v _ J 


Identities and conservation of 
the source: electromagnetism 
and gravitation compared: 


§15.1. BIANCHI IDENTITIES IN BRIEF 

Geometry gives instructions to matter, but how does matter manage to give instruc¬ 
tions to geometry? Geometry conveys its instructions to matter by a simple handle: 
“pursue a world line of extremal lapse of proper time (geodesic).” What is the handle 
by which matter can act back on geometry? How can one identify the right handle 
when the metric geometry of Riemann and Einstein has scores of interesting features? 
Physics tells one what to look for: a machinery of coupling between gravitation 
{spacetime curvature) and source (matter; stress-energy tensor T) that will guarantee 
the automatic conservation of the source (▼ • T = 0). Physics therefore _a$ksjnathe- 
matics: “What tensor-like feature of the geometry is automatically conserved?” 
Mathematics comes back with the answer: “The Einstein tensor.” Physics queries, 
“How does this conservation come about?” Mathematics, in the person of Elie 
Cartan, replies, “Through the principle that ‘the boundary of a boundary is zero’” 
(Box 15.1). 

Actually, two features of the curvature are automatically conserved; or, otherwise 
stated, the curvature satisfies two Bianchi identities, the subject of this chapter. Both 
features of the curvature, both “geometric objects,” lend themselves to representation 
in diagrams, moreover, diagrams that show in action the principle that “the boundary 
of a boundary is zero.” In this respect, the geometry of spacetime shows a striking 
analogy to the field of Maxwell electrodynamics. 

In electrodynamics there are four potentials that are united in the 1-form A = 
A u dx Out of this quantity by differentiation follows the Faraday , F = dA. This 
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field satisfies the identity dF — 0 (identity, yes; identity lending itself to the definition dF = 0 
of a conserved source, no). 

In gravitation there are ten potentials (metric coefficients g^ v ) that are united in 
the metric tensor g — g^ v </x M & dx v . Out of this quantity by two differentiations 
follows the curvature operator 

« = A e v R^ ap dx a A dxP. 


This curvature operator satisfies the Bianchi identity d$l — 0, where now “</” is a d ( Ji = 0 
generalization of Cartan’s exterior derivative, described more fully in Chapter 14 
(again an identity, but again one that does not lend itself to the definition of a 
conserved source). 

In electromagnetism, one has to go to the dual, *F, to have any feature of the dd*F = 0 plus Maxwell 
field that offers a handle to the source, d*F — 4tt *J. The conservation of the equations =»- d*J = 0 
source, d*J = 0, appears as a consequence of the identity dd*F = 0; or, by a 
rewording of the reasoning (Box 15.1), as a consequence of the vanishing of the 
boundary of a boundary. 

(i continued on page 370 ) 


Bo. 15.1 THE BOUNDARY OF A BOUNDARY IS ZERO 


A 


A. The Idea in Its 1-2-3-Dimensional Form 

Begin with an oriented cube or approximation to 
a cube (3-dimensional). 

Its boundary is composed of six oriented faces, 
each two-dimensional. Orientation of each face is 
indicated by an arrow. 

Boundary of any one oriented face consists of 
four oriented edges or arrows, each one-dimen- 
- sional. 

Every edge unites one face with another. No 
edge stands by itself in isolation. 

“Sum” over all these edges, with due regard to 
sign. Find that any given edge is counted twice, 
once going one way, once going the other. 

Conclude that the one-dimensional boundary of 
the two-dimensional boundary of the three-di¬ 
mensional cube is identically zero. 

C j 
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15. BIANCHI IDENTITIES AND THE BOUNDARY OF A BOUNDARY 



Box 15.1 (continued) 


B. The Idea in Its 2-3-4-Dimensional Form 


Begin with an oriented four-dimensional cube or 
approximation thereto. The coordinates of the 
typical comer of the four-cube may be taken to 
be (/ 0 ±\At, x 0 ±\ Ax, jo ± 2 fy, z 0 ± \ Az ); 
and, accordingly, a sample comer itself, in an 
obvious abbreviation, is conveniently abbreviated 
-I-h. There are 16 of these comers. Less com¬ 

plicated in appearance than the 4-cube itself are 


its three-dimensional faces, which are “exploded 
off of it” into the surrounding area of the diagram, 
where they can be inspected in detail. 

The boundary of the 4-cube is composed of eight 
oriented hyperfaces, each of them three-dimen¬ 
sional (top hyperface with extension Ax Ay Az, 
for example; a “front” hyperface with extension 
At Ay Az ; etc.) 








1. BIANCHI IDENTITIES IN BRIEF 


367 


Boundary of any one hyperface (“cube”) consists of six oriented faces, each 
two-dimensional. 

Every face (for example, the hatched face Ax Ay in the lower lefthand comer) 
unites one hypersurface with another (the “3-cube side face” At Ax Ay in the lower 
lefthand comer with the “3-cube top face” Ax Ay Az, in this example). No face stands 
by itself in isolation. The three-dimensional boundary of the 4-cube exposes no 
2-surface to the outside world. It is faceless . 

“Sum” over all these faces, with due regard to orientation. Find any given face 
is counted twice, once with one orientation, once with the opposite orientation. 

Conclude that the two-dimensional boundary of the three-dimensional boundary 
of the four-dimensional cube is identically zero. 


C. The Idea in Its General Abstract Form 

dd = 0 (the boundary of a boundary is zero). 


D. Idea Behind Application to Gravitation and Electromagnetism 

The one central point is a law of conservation (conservation of charge; conservation 
of momentum-energy). 

The other central point is “automatic fulfillment” of this conservation law. 

“Automatic conservation” requires that source not be an agent free to vary 
arbitrarily from place to place and instant to instant. 

Source needs a tie to something that, while having degrees of freedom of its own, 
will cut down the otherwise arbitrary degrees of freedom of the source sufficiently 
to guarantee that the source automatically fulfills the conservation law. Give the 
name “field” to this something. 

Define this field and “wire it up” to the source in such a way that the conservation 
of the source shall be an automatic consequence of the “zero boundary of a boundary” 
Or, more explicitly: Conservation demands no creation or destruction of source inside 
the four-dimensional cube shown in the diagram. Equivalently, integral of “creation 
events” (integral of d*J for electric charge; integral of d*T for energy-momentum) 
over this four-dimensional region is required to be zero. 

Integral of creation over this four-dimensional region translates into integral of 
source density-current (* J or *T) over three-dimensional boundary of this region. 
This boundary consists of eight hyperfaces, each taken with due regard to orientation. 
Integral over upper hyperface (“ Ax Ay Az)” gives amount of source present at later 
moment; over lower hyperface gives amount of souce present at earlier moment; 
over such hyperfaces as “At Ax Ay” gives outflow of source over intervening period 
of time. Conservation demands that sum of these eight three-dimensional integrals 
shall be zero (details in Chapter 5). 
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Box 15.1 (continued) 


Vanishing of this sum of three-dimensional integrals states the conservation 
requirement, but does not provide the machinery for “automatically” (or, in mathe¬ 
matical terms, “identically”) meeting this requirement. For that, turn to principle 
that “boundary of a boundary is zero.” 

Demand that integral of source density-current over any oriented hyperface L V 
(three-dimensional region; “cube”) shall equal integral of field over faces of this 
“cube” (each face being taken with the appropriate orientation and the cube being 
infinitesimal): 



S,f -T. f 

\ rotation / 


Sum over the six faces of this cube and continue summing until the faces of all 
eight cubes are covered. Find that any given face (as, for example, the hatched face 
in the diagram) is counted twice, once with one orientation, once with the other 
(“boundary of a boundary is zero”). Thus is guaranteed the conservation of source: 
integral of source density-current over three-dimensional boundary of four-dimen¬ 
sional region is automatically zero, making integral of creation over interior of that 
four-dimensional region also identically zero. 

Repeat calculation with boundary of that four-dimensional region slightly dis¬ 
placed in one locality [the “bubble differentiation” of Tomonaga (1946) and Schwin¬ 
ger (1948)], and conclude that conservation is guaranteed, not.only in the four-di¬ 
mensional region as a whole, - but at every point within it, and, by extension, 
everywhere in spacetime. 


E. Relation of Source to Field 

One view: Source is primary. Field may have other duties, but its prime duty is 
to serve as “slave” of source. Conservation of source comes first; field has to adjust 
itself accordingly. 

Alternative view: Field is primary. Field takes the responsibility of seeing to it 
that the source obeys the conservation law. Source would not know what to do in 
absence of the field, and would not even exist. Source is “built” from field. Conser¬ 
vation of source is consequence of this construction. 

One model illustrating this view in an elementary context: Concept of “classical” 
electric charge as nothing but “electric lines of force trapped in the topology of a 
multiply connected space” [Weyl (1924b); Wheeler (1955); Misner and Wheeler 
(1957)]. 

On any view: Integral of source density-current over any three-dimensional region 
(a “cube” in simplified analysis above) equals integral of field over boundary of 
this region (the six faces of the cube above). No one has ever found any other way 
to understand the correlation between field law and conservation law. 
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F. Electromagnetism as a Model: How to "Wire Up" Source to Field 
to Give Automatic Conservation of Source Via “33 = 0” in Its 
2-3-4-Dimensional Form 

Conservation means zero creation of charge (zero creation in four-dimensional 
region i2). 

Conservation therefore demands zero value for integral of charge density-current 
over three-dimensional boundary of this volume; thus, 

3 J* 


° = f = f J* 

J B ^ J ds2 » 


in the Track-1 language of Chapters 3 and 5. Equivalently, in the coordinate-free 
abstract language of §§4.3-4.6, one has 


0 = f d*J = f *J, 

J a cij? 


where 


*J = V 123 dx 1 A dx 2 A dx 3 + V 023 dx° A dx 2 A dx 3 

+ V 031 dx° A dx 3 A dx 1 + V 012 dx° A dx 1 A dx 2 

(“eggcrate-hke structure” of the 3-form of charge-density and current-density). 

Fulfill this conservation requirement automatically (“identically”) through the 
principle that “the boundary of a boundary is zero” by writing 477 *J = d*F ; thus, 


4rr f *J = f d*F = f *F 


9J? v bS2 % 'ddS2(zerol) 

or, in Track-1 language, write 477 /^ = F^ „, and have 

477 J J* d 3 !^ = f F".,d 3 Z,= f F» a d 2 I = 0. 

d£2 9S2 99J?(zero!) 

In other words, half of Maxwell’s equations in their familiar flat-space form, 

div E = V ■ E = 477p, curl B — V x B = E + 4-rrJ, 

“wire up” the source to the field in such a way that the law of conservation of source 
follows directly from “331? = 0.” 

G. Electromagnetism Also Employs “33 = 0” in its 
1-2-3-Dimensional Form ("No Magnetic Charge") 

Magnetic charge is linked with field via 4 t rJ mag = dF (see point F above for transla¬ 
tion of this compact Track-2 language into equivalent Track-1 terms). Absence of 
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Box 15.1 (continued) 


any magnetic charge says that integral of J mag over any 3 -volume T is necessarily 
zero; or (“integration by parts/’ generalized Stokes theorem) 




total magnetic flux 
exiting through 3 T 


) 


In order to satisfy this requirement “automatically,” via principle that “the boundary 
of a boundary is zero,” write F = dA (“expression of field in terms of 4-potential”), 
and have 


/ F = f dA = f >1=0. 

9T 9T 99T(zero!) 


H. Structure of Electrodynamics in Outline Form 


V 


. r . r: ,. ' v< 





A (potential) 

1 

K 

e 

0 , 

T 

F (field; Faraday) = dA 

1 

-► -F (dual field; Maxwell) 

.2 

c 

w 

1 

dF = 0 (identity based 

\ 

d'F = 4tt* J 

Tn 

on 99 = 0) 

1 

5 


♦ 

d *J = 0 (expressed as an 


identity based 
on 98 = 0) 


d*J = 0 or VJ a= 0 
(“automatic” conservation of source) 


In gravitation physics, one has to go to the “double dual” (two pairs of alternating 
indices, two places to take the dual) <3 = *R* of Riemann to have a feature of 
the field that offers a handle to the source: 

® = Tr£ = Einstein = 8 77 7 " = 877 x (density of energy-momentum). 

The conservation of the source T = e tl T> l v (u v can be stated V • T = 0. But better 
suited for the present purpose is the form (see Chapter 14 and exercise 14.18) 




§15.1. BIANCHI IDENTITIES IN BRIEF 


371 



I. Structure of Geometrodynamics in Outline Form 





d*G = 0 plus Einstein field 
equation =^* d*T = 0 

This conservation law arises as a consequence of the “contracted Bianchi identity”, 
d*G — 0, again interpretable in terms of the vanishing of the boundary of a 
boundary. 


d*T = 0, 


where 
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Bianchi identity, d$l = 0 
interpreted in terms of 
parallel transport around 
six faces of a cube. 



Figure 15.1. 

Combine rotations associated with each of the six faces of the illus¬ 
trated 3-volume and end up with zero net rotation (“full Bianchi 
identity”). Reason: Contribution of any face is measured by change 
in a test vector A carried in parallel transport around the perimeter 
of that face. Combine contributions of all faces and end up with each 
edge traversed twice, once in one direction, once in the other direction 
[boundary (here one-dimensional) of boundary (two-dimensional) of 
indicated three-dimensional figure is zero]. Detail: The vector A, 
residing at the indicated site, is transported parallel to itself over to 
the indicated face, then carried around the perimeter of that face by 
parallel transport, experiencing in the process a rotation measured by 
the spacetime curvature associated with that face, then transported 
parallel to itself back to the original site. To the lowest relevant order 
of small quantities one can write 

(change in A) = — Ay Az &(e y , e z ) A 
in operator notation; or in coordinate language, 

— 8A a = R a p yz (aXx + Ax)A& Ay Az. 


§15.2. BIANCHI IDENTITY diZ = 0 AS A MANIFESTATION 
OF "BOUNDARY OF BOUNDARY = 0" 

Such is the story of the two Bianchi identities in outline form; it is now appropriate 
to fill in the details. Figure 15.1 illustrates the full Bianchi identity, d$ = 0 (see 
exercise 14.17), saying in brief, “The sum of the curvature-induced rotations associ¬ 
ated with the six faces of any elementary cube is zero.” The change in a vector A 
associated with transport around the perimeter of the indicated face evaluated to 
the lowest relevant order of small quantities is given by 

— 8A a = R a p yz (at jc + Ax)AP Ay Az. (15.1) 

The opposite face gives a similar contribution, except that now the sign is reversed 
and the evaluation takes place at jc rather than at jc + Ax. The combination of the 
contributions from the two faces gives 

7\ J?a 

— ^ VZ A& Ax Ay Az , 
dx 


(15.2) 
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when Riemann normal coordinates are in use. In such coordinates, the vanishing 
of the total —SA a contributed by all six faces implies 

+ * Vw + ^ / 3xy;z 0. (15.3) 

Here semicolons (covariant derivatives) can be and have been inserted instead of 
commas (ordinary derivatives), because the two are identical in the context of 
Riemann normal coordinates; and the covariant version (15.3) generalizes itself to 
arbitrary curvilinear coordinates. Turn from an xyz cube to a cube defined by any 
set of coordinate axes, and write Bianchi’s identity in the form 

= 0. (15.4) 

(See exercise 14.17 for one reexpression of this identity in the abstract coordinate- 
independent form, d6l = 0, and §15.3 for another.) This identity occupies much the 
same place in gravitation physics as that occupied by the identity dF = ddA = 0 
in electromagnetism: 

= 0- (15.5) 

§15.3. MOMENT OF ROTATION: KEY TO CONTRACTED 

BIANCHI IDENTITY 

The contracted Bianchi identity, the identity that offers a “handle to couple to the 
source,” was shown by Elie Cartan to deal with “moments of rotation” [Cartan 
(1928); Wheeler (1964b); Misner and Wheeler (1972)]. Moments are familiar in 
elementary mechanics. A rigid body will not remain at rest unless all the forces acting 
on it sum to zero: 


^ ^ (i) = 0. (15.6) 


Although necessary, this condition is not sufficient. The sum of the moments of these 
forces about some point 9 must also be zero: 

^ (^ (i) - 9) A F {i) = 0. (15.7) 

i 

Exactly what point these moments are taken about happily does not matter, and 
this for a simple reason. The arbitrary point in the vector product (15.7) has for 
coefficient the quantity which already has been required to vanish. The 

situation is similar in the elementary cube of Figure 15.1. Here the rotation associated 
with a given face is the analog of the force F {i) in mechanics. That the sum of these 
rotations vanishes when extended over all six faces of the cube is the analog of the 
vanishing of the sum of the forces F {i) . 

What is the analog for curvature of the moment of the force that one encounters 
in mechanics? It is the moment of the rotation associated with a given face of the 
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Net moment of rotation over 
all six faces of a cube: 


(1) described 


(2) equated to integral of 
source, f*T, over interior of 
cube 


cube. The value of any individual moment depends on the reference point 9. 
However, the sum of these moments taken over all six faces of the cube will have 
a value independent of the reference point 9, for the same reason as in mechanics. 
Therefore 9 can be taken where one pleases, inside the elementary cube or outside 
it. Moreover, the cube may be viewed as a bit of a hypersurface sliced through 
spacetime. Therefore 9 can as well be off the slice as on it. It is only required that 
all distances involved be short enough that one obtains the required precision by 
calculating the moments and the sum of moments in a local Riemann-normal 
coordinate system. One thus arrives at a ^-independent totalized moment of rotation 
(not necessarily zero; gravitation is not mechanics!) associated with the cube in 
question. 

Now comes the magic of “the boundary of the boundary is zero.” Identify this 
net moment of rotation of the cube, evaluated by summing individual moments of 
rotation associated with individual faces, with the integral of the source density- 
current (energy-momentum tensor *7") over the interior of the 3-cube. Make this 
identification not only for the one 3-cube, but for all eight 3-cubes (hyperfaces) that 
bound the four-dimensional cube in Box 15.1. Sum the integrated source density- 
current not only for the one hyperface of the 4-cube, but for all eight hyperfaces. 
Thus have 


(3) conserved 


/ 

4-cube 


source 

creation 


d*T 


i 

3-boundary 
of this 4-cube 


( source currents 
density, *T ) 


= 2 

these eight 
bounding 
3-cubes 


net moment of rotation 
associated with speci¬ 
fied cube 


= 2 2 

eight. six faces 
bounding bounding 
3-cubes given 3-cube 


moment of rotation 
associated with specified 
face of specified cube 


(zero!) 


(15.8) 


Let the moments of rotation, not only for the six faces of one cube, but for all the 
faces of all the cubes, be taken with respect to one and the same point 9. Recall 
(Box 15.1) that any given face joins two cubes or hyperfaces. It therefore appears 
twice in the count of faces, once with one orientation (“sense of circumnavigation 
in parallel transport to evaluate rotation”) and once with the opposite orientation. 
Therefore the double sum vanishes identically (boundary of a boundary is zero!) 
This identity establishes existence of a new geometric object, a feature of the curva¬ 
ture, that is conserved, and therefore provides a handle to which to couple a source. 
The desired result has been achieved. Now to translate it into standard mathematics! 
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§15.4. CALCULATION OF THE MOMENT OF ROTATION 

It remains to find the tensorial character and value of this conserved Cartan moment 

of rotation that appertains to any elementary 3-volume. The rotation associated with (4) evaluated 

the front face Ay Aze y A e z of the cube in Figure 15.1 will be represented by the 

bivector 


( rotation associated \ 
with front Ay Az face/ 


= e x A e^R^^AyAz 


(15.9) 


located at 9 front = (r — | At, x + Ax, y + \ Ay, z + \ Az). This equation uses Rie- 
mann normal coordinates; indices enclosed by strokes, as in IVI, are summed with 
the restriction A < ju. The moment of this rotation with respect to the point 9 will 
be represented by the trivector 


( moment of rotation \ 

associated with — (Renter - A e x A Ay Az. (15.10) 

front Ay Az face / f a f C e° nt 


Here neither ^ cen ter front nor ^ h as an y well-defined meaning whatsoever as a vec¬ 
tor, but their difference is a vector in the limit of infinitesimal separation, A9 = 
9 center front ~ . With the back face a similar moment of rotation is associated, 

with the opposite sign, and with ^ centerfront replaced by S? center back . In the difference 
between the two terms, the factor 9 is of no interest, because one is already assured 
it will cancel out [Bianchi identity (15.4); analog of ZF {i) = 0 in mechanics]. The 
difference ^ centerfront - ^ ce nterback has the value Axe x . Summing over all six faces, 
one has 

( net moment of 
rotation associated 
with cube or hyper¬ 
face Ax Ay Az 

e x A e x A e ii R M yz Ax Ay Az (front and back) 

— — - ^ + e y A e x A e^R lX/xl zx Ay Az Ax (sides) 

+ e z A e x A e^R 1 ^ 1 ^ Az Ax Ay (top and bottom). (15.11) 

This sum one recognizes as the value (on the volume element e x A e y A e z Ax 
Ay Az) of the 3-form 

e v A e x A e^R^\ a(i , dx v A dx a A dx&. 

Moreover this 3-form is defined, and precisely defined, at a point, whereas (15.11), 
applying as it does to an extended region, does not lend itself to an analysis that 
is at the same time brief and precise. Therefore forego (15.11) in favor of the 3-form. 
Only remember, when it comes down to interpretation, that this 3-form is to be 
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(5) abstracted to give 

d3 A it 


(6) abstracted to give 
*(<& A fl) = e a G aT d 3 2 7 


evaluated for the “cube” e x A e y A e z Ax Ay Az. Now note that the “trivector¬ 
valued moment-of-rotation 3-form” can also be written as 

(ro^Tion ^ = <*? A * = «r A e x A e„/J^' |a/}| dx" A dx« A dx (15.12) 
Here 

d§> = e a dx° (15.13) 

is Cartan’s (}) unit tensor. Also <31 is the curvature operator, treated as a bivector¬ 
valued 2-form: 

S = e x A j dx a A dx?. (15.14) 

Using the language of components as in (15.11), or the abstract language intro¬ 
duced in (15.12), one finds oneself dealing with a trivector. A trivector can be left 
a trivector, as, in quite another context, an element of 3-volume on a hypersurface 
in 4-space can be left as a trivector. However, there it is more convenient to take 
the dual representation, and speak of the element of volume as a vector. Denote 
by ★ a duality operation that acts only on contravariant vectors, trivectors, etc. (but 
not on forms). Then in a Lorentz frame one has *(e x A e 2 A e 3 ) = e 0 ; but 
*(</x 3 ) = dx 3 . More generally, 

★(e„ A e x A ej = e„ x /e a . (15.15) 

In this notation, the “vector-valued moment-of-rotation 3-form” is 

( moment \ , _ 

of rotation) = * (< ^ A *> = **' A dx° A dx» 

= •„(**)/ w »i dx" A dx a A dx?, 

or, in one more step, 

/moment \ _ _ ^,r d z S (15.16) 

Vof rotation/ 7 oV T v ’ 

Here d 3 I T is a notation for basis 3-forms, as in Box 5.4; thus, 

dx" A dx a A dx<* = e" a ^ T d 3 S r (15.17) 

(In a local Lorentz frame, dx 1 A dx 2 A dx 3 = d 3 I 0 .) 

Nothing is more central to the analysis of curvature than the formula (15.16). 
It starts with an element of 3-volume and ends up giving the moment of rotation 
in that 3-volume. The tensor that connects the starting volume with the final moment, 
the “contracted double-dual” of Riemann, is so important that it deserves and 
receives a name of its own, G = Einstein; thus 

(Einstein) 0 ’’ = G° T = €/" T = (*/?•)/". (15.18) 

This tensor received attention in §§13.5 and 14.2, and also in the examples at the 
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end of Chapter 14. In terms of Einstein, the connection between element of 3-vol¬ 
ume and “vector-valued moment of rotation” is 

/moment \ _ _ 

Vof rotation/ v ' v ’ 

The amount of “vector-valued moment of rotation” contained in the element of 
3-volume <P2 is identified by general relativity with the amount of energy-mo¬ 
mentum contained in that 3-volume. However, defer this identification for now. 
Concentrate instead on the conservation properties of this moment of rotation. See 
them once in the formulation of integral calculus, as a consequence of the principle 
“00 ~ 0.” See them then a second time, in differential formulation, as a consequence 
of “</</ = 0.” 


§15.5. CONSERVATION OF MOMENT OF ROTATION SEEN 
FROM “BOUNDARY OF A BOUNDARY IS ZERO 1 ' 

The moment of rotation defines an automatically conserved quantity. In other words, 
the value of the moment of rotation for an elementary 3-volume Ax Ay Az after the 
lapse of a time At is equal to the value of the moment of rotation for the same 
3-volume at the beginning of that time, corrected by the inflow of moment of rotation 
over the six faces of the 3-volume in that time interval (quantities proportional to 
Ay Az At, etc.) Now verify this conservation of moment of rotation in the language 
of “the boundary of a boundary ” Follow the pattern of equation (15.8), but translate 
the words into formulas, item by item. Evaluate the amount of moment of rotation 
created in the elementary 4-cube Q, and find 


“creation of moment of \ 


“creation” = f I rotation” in the elementary I — J" 
| \ 4-cube of spacetime Q j | 0 


d*G; 


definition 


definition 


I d " e ff " G jf *< rf?AS) 7 2 * 

S2 | 3j? T 3J? T the eight 


step 1 step 2 


the eight 
3-cubes 
that bound S1 


j moment of rotation ^ 

f (rtf A #) 

d-cube 

associated with 
^ specified 3-cube J 


step 3 


2 2 


eight bounding six faces bounding 
3-cubes specified 3-cube 


j moment of rotation ^ 

J (?p A #) 

race 

associated with 
specified face of 
\ specified cube j 


step 4 


: 0 . ( 15 . 20 ) 


Conservation of net moment 
of rotation: 


(1) derived from "00 = 0" 


step 4 


step 5 
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Here step 1 is the theorem of Stokes. Step 2 is the identification established by (15.19) 
between the Einstein tensor and the moment of rotation. Step 3 breaks down the 
integral over the entire boundary dQ into integrals over the individual 3-cubes that 
constitute this boundary. Moreover, in all these integrals, the star ★ is treated as 
a constant and taken outside the sign of integration. The reason for such treatment 
is simple: the duality operation ★ involves only the metric, and the metric is locally 
constant throughout the infinitesimal 4-cube over the boundary of which the inte¬ 
gration extends. Step 4 uses the formula 

A *1) = &j> A $ + sP A dSL = d& A # (15.21) 

and the theorem of Stokes to express each 3-cube integral as an integral of 9 A $ 
over the two-dimensional boundary of that cube. The culminating step is 5. It has 
nothing to do with the integrand. It depends solely on the principle dd ^0. 

In brief, the conservation of moment of rotation follows from two circumstances. 
(1) The moment of rotation associated with any elementary 3-cube is by definition 
a net value, obtained by adding the six moments of rotation associated with the 
six faces of that cube. (2) When one sums these net values for all eight 3-cubes 
in (15.20), which are the boundary of the elementary 4-cube ,12, one counts the 
contribution of a given 2-face twice, once with one sign and once with the opposite 
sign. In virtue of the principle that “the boundary of a boundary is zero,” the 
conservation of moment of rotation is thus an identity. 


§15.6. CONSERVATION OF MOMENT OF ROTATION 
EXPRESSED IN DIFFERENTIAL FORM 


derived from "dd = O’* 


Every conservation law stated in integral form lends itself to restatement in differ¬ 
ential form, and conservation of moment of rotation is no exception. The calculation 
is brief. Evaluate the generalized exterior derivative of the moment of rotation in 
three steps, and find that it vanishes; thus: 


d*G = d[*(dif A #)] 

= *[d{0> A #)] } ste P 1 

= A g — d& A tM\ 1 StCp 2 

= 0 j ste P ^ 


Step 1 uses the relation d * = *</. The star duality and the generalized exterior 
derivative commute because when d is applied to a contravariant vector, it acts as 
a covariant derivative, and when * is applied to a covariant vector or 1-form, it 
is without effect. Step 2 applies the standard rule for the action of d on a product 
of tensor-valued forms [see equation (14.13b)]. Step 3 deals with two terms. The 
first term vanishes because the first factor in it vanishes; thus, d 2 9 = 0 [Cartan’s 
equation of structure; expresses the “vanishing torsion” of the covariant derivative; 
see equation (14.26)]. The second term also vanishes, in this case, because the second 
factor in it vanishes; thus, d® = 0 (the full Bianchi identity). Thus briefly is conser¬ 
vation of moment of rotation established. 






Energy-momentum as source of 
gravitation (curvature of space- 
time) 

Moment of rotation as automati¬ 
cally conserved feature of the 
geometry 

Representation as a vector-valued 
3-form, a coordinate-independent 
geometric object 

Machine to tell how much energy- 
momentum is contained in an 
elementary 3-volume; 

*T = e a T° T <P2 T 
(“dual of stress-energy tensor”) 

Machine to tell how much net 
moment of rotation—expressed 
as a vector—is obtained by add¬ 
ing the six moments of rotation 
associated with the six faces of 
the elementary 3-cube: 

*(«0> A #)=*<? = b 9 G* t &2 t 
(“dual of Einstein”) 

Representation as a (g)-tensor 
(also a coordinate independent 
geometric object) 

Stress-energy tensor itself: 

T = e a T aT e r 

Einstein itself: 

G = e a G OT e T 

Representation in language of 
components (values depend on 
choice of coordinate system) 

t° t 

G OT 

Conservation law in language of 
components 

T° t . t = 0 

G°\ r : o 

Conservation in abstract lan¬ 
guage, for the (2)-tensor 

◄ 

II 

o 

◄ 

Qi 

III 

o 

Conservation in abstract lan¬ 
guage, as translated into exterior 
derivative of the dual tensor (vec¬ 
tor-valued 3-form) 

d*T= 0 

d*G e 0 or 
d*(di9 A tf) = 0 


f *G = 0 or 

Same conservation law expressed r * r aj? 

in integral form for an element J ^ = ® *J (&? A tf) = 0 or 

of 4-volume S7 di? di? 

* f (if A rf) = 0 

_ 


j 


-—- 

Box 15.2 THE SOURCE OF GRAVITATION AND THE MOMENT OF ROTATION: 

THE TWO KEY QUANTITIES AND THE MOST USEFUL MATHEMATICAL 
REPRESENTATIONS FOR THEM 


§15.7. FROM CONSERVATION OF MOMENT OF ROTATION TO 
EINSTEIN'S GEOMETRODYNAMICS: A PREVIEW 


Mass, or mass-energy, is the source of gravitation. Mass-energy is one component 
of the energy-momentum 4-vector. Energy and momentum are conserved. The 
amount of energy-momentum in the element of 3-volume d 3 Z is 

*T = e a T° T d*Z T (15.22) 


Einstein field equation 
"derived" from demand that 
(conservation of net moment 
of rotation) => (conservation 
of source) 


(see Box 15.2). Conservation of energy-momentum for an elementary 4-cube Q 
expresses itself in the form 



0 . 


(15.23) 
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EXERCISES 


This conservation is not an accident. According to Einstein and Cartan, it is “auto¬ 
matic”; and automatic, moreover, as a consequence of exact equality between 
energy-momentum and an automatically conserved feature of the geometry. What 
is this feature? It is the moment of rotation, which satisfies the law of automatic 
conservation, 

[ *G = 0. (15.24) 

J dS2 


In other words, the conservation of momentum-energy is to be made geometric in 
character and automatic in action by the following prescription: Identify the stress- 
energy tensor (up to a factor 8tt, or 8ttG/c 4 , or other factor that depends on choice 
of units) with the moment of rotation; thus, 


★(cfc? A <&) = *G = 8 tt *T ; (15.25) 

or equivalently (still in the language of vector-valued 3 -forms) 

(rotation ^ = * ( ^ A *) = e ^ ffT d*2 r = 8 «e a T°' d*S T ; (15.26) 

or, in the language of tensors, 

G = e a G° T e T = 877 e a T (TT e T = 8 ttT\ (15.27) 

or, in the language of components, 

G aT -%7rT OT •• • (15.28) 

(Einstein’s field equation; more detail, and more on the question of uniqueness, will 
be found in Chapter 17; see also Box 15.3). Thus simply is all of general relativity 
tied to the principle that the boundary of a boundary is zero. No one has ever 
discovered a more compelling foundation for the principle of conservation of mo¬ 
mentum and energy. No one has ever seen more deeply into that action of matter 
on space, and space on matter, which one calls gravitation. 

In summary, the Einstein theory realizes the conservation of energy-momentum as 
the identity , “the boundary of a boundary is zero. ” 


Exercise 15.1. THE BOUNDARY OF THE BOUNDARY OF A 4-SIMPLEX 

In the analysis of the development in time of a geometry lacking all symmetry, when one 
is compelled to resort to a computer, one can, as one option, break up the 4-geometry into 
simplexes [four-dimensional analog of two-dimensional triangle, three-dimensional tetrahe¬ 
dron; vertices of “central simplex” conveniently considered to be at (/, x,y, z) = (0, 1, 1, 1), 
(0, 1, —1, —1), (0, -1, 1, —1), (0, -1,-1, 1), (5 1/2 ,0,0,0), for example], sufficiently nu¬ 
merous, and each sufficiently small, that the geometry inside each can be idealized as flat 
(Lorentzian), with all the curvature concentrated at the join between simplices (see discussion 
of dynamics of geometry via Regge calculus in Chapter 42). Determine (“give a mathematical 
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| Box 15.3 OTHER IDENTITIES SATISFIED BY THE CURVATURE 


(1) The source of gravitation is energy-momentum. 

( 2 ) Energy-momentum is expressed by stress- 
energy tensor (or by its dual) as a vector-valued 

3- form (“energy-momentum per unit 3-volume”). 

(3) This source is conserved (no creation in an 
elementary spacetime 4-cube). 

These principles form the background for the 
probe in this chapter of the Bianchi identities. That 
is why two otherwise most interesting identities 
[Allendoerfer and Weil (1943); Chem (1955,1962)] 
are dropped from attention. One deals with the 

4- form 

^ ~ 24^2 S 7 S l3S ^a/3 A (1) 

and the other with the 4 -form 

^ ~ 8 ir 2 |det | 1/2 ^ 12 ^ ® 30 ^ 13 ^ ®02 

+ S ? 10 A $ 23 ). 

( 2 ) 

Both quantities are built from the tensorial “cur¬ 
vature 2 -forms” 

®«y = jKypsdxfi A dx\ (3) 

The four-dimensional integral of either quantity 
over a four-dimensional region Q has a value that 
( 1 ) is a scalar, ( 2 ) is not identically equal to zero, 

(3) depends on the boundary of the region of 
spacetime over which the integral is extended, but 

(4) is independent of any changes made in the 


spacetime geometry interior to that surface (pro¬ 
vided that these changes neither abandon the con¬ 
tinuity nor change the connectivity of the 4 -geom¬ 
etry in that region). Property (1) kills any 
possibility of identifying the integral, a scalar, with 
energy-momentum, a 4-vector. Property (2) kills 
it for the purpose of a conservation law, because 
it implies a non-zero creation in O. 

Also omitted here is the Bel-Robinson tensor 
(see exercise 15.2), built bilinearly out of the cur¬ 
vature tensor, and other tensors for which see, e.g., 
Synge (1962). 

One or all of these quantities may be found 
someday to have important physical content. 

The integral of the 4-form r of equation (2) 
over the entire manifold gives a number, an inte¬ 
ger, the so-called Euler-Poincare characteristic of 
the manifold, whenever the integral and the inte¬ 
ger are well-defined. This result is the four-dimen¬ 
sional generalization of the Gauss-Bonnet integral, 
widely known in the context of two-dimensional 
geometry; 

( Riemannian scalar curvature \ 
invariant (value 2/a 2 J g 1/2 d 2 x. 

for a sphere of radius a) J 

This integral has the value 877 for any closed, 
oriented, two-dimensional manifold with the to¬ 
pology of a 2 -sphere, no matter how badly dis¬ 
torted; and the value 0 for any 2 -torus, again no 
matter how rippled and twisted; and other equally 
specific values for other topologies. 


J 


description of”) the boundary (three-dimensional) of such a simplex. Take one piece of this 
boundary and determine its boundary (two-dimensional). For one piece of this two-dimen¬ 
sional boundary, verify that there is at exactly one other place, and no more, in the book¬ 
keeping on the boundary of a boundary, another two-dimensional piece that cancels it 
(“facelessness” of the 3-boundary of the simplex). 
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Exercise 15.2. THE BEL-ROBINSON TENSOR [Bel (1958, 1959, 1962), 

Robinson (1959b), Sejnowski (1973); see also Pirani (1957) 


and Lichnerowicz (1962)]. 

Define the Bel-Robinson tensor by 

T aPyi = Ran.W + *WVi' 0 5 - 29 ) 

Show that in empty spacetime this tensor can be rewritten as 

T *fV = R ae W Vl' + R apS' R f>V ~ jgafigyi^ R ^- ( 15 . 30 a) 
Show also that in empty spacetime 

T a pyS:a = (15.30b) 

T apy S is symmetric and traceless on all pairs of indices. (15.30c) 


Discussion: It turns out that Einstein’s “canonical energy-momentum pseudotensor” (§20.3) 
for the gravitational field in empty spacetime has a second derivative which, in a Riemann- 
normal coordinate system, is 

^Ea/J ,ya = - *9 — J (15.31a) 

Here T af3yS is the completely symmetric Bel-Robinson tensor, and S a p ys is defined by 

S a fiyi = R a Spo R py P ° + R ay po R p S P ° + J g a pgy S R^ppR^ ■ (15.31b) 

S aj 3 y 5 appears in the empty-space covariant wave equation 

A R a/3yS = + R aPp< ,R y r + 2 - Wt') = °> < 1531c ) 

where A is a variant of the Lichnerowicz-de Rham wave operator [Lichnerowicz (1964)], 
when one rewrites this wave equation as 

n R ys — d ya TO [y 5] 


(15.3 Id) 



PART 


IV 


EINSTEIN'S GEOMETRIC 
THEORY OF GRAVITY 


Wherein the reader is seduced into marriage with the most elegant 
temptress of all — Geometrodynamics—and learns from her 
the magic potions and incantations that control the universe. 



CHAPTER 


EQUIVALENCE PRINCIPLE AND 
MEASUREMENT OF THE 
"GRAVITATIONAL FIELD" 


Rather than have one global frame with gravitational forces we 
have many local frames without gravitational forces. 

STEPHEN SCHUTZ (1966) 


y 


§16.1. OVERVIEW 

With the mathematics of curved spacetime now firmly in hand, one is tempted to 
rush headlong into a detailed study of Einstein’s field equations. But such temptation 

must be resisted for a short time more. To grasp the field equations fully, one must Purpose of this chapter 

first understand how the classical laws of physics change, or do not change, in the 

transition from flat spacetime to curved (§§16.2 and 16.3); and one must understand 

how the “gravitational field” (metric; covariant derivative; spacetime curvature;...) 

can be “measured” (§§16.4 and 16.5). 

§16.2. THE LAWS OF PHYSICS IN CURVED SPACETIME 

Wherever one is and whenever one probes, one finds that then and there one can 
introduce a local inertial frame in which all test particles move along straight lines. 

Moreover, this local inertial frame is also locally Lorentz: in it the velocity of light 
has its standard value, and light rays, like world lines of test particles, are straight. 

But physics is more, and the analysis of physics demands more than an account 
solely of the motions of test particles and light rays. What happens to Maxwell’s 
equations, the laws of hydrodynamics, the principles of atomic structure, and all 
the rest of physics under the influence of “powerful gravitational fields”? 
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Einstein's equivalence 
principle 


Equivalence principle as tool 
to mesh nongravitational 
laws with gravity 


The answer is simple: in any and every local Lorentz frame , anywhere and anytime 
in the universe, all the ( nongravitational ) laws of physics must take on their familiar 
special-relativistic forms. Equivalently: there is no way, by experiments confined to 
infinitesimally small regions of spacetime, to distinguish one local Lorentz frame 
in one region of spacetime from any other local Lorentz frame in the same or any 
other region. This is Einstein’s principle of equivalence in its strongest form—a 
principle that is compelling both philosophically and experimentally. (For the 
relevant experimental tests, see §38.6.) 

The principle of equivalence has great power. With it one can generalize all the 
special relativistic laws of physics to curved spacetime. And the curvature need not 
be small. It may be as large as that in the center of a neutron star; as large as that 
at the edge of a black hole; arbitrarily large, in fact—or almost so. Only at the 
endpoint of gravitational collapse and in the initial instant of the “big bang,” i.e., 
only at “singularities of spacetime,” will there be a breakdown in the conditions 
needed for direct application of the equivalence principle (see §§28.3, 34.6, 43.3, 
43.4, and chapter 44). Everywhere else the equivalence principle acts as a tool to 
mesh all the nongravitational laws of physics with gravity. 

Example: Mesh the “law of local energy-momentum conservation,” ▼ * T = 0, 
with gravity. Solution: 

(1) The law in flat spacetime, written in abstract geometric form, reads 

▼ = 0. (16.1a) 

(2) Rewritten in a global Lorentz frame of flat spacetime, it reads 

= 0. (16.1b) 

(3) Application of equivalence principle gives same equation in local Lorentz frame 
of curved spacetime: 

y v — 0 at origin of local Lorentz frame. (16.1c) 

Because the connection coefficients vanish at the origin of the local Lorentz frame, 
this can be rewritten as 

= 0 at origin of local Lorentz frame. (16.Id) 

(4) The geometric law in curved spacetime, of which these are the local-Lorentz 
components, is 

V-r = 0; (16.le) 

and its component formulation in any reference frame reads 

T^ ; , = 0. (16.If) 

Compare the abstract geometric law (16.le) in curved spacetime with the corre¬ 
sponding law (16.1a) in flat spacetime. They are identical! That this is not an accident 
one can readily see by tracing out the above four-step argument for any other law 
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of physics (e.g., Maxwell’s equation ▼ • F = 47 tJ). The laws of physics, written in 
abstract geometric form, differ in no way whatsoever between curved spacetime and 
flat spacetime; this is guaranteed by, and in fact is a mere rewording of, the equiva¬ 
lence principle. 

Compare the component version of the law ▼ • T = 0, as written in an arbitrary 
frame in curved spacetime [equation (16.If)], with the component version in a global 
Lorentz frame of flat spacetime [equation (16.1b)]. They differ in only one way: the 
comma (partial derivative; flat-spacetime gradient) is replaced by a semicolon 
(covariant derivative; curved-spacetime gradient). This procedure for rewriting the 
equations has universal application. The laws of physics, written in component form, 
change on passage from flat spacetime to curved spacetime by a mere replacement of 
all commas by semicolons (no change at all physically or geometrically; change due 
only to switch in reference frame from Lorentz to non-Lorentz!). This statement, 
like the nonchanging of abstract geometric laws, is nothing but a rephrased version 
of the equivalence principle. 

The transition in formalism from flat spacetime to curved spacetime is a trivial 
process when performed as outlined above. But it is nontrivial in its implications. 
It meshes gravity with all the laws of physics. Gravity enters in an essential way 
through the covariant derivative of curved spacetime, as one sees clearly in the 
following exercise. 


Exercise 16.1. HYDRODYNAMICS IN A WEAK GRAVITATIONAL FIELD 

(a) In §18.4 it will be shown that for a nearly Newtonian system, analyzed in an appropriate 
nearly global Lorentz coordinate system, the metric has the form 

ds 2 = -(1 + 20) dt 2 + (1 - 2 0){dx 2 + dy 2 + dz 2 ) (16.2a) 

where 0 is the Newtonian potential (-1 < 0 < 0). Consider a nearly Newtonian perfect 
fluid [stress-energy tensor 

T afi = (p + p)u a u& 4- pg a/ *> P < P; (16.2b) 

see Box 5.1 and §5.10] moving in such a spacetime with ordinary velocity 

v* == dxfdt < 1 . (16.2c) 

Show that the equations T^. v = 0 for this system reduce to the familiar Newtonian law 
of mass conservation, and the Newtonian equation of motion for a fluid in a gravitational 
field: 


dp _ 

dt 9 dx* ’ 


dv i 


d0 


9 dt P dx’ 


cp_ 

dx*' 


(16.3a) 


where d/dt is the time derivative comoving with the matter 


A = ± + vi J-. 

dt dt ox } 


"Comma-goes-to-semicolon" 

rule 


EXERCISES 


(16.3b) 
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(b) Use these equations to calculate the pressure gradient in the Earth’s atmosphere as 
a function of temperature and pressure. In the calculation, use the nonrelativistic relation 
p = n M p M , where n M is the number density of molecules and is the mean rest mass per 
molecule; use the ideal-gas equation of state 

p = n M kT (k = Boltzmann’s constant); 

and use the spherically symmetric form, 0 = — M/r , for the Earth’s Newtonian potential. 
If the pressure at sea level is 1.01 x 10 6 dynes/cm 2 , what, approximately, is the pressure 
on top of Mount Everest (altitude 8,840 meters)? (Make a reasonable assumption about the 
temperature distribution of the atmosphere.) 

Exercise 16.2. WORLD LINES OF PHOTONS 

Show that in fiat spacetime the conservation law for the 4-momentum of a freely moving 
photon can be written 

V= °* ( 16 - 4a > 

According to the equivalence principle, this equation must be true also in curved spacetime. 
Show that this means photons move along null geodesics of curved spacetime with affine 
parameter A related to 4-momentum by 


p = d/dX (16.4b) 

In exercise 18.6 this result will be used to calculate the deflection of light by the sun. 


§16.3. FACTOR-ORDERING PROBLEMS 
IN THE EQUIVALENCE PRINCIPLE 

Factor-ordering problems and 
coupling to curvature 


+ A V = 477/“, 

then its transition (“comma goes to semicolon”) reads 

+ A*.« = 4t tJ*. (16.5) 

However, if the flat-spacetime equation is written with two of its partial derivatives 
interchanged 


On occasion in applying the equivalence principle to get from physics in flat space- 
time to physics in curved spacetime one encounters “factor-ordering problems” 
analogous to those that beset the transition from classical mechanics to quantum 
mechanics.* Example: How is the equation (3.56) for the vector potential of electro¬ 
dynamics to be translated into curved spacetime? If the flat-spacetime equation is 
written 


-A a >\ + = 4t7/“, 


*For a discussion of quantum-mechanical factor-ordering problems, see, e.g., Merzbacher (1961), pp. 
138-39 and 334-35; also Pauli (1934). 
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then its translation reads 


-A 

which can be rewritten 


- A = 477/“. (16.5') 

(Ricci tensor appears as result of interchanging covariant derivatives; see exercise 
16.3.) Which equation is correct—(16.5) or (16.5')? This question is nontrivial, just 
as the analogous factor-ordering problems of quantum theory are nontrivial. For 
rules-of-thumb that resolve this and most factor-ordering problems, see Box 16.1. 
These rules tell one that (16.5') is correct and (16.5) is wrong (see Box 16.1 and 
§22.4). 


Exercise 16.3. NONCOMMUTATION OF COVARIANT DERIVATIVES 

Let B be a vector field and S be a second-rank tensor field. Show that 

(16.6a) 

SV-'afi = S* 9 ;fia + ( 16 . 6 b) 

From equation (16.6a), show that 

= B (16.6c) 

[Hint for Track-1 calculation: Work in a local Lorentz frame, where = 0 but ^ 0; 

expand the lefthand side in terms of Christoffel symbols and partial derivatives; and use 
equation (8.44) for the Riemann tensor. An alternative Track-2 calculation notices that 
▼p is not linear in e a , and that are not its components; but, rather, that 

^ = (16.7) 

l—{Third-rank tensor] 


The calculation then proceeds as follows: 

(a/*, V,, V a B> = (tv 11 , Vp(e a • Vfi)> 

= <»*,(T /S . b ).T* + * b .(T < ,T*)> 

= (w»,r’ afl e r • VS + VVS(..., ® a , •„» 
= 


Consequently 

B'rt, - Br 0a = <«/*, [V„, TJS> - **.,(/•'„ - r^ a ) 

= [T„, ▼.]«> - <a»*, T,, T J> 

= TJ - T [b eJ )S> = (ai", «(•„. eJS> 

- 

in agreement with (16.6a). Note: because of slight ambiguity in the abstract notation, one 
must think carefully about each step in the above calculation. Component notation, by 
contrast, is completely unambiguous.] 


EXERCISES 


(continued on page 392) 
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Box 16.1 FACTOR ORDERING AND CURVATURE COUPLING IN APPLICATIONS 
OF THE EQUIVALENCE PRINCIPLE 

The Problem 

In what order should derivatives be written when applying the “comma-goes-to- 
semicolon rule”? Interchanging derivatives makes no difference in flat spacetime, 
but in curved spacetime it produces terms that couple to curvature, e.g., 2B a . [y/i] = 
B a . yJ3 — B a .p y = R a ^ y B^ for any vector field (see exercise 16.3). Hence, the 
problem can be restated: When must the comma-goes-to-semicolon rule be augmented 
by terms that couple to curvature? 


The Solution 

There is no solution in general, but in most cases the following types of mathematical 

and physical reasoning resolve the problem unambiguously. 

A. Mathematically , curvature terms almost always arise from the noncommutation of 

covariant derivatives . Consequently, one needs to worry about curvature terms in 
any equation that contains a double covariant derivative (e.g., — f — 

4tt J a )\ or in any equation whose derivation from more fundamental laws 
involves double covariant derivatives (e.g. V U S = 0 in Example B(3) below). But 
one can ignore curvature coupling everywhere else (e.g., in Maxwell’s first-order 
equations). 

B. Coupling to curvature can surely not occur without some physical reason . Therefore, 
if one applies the comma-goes-to-semicolon rule only to physically measurable 
quantities (e.g., to the electromagnetic field, but not to the vector potential), one 
can “intuit” whether coupling to curvature is likely. Examples: 

(1) Local energy-momentum conservation. A coupling to curvature in the equa¬ 
tions T afi .£ — 0—e.g., replacing them by T af *.p = R a ^ yS T py u s ~would not 
make sense at all. In a local inertial frame such terms as R a fiyS T fiy u 6 would 
be interpreted as forces produced at a single point by curvature. But it should 
not be possible to feel curvature except over finite regions (geodesic deviation, 
etc.)! Put differently, the second derivatives of the gravitational potential 
(metric) can hardly produce net forces at a point; they should only produce 
tidal forces! 
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(2) Maxwell's equations for the electromagnetic field tensor. Here it would also 
be unnatural to introduce curvature terms. They would cause a breakdown 
in charge conservation, in the sense of termination of electric and magnetic 
field lines at points where there is curvature but no charge. To maintain 
charge conservation, one omits curvature coupling when one translates 
Maxwell’s equations (3.32) and (3.36) into curved spacetime: 

Fafi -,0 = F afi ;y + F py;a + F ya)f3 = 0. 

Moreover, one continues to regard F^ y as arising from a vector potential by 
the curved-spacetime translation of (3.54') 


r — a _ a 

M (IV — 

These points granted, one can verify that the second of Maxwell’s equations 
is automatically satisfied, and verify also that the first is satisfied if and only 
if 


- A + A*.« + R a ^ = 4tt J a . 


(See §22.4 for fuller discussion and derivation.) 

(3) Transport law for Earth's angular-momentum vector. If the Earth were in flat 
spacetime, like any other isolated body it would parallel-transport its angu¬ 
lar-momentum vector S along the straight world line of its center of mass, 
V U S = 0 (“conservation of angular momentum”). When translating this 
transport law into curved spacetime (where the Earth actually resides!), can 
one ignore curvature coupling? No! Spacetime curvatures due to the moon 
and sun produce tidal gravitational forces in the Earth; and because the Earth 
has an equatorial bulge, the tidal forces produce a nonzero net torque about 
the Earth’s center of mass. (In Newtonian language: the piece of bulge nearest 
the Moon gets pulled with greater force, and hence greater torque, than the 
piece of bulge farthest from the Moon.) Thus, in curved spacetime one expects 
a transport law of the form 


= (Riemann tensor) x (Earth’s quadrupole moment). 

This curvature-coupling torque produces a precession of the Earth’s rotation 
axis through a full circle in the plane of the ecliptic once every 26,000 years 
(“general precession”; “precession of the equinoxes”; discovered by Hip¬ 
parchus about 150 b.c.). The precise form of the curvature-coupling term 
is derived in exercise 16.4. 
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Exercise 16.4. PRECESSION OF THE EQUINOXES 

(a) Show that the transport law for the Earth’s intrinsic angular momentum vector S a in 
curved spacetime is 

06 . 8 ) 

Here d/dr - u is 4-velocity along the Earth’s world line; is the Earth’s “reduced quadru- 
pole moment” (trace-free part of second moment of mass distribution), defined in the Earth’s 
local Lorentz frame by 

h» = % = 0, % = J P(x’x k - J m jk ) d 3 x- (16.9) 

and R 11 ^ is the Riemann curvature produced at the Earth’s location by the moon, sun, and 
planets. [Hint: Derive this result in the Earth’s local Lorentz frame, ignoring the spacetime 
curvature due to the Earth. (In this essentially Newtonian situation, curvature components 
due to the Earth, sun, moon, and planets superpose linearly; “gravity too weak to be 
nonlinear”). Integrate up the torque produced about the Earth's center of mass by tidal 
gravitational forces (“geodesic deviation”): 

acceleration at x\ relative to center of mass (x J = 0), \ 

produced by tidal gravitational forces but counterbalanced I 
l in part by Earth’s internal stresses / 

= () = ~R k 6l 6** ( see equation (1.8')]; 

V dt 2 /geodesic 
deviation 

( force per unit volume due to this\* .— mass density 

acceleration, relative to center l , 2 . 

of mass / =P^-=-P«V; 

/torque per unit volume relative\ = ] { _ pR i *t ); 

\to center of mass /1 1 


'"total torque about center' 


l of mass 


),- - / 


[ e 6l)L*’(-P /?k 6iaV)] d 3 x. 


Put this expression into a form involving equate it to dS'/dr, and then reexpress it in 
frame-independent, component notation. The result should be equation (16.8).] 

(b) Rewrite equation (16.8) in the Earth’s local Lorentz frame, using the equation 


Rhko = d 2 0/dx’ dx * 


for the components of Riemann in terms of the Newtonian gravitational potential. (Newto¬ 
nian approximation to Einstein theory. Track-2 readers have met this equation in Chapter 
12; track-one readers will meet it in §17.4.) 

(c) Calculate dS’/dt using Newton’s theory of gravity from the beginning. The answer 
should be identical to that obtained in part (b) using Einstein’s theory. 

(d) Idealizing the moon and sun as point masses, calculate the long-term effect of the 
spacetime curvatures that they produce upon the Earth’s rotation axis. Use the result of part 
(b), together with moderately accurate numerical values for the relevant solar-system param¬ 
eters. [Answer: The Earth’s rotation axis precesses relative to the axes of its local Lorentz 
frame (“precession of the equinoxes”; “general precession”); the precession period is 26,000 
years. The details of the calculation will be found in any textbook on celestial mechanics.] 
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§16.4. THE RODS AND CLOCKS USED TO MEASURE 
SPACE AND TIME INTERVALS 


Turn attention now from the laws of physics in the presence of gravity to the nature 
of the rods and clocks that must be used for measuring the length and time intervals 
appearing in those laws. 

One need not—and indeed must not!—postulate that proper length s is measured 
by a certain type of rod (e.g., platinum meter stick), or that proper time t is measured 
by a certain type of clock (e.g., hydrogen-maser clock). Rather, one must ask the 
laws of physics themselves what types of rods and clocks will do the job. Put 
differently, one defines an “ideal” rod or clock to be one which measures proper 
length as given by ds = (g a/3 dx a dx&) l/2 or proper time as given by dr = 
dx a dx@) 1/2 (the kind of clock to which one was led by physical arguments 
in §1.5). One must then determine the accuracy to which a given rod or clock is 
ideal under given circumstances by using the laws of physics to analyze its behavior. 

As an obvious example, consider a pendulum clock. If it is placed at rest on the 
Earth’s surface, if it is tiny enough that redshift effects from one end to the other 
and time dilation effects due to its swinging velocity are negligible, and if the 
accuracy one demands is small enough that time variations in the local gravitational 
acceleration due to Earth tides can be ignored, then the laws of physics report (Box 
16.2) that the pendulum clock is “ideal.” However, in any other context (e.g., on 
a rocket journey to the moon), a pendulum clock should be far from ideal. Wildly 
changing accelerations, or no acceleration at all, will make it worthless! 

Of greater interest are atomic and nuclear clocks of various sorts. Such a clock 
is analyzed most easily if it is freely falling. One can then study it in its local Lorentz 
rest frame, using the standard equations of quantum theory; and, of course, one 
will find that it measures proper time to within the'precision (At/t — 10“ 9 to 10“ 14 ) 
of the technology used in its construction. However, one rarely permits his atomic 
clock to fall freely. (The impact with the Earth’s surface can be expensive!) Never¬ 
theless, even when accelerated at “1 g” = 980 cm/sec 2 on the Earth’s surface, and 
even when accelerated at “2 g” in an airliner trying to avoid a midair collision 
(Box 16.3), an atomic clock—if built solidly—will still measure proper time dr = 
(““£«/? dx* dx^ 2 along its world line to nearly the same accurafcy as if it were freely 
falling. To discover this one can perform an experiment. Alternatively, one can 
analyze the clock in its own “proper reference frame” (§13.6), with Fermi-Walker- 
transported basis vectors, using the standard local Lorentz laws of quantum me¬ 
chanics as adapted to accelerated frames (local Lorentz laws plus an “inertial force,” 
which can be treated as due to a potential with a uniform gradient. 

Of course, any clock has a “breaking point,” beyond which it will cease to function 
properly (Box 16.3). But that breaking point depends entirely on the construction 
of the clock—and not at all on any “universal influence of acceleration on the march 
of time.” Velocity produces a universal time dilation; acceleration does not. 

The aging of the human body is governed by the same electromagnetic and 
quantum-mechanical laws as govern the periodicities and level transitions in atoms 
and molecules. Consequently, aging, like atomic processes, is tied to proper time 

{continued on page 396 ) 


"Ideal" rods and clocks 
defined 


How ideal are real clocks? 
(1) pendulum clocks 


(2) atomic clocks 


(3) human clocks 
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16. EQUIVALENCE PRINCIPLE AND 


MEASUREMENT OF GRAVITATIONAL FIELD 


_ 1fi 2 proof that a pendulum clock at rest 
B ox 16.2 P R n °° h F e T earTH , s surface IS IDEAL 

That is, a proof that it measures the interval dr = (-&/. dx * dx ^' 2 - 

A. Constraint on the Pendulum 

1. must be so small lha. 1. cannot ample to.he ~£%£ 


B. Coordinate System and Metric 

(,) 

this on faith until one reaches that point) in w ic 

ds 2 = _(l + 20) dt ' 2 + (1 - 20)(dx' 2 + dy ' 2 + dz% ‘ 

f- • 

where 0 is the Newtonian potential. /1 



4 - 


(2) r„, the origin of coordinates a. the penduhWs equiUbnun, post,ion, and 

orient the ttV-plane so the pendulum and ^ (ime 

(3) Renormalize the coordinates so they me p 
at the equilibrium position 

t - [1 + 2<P(0)] 1/2 t', x* = t 1 - 20{O)] 1/2 x’. 

Then near the pendulum (inhomogeneities in the held neglected!) 
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<P = <P(0) -f gz, g — “acceleration of gravity,” (1) 

ds 2 = -(1 + 2 gz) dt 2 + (1 — 2 gz)(dx 2 4- dy 2 4- dz 2 ). (2) 


C. Analysis of Pendulum Motion 


(1) Put the total mass m of the pendulum in its ball (negligible mass in its rod). 
Let its rod have proper length /. 

(2) Calculate the 4-acceleration a = V w i/ of the pendulum’s ball in terms of 
d 2 x a /dt 2 , using the velocity condition v <<< 1 and dt/dr zz 1: 

a x — d 2 x/dr 2 4- r x Q 0 (dt/dr ) 2 — d 2 x/dt 2 4- r x 00 = d 2 x/dt 2 , 

(3) 

a* = d 2 z/dr 2 4- r z 00 (dt/dr ) 2 = d 2 z/dt 2 + T* 00 = d 2 z/dt 2 4- g. 

(3) This 4-acceleration must be produced by the forces in the rod, and must be 
directed up the rod so that (for x < / so g > d 2 z/dt 2 ) 

d 2 x/dt 2 — a x — — ( x/l)a z — —(g/l)x. (4) 

(4) Solve this differential equation to obtain 

x = x 0 cos (tVg/T). (5) 

(5) Thus conclude that the pendulum is periodic in /, which is proper time at 
the ball’s equilibrium position (see equation 2). This means that the pendulum 
is an ideal clock when it is at rest on the Earth's surface. 

Note: The above analysis ignores the Earth’s rotation; for an alternative analysis 
including rotation, one can perform a similar calculation at the origin of the pendu¬ 
lum’s “proper reference frame” [§13.6; line element (13.71)]. The answer is the same; 
but now “g” is a superposition of the “gravitational acceleration,” and the “centrifu¬ 
gal acceleration produced by Earth’s rotation.” 
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Box 16.3 RESPONSE OF CLOCKS TO ACCELERATION AND TO TIDAL GRAVITATIONAL FORCES 


Consider an atomic clock with frequency stabilized 
by some atomic or molecular process—for exam¬ 
ple, fixed by the “umbrella vibrations” of ammo¬ 
nia molecules [see Feynman et. ai (1964)]. When 
subjected to sufficiently strong accelerations or 
tidal forces, such a clock will cease to measure 
proper time with its normal precision. Two types 
of effects could lead to such departures from 
“ideality”: 

A. Influence of the acceleration or tidal force on the 
atomic process that provides the frequency stability . 
Example: If tidal forces are significant over dis¬ 
tances of a few angstroms (e.g., near a spacetime 
“singularity” terminating gravitational collapse), 
then they can and will deform an ammonia mole¬ 
cule and destroy the regularity of its umbrella 
vibrations, thereby making useless any ammonia 
atomic clock, no matter how constructed. Simi¬ 
larly, if an ammonia molecule is subjected to ac¬ 
celerations of magnitude comparable to its internal 
atomic accelerations {a ~ 10 12 “g” — lO 10 cm/ 
sec 2 ), which change in times of the order of the 
“umbrella” vibration period, then it must cease to 
vibrate regularly, and any clock based on its vibra¬ 
tions must fail. Such limits of principle on the 
ideality of a clock will vary from one atomic proc¬ 
ess to another. However, they are far from being 
a limiting factor on clock construction in 1973. 
Much more important today is: 


which produces the periodic signal output, must 
be locked to the regulating atomic process in some 
way. The lock will be disturbed by moderate ac¬ 
celerations. The toughest task for the manufac¬ 
turer of aircraft clocks is to guarantee that precise 
locking will be maintained, even when the aircraft 
is maneuvering desperately to avoid collision with 
another aircraft or with a missile. In 1972 a solidly 
built rubidium clock will maintain its lock, with 
no apparent degradation of stability 

[At/t ~~ 10" 12 (1 sec/f) 1/2 for 1 sec < t < 10 3 sec] 

under steady-state accelerations up to 50 “g” or 
more. But, because of the finite bandwidth of the 
lock loop (typically Av~~ 20 to 50 Hz), sudden 
changes in acceleration will temporarily break the 
lock, degrading the clock stability to that of the 
unlocked crystal oscillator—for which an accelera¬ 
tion a produces a change in frequency of about 
(a/l “ g ”) X 10” 9 . But the lock to the rubidium 
standard is restored quickly (St — 1 /Av), bring¬ 
ing the clock back to its normal highly stable 
performance.* 

Tidal forces are so small in the solar system that 
the clock manufacturer can ignore them. However, 
a 1973 atomic clock, subjected to the tidal acceler¬ 
ations near a spacetime singularity, should break 
the “lock” to its atomic process long before the 
tidal forces become strong enough to influence the 
atomic process itself. 


B. Influence of the acceleration or tidal force on the * For ^ information on the response of rubidium clocks 

macroscopic structure of the clock—a Structure die - to acceleration, we thank H. P. Stratemeyer of General Radio 
tated by current technology . The crystal oscillator, Company, Concord, Massachusetts. 


as governed by the metric—though, of course, it is also tied to other things, such 
as cigarette smoking. 

Ideal rods and clocks In principle, one can build ideal rods and clocks from the geodesic world lines 

constructed from geodesic of freely falling test particles and photons. (See Box 16.4.) In other words, spacetime 

world lines has it$ own rods and clocks bu j] t in to i tse lf, even when matter and nongravitational 

fields are absent! 
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Box 16.4 IDEAL RODS AND CLOCKS BUILT FROM GEODESIC WORLD LINES* 


The Standard Interval A specific timelike inter¬ 
val—the interval between two particular neigh¬ 
boring events d and is chosen as the standard 
interval, and is assigned unit length. It is used to 
calibrate a huge set of geodesic clocks that pass 
through d . 


Each geodesic clock is constructed and calibrated 

as follows: 

(1) A timelike geodesic dG (path of freely falling 
particle) passes through d . 

(2) A neighboring world line, everywhere parallel 
to dG (and thus not a geodesic), is constructed 
by the method of Schild’s ladder (Box 10.2), 
which relies only on geodesics. 

(3) Light rays (null geodesics) bounce back and 
forth between these parallel world lines; each 
round trip constitutes one “tick.” 

(4) The proper time lapse, t 0 , between ticks is 
related to the interval d® by 

-1 = m? = -(JVoX^To), 

where N x and N 2 are the number of ticks be¬ 
tween the events shown in the diagrams. 
[Proof: see diagram at right.] 


Spacetime is tilled with such geodesic clocks. 
Those that pass through d are calibrated as above 
against the standard interval d l J), and are used 
subsequently to calibrate all other clocks they 
meet. 




In local Lorentz rest frame of geodesic clock: 


(JV 1 t 0 XA/’.t 0 ) = (I - x)(l + x) 

= t 2 - x 2 = -{a$f 


•Based on Marzke and Wheeler (1964). 
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Box 16.4 ( continued ) 

Any interval 92 along the world line of a geodesic 
clock can be measured by the same method as was 
used in calibration. The interval 92 can be time¬ 
like, spacelike, or null; its squared length in all 
three cases will be 

(<?2Y = -(N 3 t 0 )(N 4 t 0 ) 


To achieve a precision of measurement good to 
one part in N , where N is some large number, take 
two precautions: 

(1) Demand that the intervals <X C $ and 92 be 
sufficiently small compared to the scale of cur¬ 
vature of spacetime; or specifically, 

R UB \d®) 2 4 1 /N 

and 

R^\92 ) 2 < 1/A, 

where R UB) and R iPQ) are the largest relevant 
components of the curvature tensor in the two 
regions in question. 

(2) Demand that the time scale, t 0 , of the geodesic 
clocks employed be small compared to d9> and 
92 individually; thus, 

r 0 < aa/N, 
t 0 < 92/N. 

The Einstein principle that spacetime is de¬ 
scribed by Riemannian geometry exposes itself to 
destruction by a “thousand” tests. Thus, from the 
fiducial interval, d$, to the interval under meas¬ 
urement, 92, there are a “score” of routes of in¬ 
tercomparison, all of which must give the same 
value for the ratio 92/(19 . Moreover, one can 
easily select out “fifty” intervals 92 to which the 
same kind of test can be applied. Such tests are 
not all items for the future. 

Some 5 x 10 9 years ago, electrons arrived by 
different routes at a common location, a given 
atom of iron in the core of the earth. This iron 
atom does not collapse. The Pauli principle of 
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exclusion keeps the electrons from all falling into 
the K-orbit. The Pauli principle would not apply 
if the electrons were not identical or nearly so. 
From this circumstance it would appear possible 
to draw an important conclusion (Marzke and 
Wheeler). With each electron is associated a 
standard length, its Compton wavelength, fi/mc. 
If these lengths had started different, or changed 
by different amounts along the different routes, 
and if the resulting difference in properties were 
as great as one part in 

—(5 X 10 9 yr) X (3 X 10 7 sec/yr) 

X (5 X IO 18 rev/sec) — 10 36 , 

by now this difference would have shown up, the 
varied electrons would have fallen into the K- 
orbit, and the earth would have collapsed, contrary 
to observation. 

The Marzke-Wheeler construction expresses an 
arbitrary small interval anywhere in space- 
time, in terms of the fiducial interval an inter¬ 
val which itself may be taken for definiteness to 
be the “geometrodynamic standard centimeter” of 
§1.5. This construction thus gives a vivid meaning 
to the idea of Riemannian geometry. 

The M-W construction makes no appeal what- 

v___ 



soever to rods and clocks of atomic constitution. 
This circumstance is significant for the following 
reasons. The length of the usual platinum meter 
stick is some multiple, N x {fi 2 /me 2 ), of the Bohr 
atomic radius. Similarly, the wavelength of the 
Kr 86 line is some multiple, A T 2 (tic/e 2 )(ti 2 /me*), of 
a second basic length that depends on the atomic 
constants in quite a different way. Thus, if there 
is any change with time in the dimensionless ratio 
fic/e 2 = 137.038, one or the other or both of these 
atomic standards of length must get out of kilter 
with the geometrodynamic standard centimeter. In 
this case, general relativity says, “Stick to the geo¬ 
metrodynamic standard centimeter.” 

Hermann Weyl at first thought that one could 
carry out the comparison of lengths by light rays 
alone, but H. A. Lorentz pointed out that one can 
dispense with the geodesics neither of test particles 
nor of light rays in the measurement process, the 
construction for which, however, neither Weyl nor 
Lorentz supplied [literature in Marzke and 
Wheeler (1964)]. Ehlers, Pirani, and Schild (1972) 
have given a deeper analysis of the separate parts 
played in the measurement process by the affine 
connection, by the conformal part of the metric, 
and by the full metric. 


§16.5. THE MEASUREMENT OF THE GRAVITATIONAL FIELD 

“I know how to measure the electromagnetic field using test charges; what is the 
analogous procedure for measuring the gravitational field?” This question has, at 
the same time, many answers and none. 

It has no answers because nowhere has a precise definition of the term “gravita¬ 
tional field” been given—nor will one be given. Many different mathematical entities The many faces of gravity, 
are associated with gravitation: the metric, the Riemann curvature tensor, the Ricci and how one meas ures them 
curvature tensor, the curvature scalar, the covariant derivative, the connection 
coefficients, etc. Each of these plays an important role in gravitation theory, and 
none is so much more central than the others that it deserves the name “gravitational 
field.” Thus it is that throughout this book the terms “gravitational field” and 
“gravity” refer in a vague, collective sort of way to all of these entities. Another, 
equivalent term used for them is the “geometry of spacetime.” 
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To “measure the gravitational field,” then, means to “explore experimentally 
various properties of the spacetime geometry.” One makes different kinds of meas¬ 
urements, depending on which geometric property of spacetime one is interested 
in. However, all such measurements must involve a scrutiny of the effects of the 
spacetime geometry (i.e., of gravity) on particles, on matter, or on nongravitational 
fields. 

For example, to “measure” the metric near a given event, one typically lays out 
a latticework of rods and clocks (local orthonormal frame, small enough that curva¬ 
ture effects are negligible), and uses it to determine the interval between neighboring 
events. To measure the Riemann curvature tensor near an event, one typically studies 
the geodesic deviation (relative accelerations) that curvature produces between the 
world lines of a variety of neighboring test particles; alternatively, one makes 
measurements with a “gravity gradiometer” (Box 16.5) if the curvature is static or 
slowly varying; or with a gravitational wave antenna (Chapter 37) if the curvature 
fluctuates rapidly. To study the large-scale curvature of spacetime, one examines 
large-scale effects of gravity, such as the orbits of planets and satellites, or the bending 
of light by the sun’s gravitational field. 

But whatever aspect of gravity one measures, and however one measures it, one 
is studying the geometry of spacetime. 


EXERCISE 


Exercise 16.5. GRAVITY GRADIOMETER 

The gravity gradiometer of Box 16.5 moves through curved spacetime along an accelerated 
world line. Calculate the amplitude and phase of oscillation of one arm of the gradiometer 
relative to the other. [Hint: Perform the calculation in the gradiometer’s “proper reference 
frame” (§13.6), with Fermi-Walker-transported basis vectors. Use, as the equation for the 
relative angular acceleration of the two arms, 


2 ml 2 (a + a/r 0 + co 0 2 a) = ^ 


/Driving torque produced by\ 
Riemann curvature / 


where 


2ml 2 = (moment of inertia of one arm), 

a - (angular displacement of one arm from equilibrium), 

y + 2a = (angular separation of the two arms), 

2mX 2 to 0 2 = (torsional spring constant), 

<o 0 = (angular frequency of free vibrations), 
r 0 = (decay time for free vibrations to damp out due to 
internal frictional forces). 


If ( is the vector from the center of mass of the gradiometer to mass 1, then one has 


j curvature-produced \ 
acceleration of mass 1 
relative to center of 
\ gradiometer / 


-(£U 

deviation 


= - R kwAb 


(continued on page 403) 
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Box 16.5 GRAVITY GRADIOMETER FOR MEASURING THE RIEMANN 
CURVATURE OF SPACETIME 


This gravity gradiometer was designed and built by Robert M. Forward and his 
colleagues at Hughes Research Laboratories, Malibu, California. It measures the 
Riemann curvature of spacetime produced by nearby masses. By flying a more 
advanced version of such a gradiometer in an airplane above the Earth’s surface, 
one should be able to measure subsurface mass variations due to varying geological 
structure. In an Earth-orbiting satellite, such a gradiometer could measure the 
gravitational multipole moments of the Earth. Technical details of the gradiometer 
are spelled out in the papers of Forward (1972), and Bell, Forward, and Williams 
(1970). The principles of its operation are outlined below. 


The gradiometer consists of two orthogonal arms 
with masses m on their ends, connected at their 
centers by a torsional spring. When the arms are 
twisted out of orthogonal alignment, they oscillate. 
A piezoelectric strain transducer is used to meas¬ 
ure the oscillation amplitude. 



Torsional spring 
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Box 16.5 (continued) 

When placed near an external mass, M, the gra- 
diometer experiences a torque: because of the gra¬ 
dient in the gravitational field of M (i.e., because 
of the spacetime curvature produced by M), the 
Newtonian forces F l and F 2 are greater than F 3 
and F 4 ; so a net torque pulls masses 1 and 2 to¬ 
ward each other, and 3 and 4 toward each other. 
[Note: the forces F» F 3 , F 4 depend on whether 
the gradiometer is in free fall (geodesic motion; 
V u u = 0) or is moving on an accelerated world 
line. But the net torque is unaffected by accelera¬ 
tion; acceleration produces equal Newtonian 
forces on all four masses, with zero net torque.] 




10/ = 0 


to/ = tt/4 


to/ = tt/2 


When in operation the gradiometer rotates with 
angular velocity co about its center. As it rotates, 
the torques on its arms oscillate: 

at co/ = 0 net torque pushes 1 and 2 toward each 
other; 

at co/ = 77/4 net torque is zero; 
at co/ = 77/2 net torque pushes 1 and 2 away from 
each other. 

The angular frequency of the oscillating torque is 
2co. If 2co is set equal to co 0 = (natural oscillation 
frequency of the arms), the oscillating torque 
drives the arms into resonant oscillation. The re¬ 
sulting oscillation amplitude, in the 1970 prototype 


of the gradiometer, was easily detectable for grav¬ 
ity gradients (Riemann curvatures) of magnitude 


> 0.0002 


2 (mass of earth) 


[ (radius of earth ) 3 J 

- 1 X IO - 30 cm" 2 


- .01 g/cm 3 


Riemann curvature produced by a two-kilometer 
high mountain, idealized as a two-kilometer high 
cube, at a distance of 15 kilometers. (Neglected 
in this idealization are isostacy and any lowering 
of density of Earth’s crust in regions of mountain 
_uplift.) 


For a mathematical analysis of the gradiometer, 
see exercise 16.5. 


j 
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( torque acting on mass l\ 
relative to center of = tr,k£](- mR km£l)- 
gradiometer j 

The torque on mass 4 is identical to this (replace ( by — f), so the total torque on arm 
1-4 is twice this. The components ^516 °f Riemann can be regarded as components of 
a 3 X 3 symmetric matrix. By appropriate orientation of the reference frame’s spatial axes 
(orientation along “principal axes” of i^oia), one can ma ke R^io diagonal at some initial 
moment of time 


R i6i6 # 0, R {6{6 # 0, R- 6 - o # 0, all others vanish. 

Assume that Riemann changes sufficiently slowly along the gradiometer’s world line that 
throughout the experiment R$%$ remains diagonal and constant. For simplicity, place the 
gradiometer in the x, /-plane, so it rotates about the £ axis with angular velocity w zz 

( Angle of arm l-4\ _ 
relative to x axis / 

Show that the resultant equation of oscillation is 

& 4- «/t 0 + « 0 2 “ = yCKiSia - R ioi&) sin 2ut\ 


and that the steady-state oscillations are 


a — Im 


2 (^£6x6 — RyOyo) 


12w 0 ( w o - 2w + i/2r 0 ) 


Thus, for fixed to (e.g., 2w = w 0 ), by measuring the amplitude and phase of the oscillations, 
one can learn the magnitude and sign of R — R$$. The other differences, R^q — 
R-Zb'zb an< ^ ~ can Pleasured by placing the gradiometer’s rotation axis along the 
x and / axes, respectively.] 



r 



HOW MASS-ENERGY 
GENERATES CURVATURE 


The physical world is represented as a four-dimensional 
continuum. If in this l adopt a Riemannian metric, and look for 
the simplest laws which such a metric can satisfy, I arrive at the 
relativistic gravitation theory of empty space. If I adopt in this 
space a vector field, or the antisymmetrical tensor field derived 
from it, and if I look for the simplest laws which such a field 
can satisfy, / arrive at the Maxwell equations for free space. 
... at any given moment, out of all conceivable constructions, 
a single one has always proved itself absolutely 
superior to all the rest . . . 

ALBERT EINSTEIN {1934, p. 18) 




§17.1. AUTOMATIC CONSERVATION OF THE SOURCE AS 
THE CENTRAL IDEA IN THE FORMULATION OF 
THE FIELD EQUATION 


This section derives the 
"Einstein field equation" 


p = u • T* u — u a T a pU&. (17.1) 

Therefore the stress-energy tensor T is the frame-independent “geometric object” 
that must act as the source of gravity. 


Turn now from the response of matter to geometry (motion of a neutral test particle 
on a geodesic; “comma-goes-to-semicolon rule” for the dynamics of matter and 
fields), and analyze the response of geometry to matter. 

Mass is the source of gravity. The density of mass-energy as measured by any 
observer with 4-velocity u is 
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This source, this geometric object, is not an arbitrary symmetric tensor. It must 
have zero divergence 

▼ •7 = 0, (172) 

because only so can the law of conservation of momentum-energy be upheld. 

Place this source, 7, on the righthand side of the equation for the generation of 
gravity. On the lefthand side will stand a geometric object that characterizes gravity. 
That object, like 7, must be a symmetric, divergence-free tensor; and if it is to 
characterize gravity, it must be built out of the geometry of spacetime and nothing 
but that geometry. Give this object the name “Einstein tensor” and denote it by 
G, so that the equation for the generation of gravity reads 

G = kT. (17.3) 

t_ proportionality factor;] 

_ to be evaluated later J 

(Do not assume that G is the same Einstein tensor as was encountered in Chapters 
8 , 13, 14, and 15; that will be proved below!) 

The vanishing of the divergence ▼ • G is not to be regarded as a consequence 
of ▼ • 7 = 0. Rather, the obedience of all matter and fields to the conservation law 
▼ • 7 = 0 is to be regarded (1) as a consequence of the way [equation (17.3)] they 
are wired into the geometry of spacetime, and therefore (2) as required and enforced 
by an automatic conservation law, or identity , that holds for any smooth Riemannian 
spacetime whatsoever, physical or not: ▼ *G=0. (See Chapter 15 for a fuller 
discussion and § 172 below for a fuller justification.) Accordingly, look for a symme¬ 
tric tensor G that is an “automatically conserved measure of the curvature of 
spacetime” in the following sense: 


(1) G vanishes when spacetime is fiat. 

(2) G is constructed from the Riemann curvature tensor and the metric, and from 
nothing else. 

(3) G is distinguished from other tensors which can be built from Riemann and 
g by the demands (i) that it be linear in Riemann , as befits any natural 
measure of curvature; (ii) that, like 7, it be symmetric and of second rank; 
and (iii) that it have an automatically vanishing divergence, 

▼ * G = 0. (17.4) 


Apart from a multiplicative constant, there is only one tensor (exercise 17.1) that 
satisfies these requirements of being an automatically conserved, second-rank tensor, 
linear in the curvature, and of vanishing when spacetime is flat. It is the Einstein 
curvature tensor, G, expressed in Chapter 8 in terms of the Ricci curvature tensor: 


G ^ 


- R°.. 


= R.. 


r 


Equation describing how 
matter generates gravity 
must have form G = kT, 
where 7 is stress-energy 
tensor 


Properties that the tensor G 
must have 


Proof that G must be the 
Einstein curvature tensor of 
Chapter 8 


(17.5) 
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This quantity was given vivid meaning in Chapter 15 as the “moment of rotation 
of the curvature” or, more simply, the “moment of rotation,” constructed by taking 
the double-dual 

G — *Riemann* (17.6a) 

of the Riemann curvature tensor, and then contracting this double dual, 

<v = «Vr- ( 17 - 6b ) 

In Chapter 15 the vanishing of V * G was shown to follow as a consequence of the 
elementary principle of topology that “the boundary of a boundary is zero.” 

To evaluate the proportionality constant k in the “Einstein field equation” G = kT.\ 
one can compare with the well-tested Newtonian theory of gravity. To facilitate the 
comparison, examine the relative acceleration (geodesic deviation) of particles that 
fall down a pipe inserted into an idealized Earth of uniform density p (Figure 1.12). 
According to Newton, the relative acceleration is governed by the density; according 
to Einstein, it is governed by the Riemann curvature of spacetime. Direct comparison 
of the Newtonian and Einstein predictions using Newtonian coordinates (where 
Su.v ~ V^ y ) reveals the relation 

*00 — * a 0a0 = (17.7) 

(See §1.7 for details of the derivation; see Chapter 12 for extensive discussion of 
Newtonian gravity using this equation.) When applied to the Earth’s interior, the 
Einstein field equation G = kT must thus reduce to R 00 — 4^p. In component form, 
the Einstein field equation reads 

<v = R V - J8*V R = kJ ,\v- 


Evaluation of k (in G = kT) 
by comparing with 
Newtonian theory of gravity 


Its trace reads 


-R = R - 2R = kT. 


In consequence, it predicts 

*oo = "^Soo* “Too — —k(21 "qq — goo*) 

-1 

= y <C[2 T 00 + (r° o + Pfl 

= y<c (r 00 + p,x 



which reduces to 


(17.8) 
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when one recalk that for the Earth—as for any nearly Newtonian system—the 
stresses T* are very small compared to the density of mass-energy 7 ^ = p: 


| T jk \ pressure 

Too density 


dp_ 

dp 


• (velocity of sound ) 2 < 1 . 


The equation = 4i rp (derived by comparing relative accelerations in the Newton 
and Einstein theories) and the equation Rqq — £/cp (derived directly from the Einstein 
field equation) can agree only if the proportionality constant ic is 877 . 

Thus, the Einstein field equation, describing the generation of curvature by 
mass-energy, must read 

G - 8irT. (17.9) 

The lefthand side (“curvature”) has units cm -2 , since a curvature tensor is a linear 
machine into which one inserts a displacement (units: cm) and from which one gets 
a relative acceleration (units: cm/sec 2 — cm/cm 2 — cm' 1 ). The right-hand side also 
has dimensions cm" 2 , since it is a linear machine into which one inserts 4-velocity 
(dimensionless) and from which one gets mass density [units: g/ cm 3 — cm/cm 3 ~~ 
cm" 2 ; recall from equation (1.12) and Box 1.8 that Ig = (Ig) x ( G/c 2 ) = (Ig) x 
(0.742 x 10 " 28 cm/g) = 0.742 x 10 " 28 cm]. ~ ^ " 

This concludes the simplest derivation of Einstein's field equation that has come 
to hand, and establishes its correspondence with the Newtonian theory of gravity 
under Newtonian conditions. That correspondence had to be worked out to deter¬ 
mine the factor k = on the righthand side of (17.9). Apart from this factor, the 
central point in the derivation was the demand for, and the existence of, a unique 
tensorial measure of curvature G with an identically vanishing divergence. 


Exercise 17.1. UNIQUENESS OF THE EINSTEIN TENSOR 

(a) Show that the most general second-rank, symmetric tensor constructable from Riemann 
and g, and linear in Riemann , is 


aR «,s + bRg at 3 + Ag a/3 (17 101 

= aRt ‘anfi + + 4g a/3 , 

where a , b, and A are constants. 

(b) Show that this tensor has an automatically vanishing divergence if and only if 
b — — \a. 

(c) Show that, in addition, this tensor vanishes in flat spacetime, if and only if A = 0—i.e., 
if and only if it is a multiple of the Einstein tensor G a/3 = R a(i — \Rg a ^ (Do not bother 
to prove that V • G = 0; assume it as a result from Chapter 13.) 

Exercise 17.2. NO TENSOR CONSTRUCTABLE FROM FIRST 
DERIVATIVES OF METRIC 

Show that there exists no tensor with components constructable from the ten metric coeffi¬ 
cients and their 40 first derivatives g ^^—except the metric tensor g ; and products of 
it with itself; e.g., g ® g. [Hint: Assume there exists some other such tensor, and examine 
its hypothesized components in a local inertial frame.] 




Result: "Einstein field 
equation" G = 87jT 


EXERCISES 
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Einstein field equation 
governs the evolution 
spacetime geometry 


Exercise 17.3. RIEMANN AS THE ONLY TENSOR CONSTRUCTABLE 

FROM, AND LINEAR IN SECOND DERIVATIVES OF METRIC 

Show that (1) Riemann, (2) g, and (3) tensors (e.g., Ricci) formed from Riemann and 
g but linear in Riemann , are the only tensors that (a) are constructable from the ten g Q( g, 
the 40 g n/3 and the 100 g aj8(<) „ and (b) are linear in the g a$ ^ r . [Him: Assume there exists 
some other such tensor, and examine its hypothesized components in an orthonormal, 
Riemann-normal coordinate system. Use equations (11.30) to (11.32).] 

Exercise 17.4. UNIQUENESS OF THE EINSTEIN TENSOR 

(a) Show that the Einstein tensor, G a/3 = R a/3 - bRg afi , is the only second-rank, symmetric 
tensor that ( 1) has components constructable solely from g a0 , g a/3 >(i , g a /3,^'- (2) has components 
linear in g a/3 fll ,: (3) has an automatically vanishing divergence, ▼ ■ G = 0; and (4) vanishes 
in flat spacetime. This provides added motivation for choosing the Einstein tensor as the 
left side of the field equation G = SttT. 

(b) Show that, when condition (4) is dropped, the most general tensor is G + Ag, where 
A is a constant. (See §17.3 for the significance of this.) 


§17.2. AUTOMATIC CONSERVATION OF THE SOURCE: 

A DYNAMIC NECESSITY 

The answer G = 8 vT is now on hand; but what is the question? An equation has 
been derived that connects the Einstein-Cartan “moment of rotation” G with the 
stress-energy tensor T, but what is the purpose for which one wants this equation 
in the first place? If geometry tells matter how to move, and matter tells geometry 
how to curve, does one not have in one’s hands a Gordian knot? And how then 
can one ever untie it? 

The story is no different in character for the dynamics of geometry than it is for 
other branches of dynamics. To predict the future, one must first specify, on an 
“initial” hypersurface of “simultaneity,” the position and velocity of every particle, 
and the amplitude and time-rate of change of every field that obeys a second-order 
wave equation. One can then evolve the particles and fields forward in time by means 
of their dynamic equations. Similarly, one must give information about the geometry 
and its first time-rate of change on the “initial” hypersurface if the Einstein field 
equation is to be able to predict completely and deterministically the future time- 
development of the entire system, particles plus fields plus geometry. (See Chap¬ 
ter 21 for details.) 

If a prediction is to be made of the geometry, how much information has to be 
supplied for this purpose? The geometry of spacetime is described by the metric 

ds 2 = g^dx-dx/ 3 ', 

that is, by the ten functions g a/} of location 9 in spacetime. It might then seem that 
ten functions must be predicted; and, if so, that one would need for the task ten 
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equations. Not so. Introduce a new set of coordinates x ** by way of the coordinate 
transformations 

x a — x a (x^), 

and find the same spacetime geometry, with all the same bumps, rills, and waves, 
described by an entirely new set of metric coefficients gap(&)- 

It would transgress the power as well as the duty of Einstein’s “geometrodynamic 
law” G = 8 nT if, out of the appropriate data on the “initial-value hypersurface,” 
it were to provide a way to calculate, on out into the future, values for all ten 
functions To predict all ten functions would presuppose a choice of the 

coordinates; and to make a choice among coordinate systems is exactly what the 
geometrodynamic law cannot and must not have the power to do. That choice resides 
of necessity in the man who studies the geometry, not in the Nature that makes the 
geometry. The geometry in and by itself, like an automobile fender in and by itself, 
is free of coordinates. The coordinates are the work of man. 

It follows that the ten components G aP — of the field equation must not 

determine completely and uniquely all ten components g of the metric. On the 
contrary, G a/} = T a(i must place only six independent constraints on the ten 
leaving four arbitrary functions to be adjusted by man’s specialization of the four 
coordinate functions x a (^). 

How can this be so? How can the ten equations G afi = %nT a p be in reality only 
six? Answer: by virtue of the “automatic conservation of the source.” More specifi¬ 
cally, the identity G ap ;fi = 0 guarantees that the ten equations G afi = 8 tt T a(i contain 
the four “conservation laws” T a & ;y3 = 0. These four conservation laws—along with 
other equations—govern the evolution of the source. They do not constrain in any 
way the evolution of the geometry. The geometry is constrained only by the six 
remaining, independent equations in G a p — 

When viewed in this way, the “automatic conservation of the source” is not merely 
a philosophically attractive principle. It is, in fact, an absolute dynamic necessity. 
Without “automatic conservation of the source,” the ten G af3 = 8would place 
ten constraints on the ten g afi , thus fixing the coordinate system as well as the 
geometry. With “automatic conservation,” the ten G a/3 = 8flT a/3 place four con¬ 
straints (local conservation of energy and momentum) on the source, and six con¬ 
straints on the ten g afi , leaving four of the g afi to be adjusted by adjustment of the 
coordinate system. 


§17.3. COSMOLOGICAL CONSTANT 

In 1915, when Einstein developed his general relativity theory, the permanence of 
the universe was a fixed item of belief in Western philosophy. “The heavens endure 
from everlasting to everlasting.” Thus, it disturbed Einstein greatly to discover 
(Chapter 27) that his geometrodynamic law G = 8tt 7" predicts a «6>«permanent 
universe; a dynamic universe; a universe that originated in a “big-bang” explosion, 


G = QttT must determine 
only six metric components; 
the other four are adjustable 
by changes of .coordinates 


G = 877 T leaves four 
components of metric free 
because it satisfies four 
identities 

0 == V*G = v*r 
("automatic conservation of 
source") 


Einstein's motivation for 
introducing a cosmological 
constant 
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or will be destroyed eventually by contraction to infinite density, or both. Faced 
with this contradiction between his theory and the firm philosophical beliefs of the 
day, Einstein weakened; he modified his theory. 

The only conceivable modification that does not alter vastly the structure of the 
theory is to change the lefthand side of the geometrodynamic law G = 87 tT. Recall 
that the lefthand side is forced to be the Einstein tensor, G at 3 = R aP ~ b >' 

three assumptions: 

(1) G vanishes when spacetime is flat. 

(2) G is constructed from the Riemann curvature tensor and the metric and 
nothing else. 

(3) G is distinguished from other tensors that can be built from Riemann and 
g by the demands (1) that it be linear in Riemann , as befits any natural 
measure of curvature; (2) that, like 7, it be symmetric and of second rank; 
and (3) that it have an automatically vanishing divergence, V • G = 0. 


Denote a new, modified lefthand side by “G”, with quotation marks to avoid 
confusion with the standard Einstein tensor. To abandon V • “G” = 0 is impossible 
on dynamic grounds (see §17.2), To change the symmetry or rank of “G” is impossi¬ 
ble on mathematical grounds, since “G” must be equated to 7. To let “G” be 
nonlinear in Riemann would vastly complicate the theory. To construct “G” from 
anything except Riemann and g would make “G” no longer a measure of spacetime 
geometry and would thus violate the spirit of the theory. After much anguish, one 
concludes that the assumption which one might drop with least damage to the beauty 
and spirit of the theory is assumption (1), that “G” ^vanish when spacetime is flat. 
But even dropping this assumption is painful: (1) although “G” might still be in 
some sense a measure of geometry, it can no longer be a measure of curvature; 
and ( 2 ) flat, empty spacetime will no longer be compatible with the geometrodynamic 
law (G ^ 0 in fiat, empty space, where 7 — 0). Nevertheless, these consequences 
were less painful to Einstein than a dynamic universe. 

The only tensor that satisfies conditions (2) and (3) [with (1) abandoned] is the 
Einstein tensor plus a multiple of the metric: 


Einstein's field equation with 
the cosmological constant 


Why Einstein abandoned the 
cosmological constant 


(exercise 17.1; see also exercise 17.4). Thus was Einstein (1917) led to his modified 
field equation 

G 4* Ag = 877-7 (17.11) 

The constant A he called.the “cosmological constant”; it has dimensions cm -2 . 

The modified field equation, by contrast with the original, admits a static, un¬ 
changing universe as one particular solution (see Box 27.5). For this reason, Einstein 
in 1917 was inclined to place his faith in the modified equation. But thirteen years 
later Hubble discovered the expansion of the universe. No longer was the cosmo¬ 
logical constant necessary. Einstein, calling the cosmological constant “the biggest 
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blunder of my life,” abandoned it and returned to his original geometrodynamic 
law, G = 8tt T [Einstein (1970)]. 

A great mistake A was indeed!—not least because, had Einstein stuck by his 
original equation, he could have claimed the expansion of the universe as the most 
triumphant prediction of his theory of gravity. 

A mischievous genie, once let out of a bottle, is not easily reconfined. Many 
workers in cosmology are unwilling to abandon the cosmological constant. They insist 
that it be abandoned only after cosmological observations reveal it to be negligibly 
small. As a modern-day motivation for retaining the cosmological constant, one 
sometimes rewrites the modified field equation in the form 

G = 87T[r + r< VAC) ], (17.12a) 

T<vac>= -(A/iir)g (17.12b) 



A modern-day motivation for 
the cosmological constant: 
vacuum polarization 


and interprets as a stress-energy tensor associated with the vacuum. This 

viewpoint speculates [Zel’dovich (1967)] that the vacuum polarization of quantum 
field theory endows the vacuum with the nonzero stress-energy tensor (17.12b), which 
is completely unobservable except by its gravitational effects. Unfortunately, today’s 
quantum field theory is too primitive to allow a calculation of 7 ^ VAC) from first 
principles. (See, however, exercise 17.5.) 

The mass-energy density that the cosmological constant attributes to the vacuum 
is 


p (VAC> _ pvAC) _ + 4 / 8 ^ (17.13) 

If A 0, it must at least be so small that p (VAC) has negligible gravitational effects Observational limit on the 
[|p (VAC) | < p(MATTEK)j wherever Newton’s theory of gravity gives a successful account cosmological constant 
of observations. The systems of lowest density to which one applies Newtonian theory 
with some (though not great) success are small clusters of galaxies. Hence, one can 
place the limit 


|P (y AC)| = 14/8* < p(CLUSTER) _ \Q-29 g/cm 3 _ JQ-57 cm ~2 ( 17 . 14 ) 

on the value of the cosmological constant. Evidently, even if A ^ 0, A is so small Why one ignores the 
that it is totally unimportant on the scale of a galaxy or a star or a planet or a cosmological constant 
man or an atom. Consequently it is reasonable to stick with Einstein’s original cosmology 6 6XCept m 
geometrodynamic law (G = 877T; A = 0) everywhere, except occasionally when 
discussing cosmology (Chapters 27-30). 


Exercise 17.5. MAGNITUDE OF COSMOLOGICAL CONSTANT 

(a) What is the order of magnitude of the influence of the cosmological constant on the 
celestial mechanics of the solar system if yl — 10~ 57 cm" 2 ? 


EXERCISE 
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(b) Show that the mass-energy density of the vacuum p (%AC) = /I /877 ~ 10 29 g/cm 3 , 
corresponding to the maximum possible value of A, agrees in very rough magnitude with 

rest mass of an elementary particle x itational fine . s tructure constant) 
(Compton wavelength of particle) 3 

— m m2 — m6 
(ti/m) 3 h fi 4 

[Zel’dovich (1967, 1968)]. This numerology is suggestive, but has not led to any believable 
derivation of a stress-energy tensor for the vacuum. 


§17.4. THE NEWTONIAN LIMIT 

Just as quantum mechanics reduces to classical mechanics in the “correspondence 
limit” of large actions, / /), so general relativity reduces to Newtonian theory in 

the “correspondence limit” of weak gravity and low velocities. (On “correspondence 
limits” see Bdx 17.1.) This section elucidates, in some mathematical detail, the 
correspondence between general relativity and Newtonian theory. It begins with 
“passive” aspects of gravitation (response of matter to gravity) and then turns to 
“active” aspects (generation of gravity by matter). 

Consider an isolated system—e.g., the solar system—in which Newtonian theory 
is highly accurate. In order that special relativistic effects not be noticeable, all 

r —-- 


Box 17.1 CORRESPONDENCE PRINCIPLES 


A. General Remarks and 
Specific Examples 


1. As physics develops and expands, its unity is 
maintained by a network of correspondence 
principles, through which simpler theories 
maintain their vitality by links to more sophis¬ 
ticated but more accurate ones, 

a. Physical optics, with all the new diffraction 
and interference phenomena for which it 
accounted, nevertheless also had to account, 
and did account, for the old, elementary, 
geometric optics of mirrors and lenses. Geo¬ 
metric optics is recovered from physical 
optics in the mathematical “correspondence 



principle limit” in which the wavelength is 
made indefinitely small in comparison with 
all other relevant dimensions of the physical 
system. 

b. Newtonian mechanics is recovered from the 
mechanics of special relativity in the mathe¬ 
matical “correspondence principle limit” in 
which all relevant velocities are negligibly 
small compared to the speed of light. 

c. Thermodynamics is recovered from its suc¬ 
cessor theory, statistical mechanics, in the 
mathematical “correspondence principle 
limit” in which so many particles are taken 
into account that fluctuations in pressure. 
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particle number, and other physical quanti¬ 
ties are negligible compared to the average 
values of these parameters of the system, 
d. Classical mechanics is recovered from quan¬ 
tum mechanics in the “correspondence prin¬ 
ciple limit” in which the quantum numbers 
of the quantum states in question are so 
large, or the quantities of action that come 
into play are so great compared to ft , that 
wave and diffraction phenomena make neg¬ 
ligible changes in the predictions of standard 
deterministic classical mechanics. Niels Bohr 
formulated and took advantage of this cor¬ 
respondence principle even before any 
proper quantum theory existed. He used it 
to predict approximate values of atomic en¬ 
ergy levels and of intensities of spectral lines. 
He also expounded it as a guide to all physi¬ 
cists, first in searching for a proper version 
of the quantum theory, and then in elucidat¬ 
ing the content of this theory after it was 
found. 

2. In all these examples and others, the newer, 
more sophisticated theory is “better” than its 
predecessor because it gives a good description 
of a more extended domain of physics, or a 
more accurate description of the same domain, 
or both. 

3. The correspondence between the newer theory 
and its predecessor (a) gives one the power to 
recover the older theory from the newer; 
(b) can be exhibited by straightforward mathe¬ 
matics; and (c), according to the historical rec¬ 
ord, often guided the development of the 
newer theory. 

B. Correspondence Structure of 
General Relativity 

1. Einstein’s theory of gravity has as distinct limit¬ 
ing cases (a) special relativity; (b) the “linear- 

__ 



ized theory of gravity”; (c) Newton’s theory of 
gravity; and (d) the post-Newtonian theory of 
gravity. Thus, it has a particularly rich corre¬ 
spondence structure. 

a. Correspondence with special relativity: Gen¬ 
eral relativity has two distinct kinds of corre¬ 
spondence with special relativity. The first is 
the limit of vanishing gravitational field 
everywhere (vanishing curvature); in this 
limit one can introduce a global inertial 
frame, set g^ = tj , and recover completely 
and precisely the theory of special relativity. 
The second is local rather than global; it is 
the demand (“correspondence principle”; 
“equivalence principle”) that in a local iner¬ 
tial frame all the laws of physics take on 
their special relativistic forms. As was seen 
in Chapter 16, this puts no restrictions on the 
metric (except that g^ = tj and = 0 
in local inertial frames); but it places severe 
constraints on the behavior of matter and 
fields in the presence of gravity. 

b. Correspondence with Newtonian theory: In 
the limit of weak gravitational fields, low 
velocities, and small pressures, general rela¬ 
tivity reduces to Newton’s theory of gravity. 
The correspondence structure is explored 
mathematically in the text of §17.4. 

c. Correspondence with post-Newtonian theory: 
When Newtonian theory is nearly valid, but 
“first-order relativistic corrections” might be 
important, one often uses the “post-Newton¬ 
ian theory of gravity.” Chapter 39 expounds 
the post-Newtonian theory and its corre¬ 
spondence with both general relativity and 
Newtonian theory. 

d. Correspondence with linearized theory: In the 
limit of weak gravitational fields, but possi¬ 
bly large velocities and pressures (u~~ 1, 
Tjk ~~ T 00 ) general relativity reduces to the 
“linearized theory of gravity”. This corre¬ 
spondence is explored in Chapter 18. 


j 
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Conditions which a system 
must satisfy for Newton's 
theory of gravity to be 
accurate 


"Newtonian coordinates" 
defined 


velocities in the system, relative to its center of mass and also relative to the Newto¬ 
nian coordinates, must be small compared to the speed of light 

v<l. _(17.15a) 

As a particle falls from the outer region of the system to the inner region, gravity 
accelerates it to a kinetic energy \mv 2 ~~ |w0[ max . [Here <P < 0 is Newton’s gravita¬ 
tional potential, so normalized that #(oo) = 0.] The resulting velocity will be small 
only if 

|#| < 1. (17.15b) 

Internal stresses in the system also produce motion—e.g., sound waves. Such waves 
have characteristic velocities of the order of \t*’/T 00 \ 1/2 —for example, the speed 
of sound in a perfect fluid is 

i; = (dp/dp) 1/2 - (p/p) l/2 - \T^/T 00 \ l/2 . 

In order that these velocities be small compared to the speed of light, all stresses 
must be small compared to the density of mass-energy 

\T^\/T 00 = \T l i\/p < 1. (17.15c) 


When, and only when conditions (17.15) hold, one can expect Newtonian theory 
to describe accurately the system being studied. Correspondence of general relativity 
with Newtonian theory for gravity in a passive role then demands that the geodesic 
world lines of freely falling particles reduce to the Newtonian world lines 

d 2 x { /dt 2 = -d<P/dx\ (17,16) 

Moreover, they must reduce to this form in any relativistic coordinate system where 
the source and test particles have low velocities v < 1, and where coordinate lengths 
and times agree very nearly with the lengths and times of the Newtonian coor¬ 
dinates—which in turn are proper lengths and times as measured by rods and clocks. 
Thus, the relevant coordinates (called “ Galilean ” or “ Newtonian ” coordinates) are 
ones in which 

gnw = v + M = \ dxi / dt \ < 1 (17.17) 

(weak gravitational field; nearly inertial coordinates; low velocities). In such a 
coordinate system, the geodesic world lines of test particles have the form 


d 2 x % _ d 2 x { 
dt 2 ~ ~dr r 


(since dt/dr ss 1 when < 1 and \v’\ < 1) 


ri dx0L jfcf 
^ dr dr 


(geodesic equation) 



(since dt/dr zz 1 and \dx j /dr\ < 1) 
(since zz tj^) 

(equation for 7 T a/3y in terms of g a p,y) 


\_ 

2 


* 00,1 


( all velocities small compared to c implies time 
derivatives small compared to space derivatives 
—i. e *, hafi,0 ~~ Vhafl'i 
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These geodesic world lines do, indeed, reduce to those of Newtonian theory [equation 
(17.16)] if one makes the identification 

^00 = ~~ y^oo.i — (17.18) 


Together with the boundary conditions <P(r = oo) = 0 and h^fr = oo) = 0 (coor¬ 
dinates Lorentz far from the source), this identification implies h ^ = —2$; i.e., 


goo = — 1 — 20 for nearly Newtonian systems in Newtonian coordinates. (17.19) 


Note that the correspondence tells one the form of h 0Q for nearly Newtonian 
systems, but not the forms of the other components of the metric perturbation. In 
fact, the other h^ could perfectly well be of the same order of magnitude as h Q0 — <P , 
without influencing the world lines of slowly moving particles, because they always 
enter the geodesic equation multiplied by the small numbers v or v 2 , or differentiated 
by t rather than by x l . The forms of the other h ^ and their small corrections to 
the Newtonian motion will be explored in Chapters 18, 39, and 40. 

The relation g 00 = — 1 — 2 <P is the mathematical embodiment of the correspondence 
between general relativity theory and Newtonian theory for passive aspects of gravity. 
Together with the “validity conditions” (17.15, 17.17), it is a foundation from which 
one can derive all other aspects of the correspondence for “passive gravity,” including 
the relation 

Ri 0 j 0 = d 2 0/dx l dx> (17.20) 

(exercise 17.6). Alternatively, all other aspects of this correspondence can be derived 
by direct comparison of Newton’s predictions with Einstein’s. For example, to derive 
equation (17.20), examine the relative acceleration of two test particles, one at x* + £* 
and the other at x\ According to Newton 


d 2 ? 

~df 


d 2 (x l + | 4 ) d 2 x i 
dt 2 ~~dF 


d<P 
dx { 


aXx ij t f-* 


dx* 


six’ 


— 5 2 <? 

dx*dx> 


e. 


For comparison, Einstein predicts (equation of geodesic deviation) 


D 2 £‘ 

~dr r 


t 


d 2 g 

Ht 2 


— ■” R'oioZ- 


[by conditions (17.15) and (17.17)] 


Direct comparison gives relation (17.20). 

Turn now from correspondence for passive aspects of gravity to correspondence 
for active aspects. According to Einstein, mass generates gravity (spacetime curva¬ 
ture) by the geometrodynamic law G = 8 t>T. Apply this law to a nearly Newtonian 
system, and by the chain of reasoning that preceeds equation (17.8) derive the 
relation 


Einstein gravity reduces to 
Newton gravity only if, in 
Newtonian coordinates, 

Soo = “ 1 “20 


The correspondence between 
Einstein theory and Newton 
theory for all "passive" 
aspects of gravity 


The Newtonian limit of the 
Einstein field equation is 
= 4t7 P 


*oo — 4^0* 


(17.21) 
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EXERCISES 


There are many ways (Box 
17.2) to derive the Einstein 
field equation 


Combine with the contraction of (17.20), 

*00 = Ri 0 i 0 + *°ooo = W 3 ** = v2 ^> 

t- 

0 

and thereby obtain Newton’s equation for the generation of gravity by mass 

V 2 <P = 4t jp. (17.22) 

Thus, Einstein’s field equation reduces to Newton’s field equation in the Newtonian 
limit. 

The correspondence between Newton and Einstein, although clear and straight¬ 
forward as outlined above, is even more clear and straightforward when Newton s 
theory of gravity is rewritten in Einstein’s language of curved spacetime (Chapter 
12; exercise 17.7). 


Exercise 17.6. RAMIFICATIONS OF CORRESPONDENCE FOR GRAVITY IN 
A PASSIVE ROLE 

From the correspondence relation g 00 = —1 — 20, and from conditions (17.15) and (17.17) 
for Newtonian physics, derive the correspondence relations 

/^oo = d0/dx\ R { oj o = d 2 0/dx' dxK 

Exercise 17.7. CORRESPONDENCE IN THE LANGUAGE OF 
CURVED SPACETIME [Track 2] 

Exhibit the correspondence between the Einstein theory and Cartan’s curved-spacetime 
formulation of Newtonian theory (Chapter 12). 


§17.5. AXIOMATIZE EINSTEIN'S THEORY? 

Find the most compact and reasonable axiomatic structure one can for general 
relativity? Then from the axioms derive Einstein’s field equation, 

G = 8 77-7"? 

That approach would follow tradition. However, it may be out of date today. More 
than half a century has gone by since November 25, 1915. For all that time the 
equation has stood unchanged, if one ignores Einstein’s temporary “aberration” of 
adding the cosmological constant. In contrast the derivations have evolved and 
become more numerous and more varied. In the beginning axioms told what equa¬ 
tion is acceptable. By now the equation tells what axioms are acceptable. Box 17.2 
sketches a variety of sets of axioms, and the resulting derivations of Einstein s 
equation. 


(<continued on page 429) 
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Box 17.2 SIX ROUTES TO EINSTEIN'S GEOMETRODYNAMIC LAW 

OF THE EQUALITY OF CURVATURE AND ENERGY DENSITY 
("EINSTEIN'S FIELD EQUATION") 


[Recommended to the attention of Track-1 readers are only route 1 (automatic 
conservation of the source, plus correspondence with Newtonian theory) and 
route 2 (Hilbert’s variational principle); and even Track-2 readers are advised to 
finish the rest of this chapter before they study route 3 (physics on a spacelike slice), 
route 4 (going from superspace to Einstein’s equation), route 5 (field of spin 2 in 
an “unobservable flat spacetime” background), and route 6 (gravitation as an 
elasticity of space that arises from particle physics).] 

1. Model geometrodynamics after electrodynamics and treat “automatic conserva¬ 
tion of the source” and correspondence with the Newtonian theory of gravity 
as the central considerations. 

a. Particle responds in electrodynamics to field; in general relativity, to geometry. 

b. The potential for the electromagnetic field is the 4-vector A (components A ^). 
The potential for the geometry is th£ metric tensor g (components g ). 

c. The electromagnetic potential satisfies a wave equation with source term 
(4-current) on the right, 



so constructed that conservation of the source, = 0, is automatic (conse¬ 
quence of an identity fulfilled by the lefthand side). By analogy, the geometro- 
dynamic potential must also satisfy a wave equation with source term (stress- 
energy tensor) on the right, 

<v = ( 2 ) 

so constructed that conservation of the source, = 0 (Chapter 16) is 
“automatic.” This conservation is automatic here because the lefthand side 
of the equation is a tensor (the Einstein tensor; see Box 8.6 or Chapter 15), 
built from the metric components and their second derivatives, that fulfills 
the identity G^ v = 0. 

d. No other tensor which (1) is linear in the second derivatives of the metric 
components, (2) is free of higher derivatives, and (3) vanishes in flat spacetime, 
satisfies such an identity. 

e. The constant of proportionality (8 tt) is fixed by the choice of units [here 
geometric; see Box 1.8] and by the requirement (“correspondence with Newto¬ 
nian theory”) that a test particle shall oscillate back and forth through a 
collection of matter of density p, or revolve in circular orbit around that 
collection of matter, at a circular frequency given by u 2 — (4tt/ 3)p (Figure 
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~\ 

Box 17.2 (continued) 

1.12). The foregoing oversimplifies, and omits Einstein’s temporary false turns, 
but otherwise summarizes the reasoning he pursued in arriving at his field 
equation. This reasoning is spelled out in more detail in the text of Chap¬ 
ter 17. 

2. Take variational principle as central. 

a. Construct out of the metric components the only scalar that exists that (1) 
is linear in the second derivatives of the metric tensor, (2) contains no higher 
derivatives, and (3) vanishes in fiat spacetime: namely, the Riemann scalar 
curvature invariant, R. 

b. Construct the invariant integral, 


'-i h!*-#***- (3) 

c. Make small variations, Sgin the metric coefficients g^ v in the interior of 
the four-dimensional region (2, and find that this integral changes by the 
amount 

81 = -±- Jf G„ Sg^(-g) 1/2 d*x. (4) 

d. Demand that I should be an extremum with respect to the choice of geometry 
in the region interior to Q (81 = 0 for arbitrary 8g* iV \ “principle of extremal 
action”). 

e. Thus arrive at the Einstein field equation for empty space, 

<v = °- (5) 

f. The continuation of the reasoning leads to the identity 


V =0- 


2 '. 


Chapter 21, on the variational principle, gives more detail and takes up the 
additional term that appears on the righthand side of (5) when matter or fields 
or both are present. 

g. This approach goes back to David Hilbert (1915). No route to the field equa¬ 
tions is quicker. Moreover, it connects immediately (see the following section 
here, 2') with the quantum principle of the “democracy of all histories” [Feyn¬ 
man (1942); Feynman and Hibbs (1965)]. The variational principle is spelled 
out in more detail in Chapter 21. 

An aside on the meaning of the classical action integral for the real world of 

quantum physics. 

a. A “history of geometry,” H , is a spacetime, that is to say, a four-dimensional 
manifold with four-dimensional —b 4* + Riemann metric that (1) reduces on 
one spacelike hypersurface (“hypersurface of simultaneity”) to a specified 
“initial value 3-geometry,” A , with positive definite metric and (2) reduces on 
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another spacelike hypersurface to a specified “final value 3-geometry,” B , also 
with positive definite metric. 

b. The classical variational principle of Hilbert, as reformulated by Amowitt, 
Deser, and Misner, provides a prescription for the dynamical path length, 

of any conceivable history H , classically allowed or not, that connects A and 
B (see Chapter 21 for a fuller statement for what can and must be specified 
on the initial hypersurface of simultaneity, and on the final one, and for 
alternative choices of the integrand in the action principle). 

c. Classical physics says that a history H is allowed only if it extremizes the 
dynamic path length I as compared to all nearby histories. Quantum physics 
says that all histories occur with equal probability amplitude, in the following 
sense. The probability amplitude for “the dynamic geometry of space to transit 
from A to B” by way of the history H with action integral and by way 
of histories that lie within a specified infinitesimal range, tyH, of the history 
H , is given by the expression 


I probability amplitude \ 
to transit from A to 
B by way of history H 
and histories lying 
within the range <$H 
\ about H y 


- exp (U H /h)N<$H. 


( 6 ) 


Here the normalization factor, jV, is the same for all conceivable histories i/, 
allowed or not, that lead from A to B (“principle of democracy of histories”). 
The quantum of angular momentum, ft == h/2ir, expressed in geometric units, 
has the value 

h = h C0nv G/c 3 = (L*) 2 , (7) 

where L* is the Planck length, L* = 1.6 X 10“ 33 cm. 

d. The classically allowed history receives “preference without preference.” That 
history, and histories H that differ from it so little that 5 1 = h - Iclass is only 
of the order fi and less, give contributions to the probability amplitude that 
interfere constructively. In contrast, destructive interference effectively wipes 
out the contribution (to the probability amplitude for a transition) that comes 
from histories that differ more from the classically allowed history. Thus there 
are quantum fluctuations in the geometry, but they are fluctuations of limited 
magnitude. The smallness of fi ensures that the scale of these fluctuations is 
unnoticeable at everyday distances (see the further discussion in Chapters 43 
and 44). In this sense classical geometrodynamics is a good approximation to 
the geometrodynamics of the real world of quantum physics. 

3. “Physics on a spacelike slice or hypersurface of simultaneity,” again with electro¬ 
magnetism as the model. 

a. Say over and over “lines of magnetic force never end” and come out with 
half of Maxwell’s equations. Say over and over “lines of electric force end 



- \ 

The rest of this 
chapter is Track 
2. No previous 
track-2 material 
is needed as 
preparation for 
it, nor is it 
necessary prep¬ 
aration for any 
later chapter, 
but it will be 
helpful in Chap¬ 
ter 21 (initial- 
value equations 
and variational 
principle) and 
in Chapter 39 
(other theories 
of gravity). 

J 
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Box 17.2 (continued) 


only on charge” and arrive at the other half of Maxwell’s equations. Similarly, 
say over and over 

( intrinsic \ /extrinsic \ /local density \ 

curvature I + I curvature |= 167rlofmass- I (8) 

scalar / \scalar / \ energy / 

and end up with all ten components of Einstein’s equation. To “say over and 
over” is an abbreviation for demanding that the stated principles hold on every 
spacelike slice through every event of spacetime, 
b. Spell out explicitly this “spacelike-slice formulation” of the equations of 
Maxwell and Einstein. Consider an arbitrary point of spacetime, 9 (“event”), 
and an arbitrary “simultaneity” S through 9 (hypersurface of simultaneity; 
spacelike slice through spacetime). Magnetic lines of force run about through¬ 
out 5, but nowhere is even a single one of them permitted to end. Recall (§3.4) 
that the demand “lines of magnetic force never end,” when imposed on all 
reference frames at 9 (for all choices of the “simultaneity” 5), guarantees not 
only V * B = 0, but also V x E + dB/dt = 0. Similarly (§3.4) the demand 
that “electric lines of force never end except on electric charge,” V * E = 477/°, 
when imposed on all “simultaneities” through 9, guarantees the remaining 
Maxwell equation V x B = dE/dt + 477 /. 
c. Each simultaneity S through 9 has its own slope and curvature. The possibility 
of different slopes (different local Lorentz frames at 9) is essential for deriving 
all of Maxwell’s equations from the requirements of conservation of flux. 
Relevant though the slope thus is, the curvature of the hypersurface S never 
matters for the analysis of electromagnetism. It does matter, however, for any 
analysis of gravitation modeled on the foregoing treatment of electromagne¬ 
tism. 



“Simultaneity” S (spacelike hypersurface or “slice 
through spacetime”) that cuts through event 9. 
The “simultaneity” may be considered to be de¬ 
fined by a set of “observers” a,b,c, .... Their 
world lines cross the simultaneity orthogonally, 
and their clocks all read the same proper time at 
the instant of crossing. Another simultaneity 
through 9 may have at 9 a different curvature or 
a different slope or both; and it is defined by a 
different band of observers, with other wrist 
watches. 
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d. “Mass-energy curves space” is the central principle of gravitation. To spell out 
this principle requires one to examine in succession the terms “space” and 
“curvature of space” and “density of mass-energy in a given region of space.” 
“Space” means spacelike hypersurface; or, more specifically, a hypersurface 
of simultaneity S that includes the point 9 where the physics is under exami¬ 
nation. 

e. Denote by u the 4-vector normal to S at 9. Then the density of mass-energy 
in the spacelike hypersurface* S at 9 is 

P = ““^11* (9) 

in accordance with the definition of the stress-energy tensor given in Chap¬ 
ter 5. 

f. This density is a single number, dependent on the inclination of the slice one 
cuts through spacetime, but independent of how curved one cuts this slice. 
If it is to be equated to “curvature of space,” that curvature must also be 
independent of how curved one cuts the slice. 

g. Conclude that the geometric quantity, “curvature of space,” must (1) be a single 
number (a scalar) that (2) depends on the inclination u of the cut one makes 
through spacetime at 9 in constructing the hypersurface S , but (3) must be 
unaffected by how one curves his cut. The demand made here appears para¬ 
doxical. One seems to be asking for a measure of curvature that is independent 
of curvature! 

h. A closer look discloses that three distinct ideas come into consideration here. 
One is the scalar curvature invariant {3) R of the 3-geometry intrinsic to the 
hypersurface S at 9: “intrinsic” in the sense that it is defined by, and depends 
exclusively on, measurements of distance made within the hypersurface. The 
second is the “extrinsic curvature” of this 3-geometry relative to the 4-geometry 
of the enveloping spacetime (“how curved one cuts his slice”; see Box 14.1 
for more on the distinction between extrinsic and intrinsic curvature). The third 
is the curvature of the four-dimensional spacetime itself, “normal to w,” in 
some sense yet to be more closely defined. This is the quantity that is inde¬ 
pendent of how curved one cuts his slice. It is the quantity that is to be 
identified, up to a factor that depends on the choice of units, with the density 
of mass-energy. 
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Box 17.2 (continued) 


These three quantities are related in the following way: 

I a correction term that (a) 
depends only on the “ex¬ 
trinsic curvature” K aB (Box 


^scalar curvature invariant, ^ 
(3) R , of the 3-geometry 
intrinsic to the spacelike 
hypersurface S, a quantity 
dependent on “how curved 
\ one cuts the slice” / 


+ 


I a measure of the curvature \ 

' of spacetime that depends \ 
on the 4-geemetry of 
the spacetime and on 
the inclination u of the 
spacelike slice S cut 
through spacetime, but is 
independent, by construc¬ 
tion, of “how curved one , 
\ cuts the slice” / 


/2u a G a puP , where G a(i is^ 
the Einstein curvature 
tensor of equation 8.49 
\and Box 8.6 


14.1 and Chapter 21) of the 
hypersurface relative to 
the four-dimensional geometry 
in which it is imbedded, and 
(b) is so calculated (a 
uniquely determinate calcu¬ 
lation) that the sum of this 
correction term and {3) R is 
independent of “how curved 
one cuts his slice,” and (c) 
has the precise value 

K) 2 — Tr/C 2 = (K a a ) 2 — K a p 


\ nas 
\(Tr 




/ a scalar quantity that 
(a) is completely defined 
by what has just been 
said and (b) can there¬ 
fore be calculated in all 
completeness by standard 
differential geometry 
(details in Chapter 21) 




= 2 


I a quantity interpreted in \ 
Track 2, Chapter 15, as the 
“moment of rotation” asso¬ 
ciated with a unit element 
of 3-volume located at 9 
i in the hypersurface orth¬ 
ogonal to u f 


( 10 ) 


j. Conclude that the central principle, “mass-energy curves space,” translates to 
the formula 


i3) R + (Tr K) 2 - Tr K 2 = 16irp, 


(11) 
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or, in shorthand form, 


( moment of\ _ / intrinsic \ / 

rotation / Vcurvature/ \ 


extrinsic \ _/ density of 


curvature 


r\ 


mass-energy 


)• 


(12) 


valid for every spacelike slice through spacetime at any arbitrary point 9. 

k. All of Einstein’s geometrodynamics is contained in this statement as truly as 
all of Maxwell’s electrodynamics is contained in the statement that the number 
of lines of force that end in an element of volume is equal to 477 times the 
amount of charge in that element of volume. The factor I677 is appropriate 
for the geometric system of units in use in this book (density p in cm" 2 given 
by G/c 2 = 0.742 X 10" 28 cm/g multiplied by the density p conv expressed in 
the conventional units of g/cm 3 ). 

l. Reexpress the principle that “mass-energy curves space” in the form 


2u a G a ^ = \6iru a T afi uP. (13) 

Demand that this equation should hold for every simultaneity that cuts 
through 9, whatever its “inclination” u. 

m. Conclude that the coefficients on the two sides of (13) must agree; thus, 


G a/3 = %* T a /}> (14) 

Einstein’s equation in the language of components; or, in the language of 
abstract geometric quantities, 

G = 8777 . (15) 

4. Going from superspace to Einstein’s equation rather than from Einstein’s equa¬ 
tion to superspace. 

a. A fourth route to Einstein’s equation starts with the advanced view of geome¬ 
trodynamics that is spelled out in Chapter 43. One notes there that the 
dynamics of geometry unfolds in superspace. Superspace has an infinite 
number of dimensions. Any one point in superspace describes a complete 
3-geometry, (3) £, with all its bumps and curvatures. The dynamics of geom¬ 
etry leads from point to point in superspace. 

b. Like the dynamics of a particle, the dynamics of geometry lends itself to 
distinct but equivalent mathematical formulations, associated with the names 
of Lagrange, of Hamilton, and of Hamilton and Jacobi. Of these the most 
convenient for the present analysis is the last (“H-J”). 

c. In the problem of one particle moving in one dimension under the influence 
of a potential V(x ), the H-J equation reads 


dS _ 

ct 


,-A__ 

_£ , 

total 

kinetic 

energy 

f ner gy, 


(16) 
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Box 17.2 (continued) 


It has the solution 


S E (x,t) = -Et + J [2 m{E - V)] 1/2 dx. (17) 

Out of this solution one reads the motion by applying the “condition of 
constructive interference,” 

^>=0 (18) 

oE 

(one equation connecting the two quantities * and /; for more on the condition 
of constructive interference and the H-J method in general, see Boxes 25.3 
and 25.4). 

d. In the corresponding equation for the dynamics of geometry, one deals with 
a function S — S( i3) h) of the 3-geometry. It depends on the 3-geometry itself, 
and not on the vagaries of one’s choice of coordinates or on the corresponding 
vagaries in the metric coefficients of the 3-geometry, 

ds 2 = i3) g mn dx m dx n (19) 

( (3) to indicate 3-geometry omitted hereafter for simplicity).-This function obeys 
the H-J equation [the analog of (16)] 

+ ~ gijgmn) = 16w P‘ ( 20 ) 

e. Out of this equation for the dynamics of geometry in superspace one can 
deduce the Einstein field equation by reasoning similar to that employed in 
going from (17) to (18) (Gerlach 1969). 

f. It would appear that one must break new ground, and establish new founda¬ 
tions, if one is to find out how to regard the “Einstein-Hamilton-Jacobi equa¬ 
tion” (20) as more basic than the Einstein field equation that one derives from 
it. [Since done, by Hojman, Kuchar, and Teitelboim (1973 preprint).] 

5. Einstein’s geometrodynamics viewed as the standard field theory for a field of 

spin 2 in an “unobservable flat spacetime” background. 

a. This approach to Einstein’s field equation has a long history, references to 
which will be found in §7.1 and §18.1. (Further discussion of this approach 
will be found in those two sections and in Box 7.1, exercise 7.3, and Box 18.1). 

b. The following summary is quoted from Deser (1970): “We wish to give a simple 
physical derivation of the nonlinearity..., using a now familiar argument 
; . . leading from the linear, massless, spin-2 field to the full Einstein equa¬ 
tions .... 
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“The Einstein equations may be derived nongeometrically by noting that the 
free, massless, spin-2 field equations, 


^ 2 ^ aai^y^fiv — ^ fiufo )— [Ol/A cJIvft 

"E *b Va/3^v ^ fo afoaft (^1) 

whose source is the matter stress-tensor T v9 must actually be coupled to the 
total stress-tensor, including that of the <f>-field itself. That is, while the free-field 
equations (21) are of course quite consistent as they stand, [they are not] when 
there is a dynamic system’s T^ v as a source. For then the left side, which is 
identically divergenceless, is inconsistent with the right, since the coupling 
implies that T ,fiV v> as computed from the matter equations of motion, is no 
longer conserved. 

d. “To remedy this [violation of the principle of conservation of momentum and 

energy], the stress tensor (2) 6 arising from the quadratic Lagrangian (2) L 

responsible for equation (21) is then inserted on the right. 

e. “But the Lagrangian (3) L leading to these modified equations is then cubic, 
and itself contributes a cubic (3) 8^ v . 

f. “This series continues indefinitely, and sums (if properly derived!) to the full 
nonlinear Einstein equations, G^ ([calculated from] 7j a/3 -f- <j> a p) = — kT' 
[ 4 - 877 Tp V in the geometric units and sign conventions of this book], which are 
an infinite series in the deviation of the metric g^ v from its Minkowskian 
value 7 ) {XV . 

g. Once the iteration is begun (whether or not a is actually present), it must 
be continued to all orders, since conservation only holds for the full series 

x 

^ (n) 8^ Thus, the theory is either left in its (physically irrelevant) free linear 

n=2 

form (21), or it must be an infinite series.” 

h. For details, see Deser (1970); the paper goes on (1) to take advantage of a 
well-chosen formalism (2) to rearrange the calculation, and thus (3) to “derive 
the full Einstein equations, on the basis of the same self-coupling requirement, 
but with the advantages that the full theory emerges in closed form with just 
one added (cubic) term, rather than as an infinite series.” 

i. Deser summarizes the analysis at the end thus: “Consistency has therefore led 
us to universal coupling, which implies the equivalence principle. It is at this 
point that the geometric interpretation of general relativity arises, since all 
matter now moves in an effective Riemann space of metric v = r\ lxv + 

. . . [The] initial flat ‘background’ space is no longer observable.” In other 
words, this approach to Einstein’s field equation can be summarized as “cur¬ 
vature without curvature” or—equally well—as “flat spacetime without flat 
spacetime”! 
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6. Sakharov’s view of gravitation as an elasticity of space that arises from particle 
physics. 

a. The resistance of a homogeneous isotropic solid to deformation is described 
by two elastic constants, Young’s modulus and Poisson’s ratio. 

b. The resistance of space to deformation is described by one elastic constant, 
the Newtonian constant of gravity. It makes its appearance in the action 
principle of Hilbert 


,= ltd /“*<-*>*'’<'** 

+ / (^matter + fields )(~g) 1/2 d** = extremum. 


( 22 ) 


c. According to the historical records, it was first learned how many elastic 
constants it takes to describe a solid from microscopic molecular models of 
matter (Newton, Laplace, Navier, Cauchy, Poisson, Voigt, Kelvin, Bom), not 
from macroscopic considerations of symmetry and invariance. Thus, count the 
energy stored up in molecular bonds that are deformed from natural length 
or natural angle or both. Arrive at an expression for the energy of deformation 
per unit volume of the elastic material of the form 

e = A(Tx s ) 2 + BTt(s 2 ). (23) 

Here the strain tensor 


J7 . gf, \ 

2\dx n + dx m J 


(24) 


measures the strain produced in the elastic medium by motion of the typical 
point that was at the location x m to the location x m + f m (x). The constants 
A and B are derived out of microscopic physics. They fix the values of the 
two elastic constants of the macroscopic theory of elasticity. 

d. Andrei Sakharov (1967) ( the Andrei Sakharov) has proposed a similar micro¬ 
scopic foundation for gravitation or, as he calls it, the “metric elasticity of 
space.” He identifies the action term of Einstein’s geometrodynamics [the first 
term in (22)] “with the change in the action of quantum fluctuations of the 
vacuum [associated with the physics of particles and fields and brought about] 
when space is curved.” 

e. Sakharov notes that present-day quantum field theory “gets rid by a renor¬ 
malization process” of an energy density in the vacuum that would formally 
be infinite if not removed by this renormalization. Thus, in the standard 
analysis of the degrees of freedom of the electromagnetic field in flat space, 
one counts the number of modes of vibration per unit volume in the range 
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of circular wave numbers from k to k + dk as (2 • An/i^k 2 dk. Each mode 
of oscillation, even at the absolute zero of temperature, has an absolute irre¬ 
ducible minimum of “zero-point energy of oscillation,” %hv = \fick [the fluctu¬ 
ating electric field associated with which is among the most firmly established 
of all physical effects. It acts on the electron in the hydrogen atom in supple¬ 
ment to the electric field caused by the proton alone, and thereby produces 
most of the famous Lamb-Retherford shift in the energy levels of the hydrogen 
atom, as made especially clear by Welton (1948) and Dyson (1954)]. The 
totalized density of zero-point energy of the electromagnetic field per unit 
volume of spacetime (units: cm 4 ) formally diverges as 

( ti/2n 2 ) f k 3 dk. (25) 

J o 

Equally formally this divergence is “removed” by “renormalization” [for more 
on renormalization see, for example, Hepp (1969)]. 

f. Similar divergences appear when one counts up formally the energy associated 
with other fields and with vacuum fluctuations in number of pairs of electrons, 
/x-mesons, and other particles in the limit of quantum energies large in com¬ 
parison with the rest mass of any of these particles. Again these divergences 
in formal calculations are “removed by renormalization.” 

g. Removed by renormalization is a contribution not only to the energy density, 
and therefore to the stress-energy tensor, but also to the total Lagrange function 
£ of the variational principle for all these fields and particles, 

/ = J £ d 4 x = extremum. (26) 

h. Curving spacetime alters all these energies, Sakharov points out, extending 
an argument of Zel’dovich (1967). Therefore the process of “renormalization” 
or “subtraction” no longer gives zero. Instead, the contribution of zero-point 
energies to the Lagrangian, expanded as a power series in powers of the 
curvature, with numerical coefficients A, B, ... of the order of magnitude of 
unity, takes a form simplified by Ruzmaikina and Ruzmaikin (1969) to the 
following: 

£{R) = Atifk 3 dk + Bft^R fkdk 

+ fi[C( (4 >R) 2 + DR a AR a/3 ] J Ar 1 dk 

+ (higher-order terms). (27) 

[For the alteration in the number of standing waves per unit frequency in a 
curved manifold, see also Berger (1966), Sakharov (1967), Hill in De Witt 
(1967c), Polievktov-Nikoladze (1969), and Berger, Gauduchon, and Mazet 
(1971).] 

i. Renormalization physics argues that the first term in (27) is to be dropped. 
The second term, Sakharov notes, is identical in form to the Hilbert action 
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Box 17.2 (i continued ) 


principle, equation (3) above, with the exception that there the constant 
that multiplies the Riemann scalar curvature invariant is — c 3 /16ttG (in con¬ 
ventional units), whereas here it is Bfijk dk (in the same conventional units). 
The higher order terms in (27) lead to what Sakharov calls “corrections ... to 
Einstein’s equations.” 

j. Overlooking these corrections, one evidently obtains the action principle of 
Einstein’s theory when one insists on the equality 


( Newtonian \ 
^constant of gravity/ 


\6vBhfkdk 


With B a dimensionless numerical factor of the order of unity, it follows, 
Sakharov argues, that the effective upper limit or “cutoff” in the formally 
divergent integral in (28) is to be taken to be of the order of magnitude of 
the reciprocal Planck length [see equation (7)], 

Cutoff ^ {c z /fiGY /2 = 1/L* = 1/1.6 X 10- 33 cm. (29) 

In effect Sakharov is saying (1) that field physics suffers a sea change into 
something new and strange for wavelengths less than the Planck length, and 
for quantum energies of the order of ftc/c cutoff — 10 28 eV or 10 -5 g or more; 
(2) that in consequence the integral $k dk is cut off; and (3) that the value 
of this cutoff, arising purely out of the physics of fields and particles, governs 
the value of the Newtonian constant of gravity, G. 

k. In this sense, Sakharov’s analysis suggests that gravitation is to particle physics 
as elasticity is to chemical physics: merely a statistical measure of residual 
energies. In the one case, molecular bindings depend on departures of mole¬ 
cule-molecule bond lengths from standard values. In the other case, particle 
energies are affected by curvatures of the geometry. 

l. Elasticity, which looks simple, gets its explanation from molecular bindings, 
which are complicated; but molecular bindings, which are complicated, receive 
their explanation in terms of Schrodinger’s wave equation and Coulomb’s law 
of force between charged point-masses, which are even simpler than elasticity. 

m. Einstein’s geometrodynamics, which looks simple, is interpreted by Sakharov 
as a correction term in particle physics, which is complicated. Is particle physics, 
which is complicated, destined some day in its turn to unravel into something 
simple—something far deeper and far simpler than geometry (“pregeometry”; 
Chapter 44)? 
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§17.6. "NO PRIOR GEOMETRY": A FEATURE DISTINGUISHING 
EINSTEIN'S THEORY FROM OTHER THEORIES 
OF GRAVITY 


Whereas Einstein’s theory of gravity is exceedingly compelling, one can readily 
construct less compelling and less elegant alternative theories. The physics literature 
is replete with examples [see Ni (1972), and Thome, Ni, and Will (1971) for reviews]. 
However, when placed among its competitors, Einstein’s theory stands out sharp 
and clear: it agrees with experiment; most of its competitors do not (Chapters 38-40). 
It describes gravity entirely in terms of geometry; most of its competitors do not. 
It is free of any “prior geometry”; most of its competitors are not. 

Set aside, until Chapter 38, the issue of agreement with experiment. Einstein’s 
theory remains unique. Every other theory either introduces auxiliary gravitational 
fields [e.g., the scalar field of Brans and Dicke (1961)], or involves “prior geometry,” 
or both. Thus, every other theory is more complicated conceptually than Einstein’s 
theory. Every other theory contains elements of complexity for which there is no 
experimental motivation. 

The concept of “prior geometry” requires elucidation, not least because the 
rejection of prior geometry played a key role in the reasoning that originally led 
Einstein to his geometrodynamic equation G — 8 77 7 ”. By “prior geometry” one means 
any aspect of the geometry of spacetime that is fixed immutably, /.<?., that cannot be 
changed by changing the distribution of gravitating sources. Thus, prior geometry is 
not generated by or affected by matter; it is not dynamic. Example: Nordstrom (1913) 
formulated a theory in which the physical metric of spacetime g (the metric that 
enters into the equivalence principle) is generated by a “background” flat-spacetime 
metric rj, and by a scalar gravitational field <f>: 


Einstein's theory compared 
with other theories of gravity 


AM other theories introduce 
auxiliary gravitational fields 
or prior geometry 


"Prior geometry" defined 


Nordstrom’s theory as an 
illustration of prior geometry 


V a$ <t>, a p = — 47707)^ r a/ 3 


Sa/3 = <t> 2 y a /3 


( generation of <j> by 
stress-energy 

( construction of 
from <p and rj ) 


(17.23a) 

(17.23b) 


In this theory, the physical metric g (governor of rods and clocks and of test-particle 
motion) has but one changeable degree of freedom—the freedom in <j>. The rest of 
g is fixed by the flat spacetime metric (“prior geometry”) tj. One does not remove 
the prior geometry by rewriting Nordstrom’s equations (1723) in a form 


[curvature scalar 

j • 



fiV u 

constructed from o 

-Cr 73 ^] 


Weyl tensor 

L *\J 



constructed from g 


(17.24) 


devoid of reference to rj and $ [Einstein and Fokker (1914); exercise 17.8], Mass 
can still influence only one degree of freedom in the spacetime geometry. The other 
degrees of freedom are fixed a priori— they are prior geometry. And this prior 
geometry can perfectly well (in principle) be detected by physical experiments that 
make no reference to any equations (Box 17.3). 
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r a 

Box 17.3 AN EXPERIMENT TO DETECT OR EXCLUDE CERTAIN TYPES OF PRIOR GEOMETRY 
(Based on December 1970 discussions between Alfred Schild and Charles W. Misner) 


Choose a momentarily static universe populated 
with a large supply of suitable pulsars. The pulsars 
should be absolutely regular, periodically emitting 
characteristic pulses of both gravitational and 
electromagnetic waves. 

Two fleets of spaceships containing receivers are 
sent out “on station” to collect the experimental 
data. Admiral Weber’s fleet carries gravitational- 
wave receivers; Admiral Hertz’s fleet, electromag¬ 
netic receivers. The captain of each spaceship 
holds himself “on station” by monitoring three 
suitably chosen pulsars (of identical frequency) 
and maneuvering so that their pulses always arrive 
in coincidence. The experimental data he collects 
consist of the pulses received from all other pul¬ 
sars, which he is not using for station keeping, each 
registered as coincident with or interlaced among 
the reference (stationary) pulses. [For display pur¬ 
poses, the pattern produced by any single pulsar 
can be converted to acoustic form. The reference 
pulses can be played acoustically (by the data- 
processing computer) on one drum at a fixed rate, 
and the pulses from other pulsars can be played 
on a second drum. A pattern of rythmic beats will 
result.] 

When the data fleet is checked out and tuned 
up, each captain reports stationary patterns. Now 
the experiment begins. One or more massive stars 
are towed in among the fleet. The fleet reacts to 
stay on station, and reports changes in the data 
patterns. The spaceships on the outside edges of 
the fleet verify that no detectable changes occur 
at their stations; so the incident radiation from the 
distant pulsars can be regarded as unaffected by 
the newly placed stars. Data stations nearer the 
movable stars report the interesting data. 

What are the results? 

In a universe governed by the laws of special 
relativity (spacetime always flat), no patterns 
change. (Weber’s fleet was unable to get checked 


out in the first place, as no gravitational waves 
were ever detected from the pulsars). Neither stars, 
nor anything else, can produce gravitational fields. 
All aspects of the spacetime geometry are fixed a 
priori (complete prior geometry!). There is no 
gravity; and no light deflection takes place to make 
Hertz’s captains adjust their positions. 

In a universe governed by Nordstrom’s theory 
of gravity (see text) both fleets get checked out— 
i.e., both see waves. But neither fleet sees any 
changes in the rhythmic pattern of beats. The stars 
being towed about have no influence on either 
gravitational waves or electromagnetic waves. The 
prior geometry ( rj ) present in the theory precludes 
any light deflection or any gravitational-wave 
deflection. 

In a universe governed by Whitehead’s (1922) 
theory of gravity [see Will (1971b) and references 
cited therein], radio waves propagate along geo¬ 
desics of the “physical metric” g , and get deflected 
by the gravitational fields of the stars. But gravita¬ 
tional waves propagate along geodesics of a flat 
background metric rj , and are thus unaffected by 
the stars. Consequently, Hertz’s captains must ma¬ 
neuver to keep on station; and they hear a chang¬ 
ing beat pattern between the reference pulsars and 
the other pulsars. But Weber’s fleet remains on 
station and records no changes in the beat pattern. 
The prior geometry (rj) shows itself clearly in the 
experimental result. 

In a universe governed by Einstein’s theory, 
both fleets see effects (no sign of prior geometry 
because Einstein’s theory has no prior geometry). 
Moreover, if the fleets were originally paired, one 
Weber ship and one Hertz at each station, they 
remain paired. No differences exist between the 
propagation of high-frequency light waves and 
high-frequency gravitational waves. Both propa¬ 
gate along geodesics of g. 


\ _ j 
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Mathematics was not sufficiently refined in 1917 to cleave apart the demands for 
no prior geometry” and for a “geometric, coordinate-independent formulation of 
physics.” Einstein described both demands by a single phrase, “general covariance.” 
The “no-prior-geometry” demand actually fathered general relativity, but by doing 
so anonymously, disguised as “general covariance,” it also fathered half a century 
of confusion. [See, e.g., Kretschmann (1917).] 

A systematic treatment of the distinction between prior geometry (“absolute 
objects”) and dynamic fields (“dynamic objects”) is a notable feature of Anderson’s 
(1967) relativity text. 


"No prior geometry" as a 
part of Einstein's principle of 
"general covariance" 


Exercise 17.8. EINSTEIN-FOKKER REDUCES TO NORDSTR0M EXERCISE 

The vanishing of the Weyl tensor [equation (13.50)] for a spacetime metric g guarantees 
that the metric is conformally fiat—i.e., that there exists a scalar field <p such that g = <p 2 q, 
where q is a flat-spacetime metric. [See, e.g., Schouten (1954) for proof.] Thus, the Einstein- 
Fokker equation (17.24), C a *^ = 0, is equivalent to the Nordstrom equation (17.23b). With 
this fact in hand, show that the Einstein-Fokker field equation R = 24ttT reduces to the 
Nordstrom field equation (17.23a). 


§17.7. A TASTE OF THE HISTORY OF EINSTEIN'S EQUATION 

Nothing shows better what an idea is and means today than the battles and changes 
it has undergone on its way to its present form. A complete history of general 
relativity would demand a book. Here let a few key quotes from a few of the great 
papers give a little taste of what a proper history might encompass. 

Einstein (1908): “We . . . will therefore in the following assume the complete physical 
equivalence of a gravitational field and of a corresponding acceleration of the reference 
system. ... the clock at a point P for an observer anywhere in space runs (1 + <P/c 2 ) 
times faster than the clock at the coordinate origin. ... it follows that light rays are 
curved by the gravitational field. ... an amount of energy E has a mass E/c 2 ” 
Einstein and Grossmann (1913): “The theory described here originates from the 
conviction that the proportionality between the inertial and the gravitational mass of 
a body is an exact law of nature that must be expressed as a foundation principle 
of theoretical physics. ... An observer enclosed in an elevator has no way to decide 
whether the elevator is at rest in a static gravitational field or whether the elevator 
is located in gravitation-free space in an accelerated motion that is maintained by forces 
acting on the elevator (equivalence hypothesis). ... In the decay of radium, for 
example, that decrease [of mass] amounts to 1/10,000 of the total mass. If those changes 
in inertial mass did not correspond to changes in gravitational mass, then deviations 
of inertial from gravitational masses would arise that are far larger than the Eotvos 
experiments allow. It must therefore be considered as very probable that the identity 
of gravitational and inertial mass is exact. 
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'‘The sought for generalization will surely be of the form 

r „v = kT ,u- 

where k is a constant and r is a contravariant tensor of the second rank that arises 
out of the fundamental tensor g fiV through differential operations. ... it proved 
impossible to find a differential expression for r^ that is a generalization of [Poisson’s] 
A<$>, and that is a tensor with respect to arbitrary transformations. ... It seems most 
natural to demand that the system of equations should be covariant against arbitrary 
transformations. That stands in conflict with the result that the equations of the 
gravitational field do not possess this property.” 

Einstein and Grossman (1914): “In a 1913 treatment ... we could not show general 
covariance for these gravitational equations. [Origin of their difficulty: part of the 
two-index curvature tensor was put on the left, to constitute the second-order part of 
the field equation, and part was put on the right with and was called gravitational 
stress-energy. It was asked that lefthand and righthand sides transform as tensors, which 
they cannot do under general coordinate transformations.] 

Einstein (1915a): “In recent years I had been trying to found a general theory of 
relativity on the assumption of the relativity even of nonuniform motions. I believed 
in fact that I had found the only law of gravitation that corresponds to a reasonably 
formulated postulate of general relativity, and I sought to establish the necessity of 
exactly this solution in a paper that appeared last year in these proceedings. 

“A renewed analysis showed me that that necessity absolutely was not shown in 
the approach adopted there; that it nevertheless appeared to be shown rested on an 
error. 

“For these reasons, I lost all confidence in the field equations I had set up, and 
I sought for an approach that would limit the possibilities in a natural way. In this 
way I was led back to the demand for the general covariance of the field equations, 
from which I had departed three years ago, while working with my friend Grossmann, 
only with a heavy heart. In fact we had already at that time come quite near to the 
solution of the problem that is given in what follows. 

“According to what has been said, it is natural to postulate the field equations of 
gravitation in the form 

R — _k T 

J fiv-i 

since we already know that these equations are covariant with respect to arbitrary 
transformations of determinant 1. In fact, these equations satisfy all conduions-4hat 
we have to impose on them. [Here is a piece of the Ricci tensor that Einstein 
regarded as covariant.] . . . 

“Equations (22a) give in the first approximation 

82 g a/i __ o 

dx^xl 3 

By this [condition] the coordinate system is Still not determined, in the sense that for 
this determination four equations are necessary.” (Session of Nov. 4, 1915, published 
Nov. 11.) 

Einstein (1915b): “In a recently published investigation, I have shown how a theory 
of the gravitational field can be founded on Riemann’s covariant theory of many-di- 
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mensional manifolds. Here it will now be proved that, by introducing a surely bold 
additional hypothesis on the structure of matter, a still tighter logical structure of the 
theory can be achieved. ... it may very well be possible that in the matter to which 
the given expression refers, gravitational fields play an essential part. Then T* can 
appear to be positive for the entire structure, although in reality only T* 4- t* is 
positive, and T * vanishes everywhere. We assume in the following that in fact the 
condition T * = 0 is fulfilled [quite] generally. 

“Whoever does not from the beginning reject the hypothesis that molecular [small- 
scale] gravitational fields constitute an essential part of matter will see in the following 
a strong support for this point of view. 

“Our hypothesis makes it possible ... to give the field equations of gravitation in 
a generally covariant form . . . 



[where G ' „ is the Ricci tensor].” (Session of Nov. 11, 1915; published Nov. 18.) 

Einstein (1915c): “I have shown that no objection of principle stands in the way 
of this hypothesis [the field equations], by which space and time are deprived of the 
last trace of objective reality. In the present work I find an important confirmation 
of this most radical theory of relativity: it turns out that it explains qualitatively and 
quantitatively the secular precession of the orbit of Mercury in the direction of the 
orbital motion, as discovered by Leverrier, which amounts to about 45" per century, 
without calling on any special hypothesis whatsoever.” 

Einstein (1915d; session of Nov. 25, 1915; published Dec. 2): “More recently I have 
found that one can proceed without hypotheses about the energy tensor of matter when 
one introduces the energy tensor of matter in a somewhat different way than was done 
in my two earlier communications. The field equations for the motion of the perihelion 
of Mercury are undisturbed by this modification.... 

“Let us put 


Gim = ~ K ( T im ~ 

[where G im is the Ricci tensor].” . . . 

. . . these equations, in contrast to (9), contain no new condition, so that no other 
assumption has to be made about the energy tensor of matter than obedience to the 
energy-momentum [conservation] laws. 

“With this step, general relativity is finally completed as a logical structure. The 
postulate of relativity in its most general formulation, which makes the spacetime 
coordinates into physically meaningless parameters, leads compellingly to a completely 
determinate theory of gravitation that explains the perihelion motion of Mercury. In 
contrast, the general-relativity postulate is able to open up to us nothing about the 
nature of the other processes of nature that special relativity has not already taught. 
The opinion on this point that I recently expressed in these proceedings was erroneous. 
Every physical theory compatible with special relativity can be aligned into the system 
of general relativity by means of the absolute differential calculus, without [general 
relativity] supplying any criterion for the acceptability of that theory.” 

Hilbert (1915): “Axiom I [notation changed to conform to usage in this book]. The 
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law of physical events is determined through a world function [Mie’s terminology; 
better known today as “Lagrangian”] L, that contains the following arguments: 

S >“” dx a ’ dxVdxP ’ 



and specifically the variation of the integral 

f L(—g) ,/2 d i x 

must vanish for [changes in] every one of the 14 potentials g av9 A a . . . . 

“Axiom II (axiom of general invariance). The world function L is invariant with 
respect to arbitrary transformations of the world parameters [coordinates] . . . 

“For the world function L, still further axioms are needed to make its choice 
unambiguous. If the gravitation equations are to contain only second derivatives of 
the potentials g av , then L must have the form 

L = R 4- L e i ec , 

where R is the invariant built from the Riemann tensor (curvature of the four-dimen¬ 
sional manifold.” (Session of Nov. 20, 1915.) 

Einstein (1916c): “Recently H. A. Lorentz and D. Hilbert have succeeded in giving 
general relativity an especially transparent form in deriving its equations from a single 
variation principle. This will be done also in the following treatment. There it is my 
aim to present the basic relations as transparently as possible and in a way as general 
as general relativity allows.” 

Einstein (1916b): “From this it follows, first of all, that gravitational fields spread 
out with the speed of light. . . . [plane] waves transport “energy. . . . One thus gets . . . 
the radiation of the system per unit time. . . . 





Hilbert (1917): “As for the principle of causality, the physical quantities and their 
time-rates of change may be known at the present time in any given coordinate system; 
a prediction will then have a physical meaning only when it is invariant with respect 
to all those transformations for which exactly those coordinates used for the present 
time remain unchanged. I declare that predictions of this kind for the future are alL 
uniquely determined; that is, that the causality principle holds in this formulation: 

“From the knowledge of the 14 physical potentials g^, A vt in the present, all 
predictions about the same quantities in the future follow necessarily and uniquely 
insofar as they have physical meaning.” 



CHAPTER 


WEAK GRAVITATIONAL FIELDS 


The way that can be walked on is not the perfect way. 
The word that can be said is not the perfect word. 

LAO-TZU (—3rd century B.C.) 


J 


§18.1. THE LINEARIZED THEORY OF GRAVITY 


Because of the geometric language and abbreviations used in writing them, Einstein’s 
field equations, G^ v — 8?r7^, hardly seem to be differential equations at all, much 
less ones with many familiar properties. The best way to see that they are is to apply 
them to weak-field situations 




(18.1) 


"Linearized theory of 
gravity"; 

(1) as weak-field limit of 
general relativity 


e.g., to the solar system, where |~ \<P\ < M 0 /R 0 ~ 10~ 6 ; or to a weak gravita¬ 
tional wave propagating through interstellar space. 

In a weak-field situation, one can expand the field equations in powers of h , 
using a coordinate frame where (18.1) holds; and without much loss of accuracy, 
one can keep only linear terms. The resulting formalism is often called “the linearized 
theory of gravity,” because it is an important theory in its own right. In fact, it is 
precisely this “linearized theory” that one obtains when one asks for the classical (2) as standard 
field corresponding to quantum-mechanical particles of (1) zero rest mass and (2) ' field-theory'' description of 
spin two in (3) flat spacetime [see Fierz and Pauli (1939)]. Track-2 readers have 9ravi,y m flat s P ace fme 
already explored linearized theory somewhat in §7.1, exercise 7.3, and Box 7.1. There 
it went under the alternative name, “tensor-field theory of gravity in flat spacetime.” 
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(3) as a foundation for 
"deriving" general relativity 


Details of linearized theory: 


(1) connection coefficients 


(2) "gravitational potentials" 


Just as one can “descend” from general relativity to linearized theory by linearizing 
about flat spacetime (see below), so one can “bootstrap” one’s way back up from 
linearized theory to general relativity by imposing consistency between the linearized 
field equations and the equations of motion, or, equivalently, by asking about: (1) 
the stress-energy carried by the linearized gravitational field A ; (2) the influence 
of this stress-energy acting as a source for corrections h {1) ^ v to the field; (3) the 
stress-energy carried by the corrections A 0 * ; (4) the influence of this stress-energy 
acting as a source for corrections h i2) fiV to the corrections A (1) MJ ,; (5) the stress-energy 
carried by the corrections to the corrections; and so on. This alternative way to derive 
general relativity has been developed and explored by Gupta (1954, 1957, 1962), 
Kraichnan (1955), Thirring (1961), Feynman (1963a), Weinberg (1965), and Deser 
(1970). But because the outlook is far from geometric (see Box 18.1), the details 
of the derivation are not presented here. (But see part 5 of Box 17.2.) 

Here attention focuses on deriving linearized theory from general relativity. Adopt 
the form (18.1) for the metric components. The resulting connection coefficients 
[equations (8.24b)], when linearized in the metric perturbation h , read 




= + V.«- 




(18.2) 


The second line here introduces the convention, used routinely whenever one ex¬ 
pands in powers of A that indices of h^ are raised and lowered using t}^ v and 
not g^ v and g^. A similar linearization of the Ricci tensor [equation (8.47)] yields 


r = r a — r a 

fiv V-v,a fia,v 


— — (h a h a — h a — h ) 


(18.3) 


where 


h = h\ = (18.4) 

After a further contraction to form R = g^R^ ~ y^R^, one finds that the Einstein 
equations, 2 G = 16t 7 7^, read 


h a A- h 1 

i va,fi 


fiv,a 

~ VizAKiB ’ a0 - h ,p p ) = 16w7 ^ 


(18.5) 


The number of terms has increased in passing from R (18.3) to G^ v = R^ v — 
(18.5), but this annoyance can be counteracted by defining 


h 


fiv 


= h 





(18.6) 


and using a bar to imply a corresponding operation on any other symmetric tensor. 
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Box 18.1 DERIVATIONS OF GENERAL RELATIVITY FROM GEOMETRIC 
VIEWPOINT AND FROM SPIN-TWO VIEWPOINT, COMPARED 
AND CONTRASTED 


Einstein Spin-2 

derivation derivation 


Nature of primordial spacetime 
geometry? 

Topology (multiple connected¬ 
ness) of spacetime? 

Vision of physics? 


Starting points for this deri¬ 
vation of general relativity 


Resulting equations 

Resulting assessment of the 
spacetime geometry from which 
derivation started 


View about the greatest single 
crisis of physics to emerge 
from these equations: complete 
gravitational collapse 


Not primordial; geometry is a 
dynamic participant in physics 

Laws of physics are local; 
they do not specify the 
topology 

Dynamic geometry is the 
“master field” of physics 


1. Equivalence principle 
(world lines of photons and 
test particles are geo¬ 
desics of the spacetime 
geometry) 

2. That tensorial conserved 
quantity which is derived 
from the curvature (Cartan’s 
moment of rotation) is to 
be identified with the tensor 
of stress-momentum-energy 
(see Chapter 15). 

Einstein’s field equations 

Fundamental dynamic partici¬ 
pant in physics 


Central to understanding the 
nature of matter and the 
universe 


“God-given” flat Lorentz 
spacetime manifold 

Simply connected Euclidean 
topology 

This field, that field, and 
the other field all execute 
their dynamics in a flat¬ 
spacetime manifold 

1. Begin with field of spin 
two and zero rest mass in 
flat spacetime. 

2. Stress-energy tensor built 
from this field serves as a 
source for this field. 


Einstein’s field equations 

None. Resulting theory eradi¬ 
cates original flat geometry 
from all equations, showing it 
to be unobservable 

Unimportant or at most 
peripheral 






Thus G^ = R to first order in the h^, and = h^; i.e., h^ ] liv h. With 

this notation the linearized field equations become 

- - V^/3,^ + V“» + = 16 " 7 ^- ( 18 - 7 ) < 3 > linearized field equations 

The first term in these linearized equations is the usual flat-space d’Alembertian, 
and the other terms serve merely to keep the equations “gauge-invariant” (see Box 
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18.2). In Box 18.2 it is shown that, without loss of generality, one can impose the 
“gauge conditions” 

(4) gauge conditions = 0. (18.8a) 

These gauge conditions are the tensor analog of the Lorentz gauge A a a = 0 of 
electromagnetic theory. The field equations (18.7) then become 

(5) field equations and metric — 

in Lorentz gauge — 16t 7 7^. (18.8b) 

The gauge conditions {18.8a), the field equations (18.8b), and the definition of the 
metric 


T h^ v T h^ v ^ Tlfii>h (18.8c) 

are the fundamental equations of the linearized theory of gravity written in Lorentz 
gauge. 


EXERCISES 


Exercise 18.1. GAUGE INVARIANCE OF THE RIEMANN CURVATURE 

Show that in linearized theory the components of the Riemann tensor are 


v a J u/ 3v 


~ 2 




(18.9) 


Then show that these components are left unchanged by a gauge transformation of the form 
discussed in Box 18.2 [equation (4b)]. Since the Einstein tensor is a contraction of the 
Riemann tensor, this shows that it is also gauge-invariant. 

Exercise 18.2. JUSTIFICATION OF LORENTZ GAUGE 

Let a particular solution to the field equations (18.7) of linearized theory be given, in an 
arbitrary gauge. Show that there necessarily exist four generating functions { (/, x>) whose 
gauge transformation [Box 18.2, eq. (4b)] makes 

h ueyi ^ a a = 0 (Lorentz gauge). 

Also show that a subsequent gauge transformation leaves this Lorentz gauge condition 
unaffected if and only if its generating functions satisfy the sourceless wave equation 

= 0. 

Exercise 18.3. EXTERNAL FIELD OF A STATIC, SPHERICAL BODY 

Consider the external gravitational field of a static spherical body, as described in the body’s 
(nearly) Lorentz frame—i.e., in a nearly rectangular coordinate system \h v \ < 1, in which 
the body is located atx=j = z = 0 for all t. By fiat, adopt Lorentz gauge. 

(a) Show that the field equations (18.8b) and gauge conditions (18.8a) imply 

V> = 4M/(x 2 + y 2 + z 2 ) 1/2 , . h 0j = h jk = 0, 

h oo = h xx ~ h vv = = 2Af/(x 2 + f + z 2 ) 1/2 , h ap = 0 if a 0, 

where M is a constant (the mass of the body; see §19.3). 
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Box 18.2 GAUGE TRANSFORMATIONS AND COORDINATE TRANSFORMATIONS 
IN LINEARIZED THEORY 

A. The Basic Equations of Linearized Theory, written in any coordinate system that 
is nearly globally Lorentz, are (18.1) and (18.7): 

= V + V> IVI <t 1 ) 
- V,«“ ~ + V’“» + V*% = ( 2 ) 

Two different types of coordinate transformations connect nearly globally Lorentz 
systems to each other: global Lorentz transformations, and infinitesimal coordinate 
transformations. 

L Global Lorentz Transformations: 

= A\,x°\ = 1«T- ( 3a ) 

These transform the metric coefficients via 

3 v-i> 

= £a'/3' = 0 Y a' 0^/3' — 

= ’Ja'/r + A'php,. 

Thus, A —and likewise 72^^—transform like components of a tensor in fiat 
spacetime 

h*? = ( 3b ) 

2. Infinitesimal Coordinate Transformations (creation of “ripples” in the coordi¬ 
nate system): 

x*(?) = x^) 4- £^), (4a) 

where £^(3?) are four arbitrary functions small enough to leave \h^f\ < 1. 
Infinitesimal transformations of this sort make tiny changes in the functional 
forms of all scalar, vector, and tensor fields. Example: the temperature T is 
a unique function of position, T(&)\ so when written as a function of coordinates 
it changes 

T(xfi = a*) = T(x* + p = a*) = T{x* = a* - £*) 

= T(x* = a*) - TJ*; 

i.e., if £° = 0.001 sinfx 1 ), and if T = cos 2 (x°), then 


T — cos 2 (x°) 4- 0.002 sin(x 1 ') cos(x 0 ') sin(x°). 
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Box 18.2 (continued) 


These tiny changes can be ignored in all quantities except the metric, where 
tiny deviations from contain all the information about gravity. The usual 
tensor transformation law for the metric 

g P -Ax a X‘?)\ = J~7 ’ 

when combined with the transformation law (4a) and with 

$,,[*“(*)] = v + V [*“(?)]. 

reveals that 

gp’Ax*' = a°) = Vpa + h pa( xa = a “) - Sp.p ~ Sp.p 

+ negligible corrections — h paa i a and — h pa i a a . 

Hence, the metric perturbation functions in the new (x^) and old (;c^) coordinate 
systems are related by 


K? = h °™ ~ (4b) 

whereas the functionalforms of all other scalars, vectors, and tensors are unaltered, 
to within the precision of linearized theory. 

B. Gauge Transformations and Gauge Invariance, In linearized theory one usually 
regards equation (4b) as gauge transformations, analogous to those 

^new = ^old + ^ (5a) 

of electromagnetic theory. The fact that gravitational gauge transformations do not 
affect the functional forms of scalars, vectors, or tensors (i.e., observables) is called 
4 t g au g e invariance.” Just as a straightforward calculation reveals the gauge invariance 
of the electromagnetic field, 


iTnew — a new _ a new — jold \ \Tf _ Jold _ »/> _ zrold 

fiv fjl,v V t fl ' * ,VH * ,fiv * fiv > 


(5b) 


so a straightforward calculation (exercise 18.1) reveals the gauge invariance of the 
Riemann tensor 


= KU ( 6 ) 

Such gauge invariance was already guaranteed by the fact that Rp Va p are the 
components of a tensor, and are thus essentially the same whether calculated in 
an orthonormal frame g- j = q^, in the old coordinates where g^ v = q + /t®“, or 
in the new coordinates where g^ = ij + hPf. 
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Like the Riemann tensor, the Einstein tensor and the stress-energy tensor are 
unaffected by gauge transformations. Hence, if one knows a specific solution h^ v to 
the linearized field equations ( 2 ) for a given T, one can obtain another solution that 
describes precisely the same physical situation (all observables unchanged) by the change 
of gauge ( 4 ), in which ( are four arbitrary but small functions . 

C. Lorentz Gauge. One can show (exercise 18.2) that for any physical situation, 
one can specialize the gauge (i.e., the coordinates) so that h = 0. This is the 
Lorentz gauge introduced in §18.1. The Lorentz gauge is not fixed uniquely. The 
gauge condition h^ a a — 0 is left unaffected by any gauge transformation for which 


P-'b = 0. 


(See exercise 18.2.) 


D. Curvilinear Coordinate Systems. Once the gauge has been fixed by fiat for a 
given system (e.g., the solar system), one can regard h^ v and h^ as components of 
tensors in flat spacetime; and one can regard the field equations ( 2 ) and the chosen 
gauge conditions as geometric, coordinate-independent equations in flat spacetime. 
This viewpoint allows one to use curvilinear coordinates (e.g., spherical coordinates 
centered on the sun), if one wishes. But in doing so, one must everywhere replace 
the Lorentz components of the metric, 77 ^, by the metric’s components in 

the flat-spacetime curvilinear coordinate system; and one must replace all ordinary 
derivatives (“commas”) in the field equations and gauge conditions by covariant 
derivatives whose connection coefficients come from g^ n ^ See exercise 18.3 for an 
example. 


(b) Adopt spherical polar coordinates, 

x = r sin 6 cos <p y y = r sin 9 sin z = rcos9. 


By regarding h and h as components of tensors in flat spacetime (see end of Box 18.2), 
and by using the usual tensor transformation laws, put the solution found in (a) into the 
form 


V) = 4 'Wr, h nj = h jk = 0, 



hi = 0 . 


, _ 2 M 

hk r 8lk tw 


where g a/3n t are the components of the flat-spacetime metric in the spherical coordinate 
system 

Soon* = -1 ’ Sit fl a t “ S*» nat ~ V ' 

Soo nat = '• 2 sin2e ■ Sap nat = 0 when 
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Linearized theory and 
electromagnetic theory 
compared 


Plane gravitational waves 


How to analyze effects of 
weak gravity on matter 


Thereby conclude that the general relativistic line element, accurate to linearized order, is 
ds 2 = -(1 - 2 M/r) di 2 + (1 4- 2M/r)(dr 2 + r 2 dO 2 + r 2 sin 2 0 ^ 2 ). 

(c) Derive this general, static, spherically symmetric, Lorentz-gauge, vacuum solution to 
the linearized field equations from scratch, working entirely in spherical coordinates. [Hint: 
As discussed at the end of Box 18.2, tj^ in equation (18.8c) must be replaced by g Mr : 
and in the field equations and gauge conditions (18.8a, b), all commas (partial derivatives) 
must be replaced by covariant derivatives, whose connection coefficients come from g M ,, nat ] 
(d) Calculate the Riemann curvature tensor for this gravitational field. The answer should 
agree with equation (1.14). 


§18.2. GRAVITATIONAL WAVES 

The gauge conditions and field equations (18.8a, b) of linearized theory bear a close 
resemblance to the equations of electromagnetic theory in Lorentz gauge and flat 
spacetime, 


A a a = 0, (18.10a) 

-A»/ = And*. (18.10b) 

They differ only in the added index ( h** versus T^ v versus J A ). Consequently, 
from past experience with electromagnetic theory, one can infer much about linear¬ 
ized gravitation theory. — - ' 

For example, the field equations (18.8b) must have gravitational-wave solutions. 
The analog of the electromagnetic plane wave 

A x = A*(t-z), A v = A v (t — z), A* = 0, A 0 = 0, 


will be the gravitational plane wave 


h xx _ jfx( { _ z y h xy _ frryfj _ z y h w - h vvQ _ z y 

h* 0 = h** = 0 for all ja. 


(18.11) 


Although a detailed study of such waves will be delayed until Chapters 35-37, some 
properties of these waves are explored in the exercises at the end of the next section. 


§18.3. EFFECT OF GRAVITY ON MATTER 

The effects of weak gravitational fields on matter can be computed by using the 
linearized metric (18.1) and Christoffel symbols (18.2) in the appropriate equations 
of motion—i.e., in the geodesic equation (for the motion of particles or light rays), 
in the hydrodynamic equations (for fluid matter), in Maxwell’s equations (for elec¬ 
tromagnetic waves), or in the equation V • T — 0 for the total stress-energy tensor 
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of whatever fields and matter may be present. Exercises 18.5, 18.6 and 18.7 provide 
examples, as do the Newtonian-limit calculations in exercises 16.1 and 16.4, and 
in §17.4. If, however, the lowest-order (linearized) gravitational “forces” (Chris- 
toffel-symbol terms) have a significant influence on the motion of the sources of 
the gravitational field, one finds that the linearized field equation (18.7) is inadequate, 
and better approximations to Einstein’s equations must be considered. [Thus emission 
of gravitational waves by a mechanically or electrically driven oscillator falls within 
the scope of linearized theory, but emission by a double-star system, or by stellar 
oscillations that gravitational forces maintain, will require discussion of nonlinear 
terms (gravitational “stress-energy”) in the Einstein equations; see §§36.9 to 36.11.] 
The above conclusions follow from a consideration of conservation laws associated 
with the linearized field equation. Just as the electromagnetic equations (18.10a, b) 
guarantee charge conservation 


= 0, J J°(t, x)dx dydz = Q = const, 

all space 


so the gravitational equations (18.8a, b) guarantee conservation of the total 4 -mo¬ 
mentum and angular momentum of any body bounded by vacuum: 

7^, = 0 , (18.12a) 


Conservation of 4-momentum 
and angular momentum in 
linearized theory 


f T M0 (t, jc) dx dy dz = P* = const; (18.12b) 

body 

(,x a TP*-x*T*)' ll = 0, (18.13a) 


J (x a r /30 — x /3 r a0 )dxdydz = J a P = const. (18.13b) 

oody 


(See §5.11 for the basic properties of angular momentum in special relativity. The 
angular momentum here is calculated relative to the origin of the coordinate system.) 
Now it is important that the stress-energy components which appear in the 
linearized field equations (18.7) and in these conservation laws, are precisely the 
components one would calculate using special relativity (with g^ v = 77 ^). As a result, 
the energy-momentum conservation formulated here contains no contributions or 
effects of gravity! From this one sees that linearized theory assumes that gravitational 
forces do no significant work. For example, energy losses due to gravitational 
radiation-damping forces are neglected by linearized theory. Similarly, conservation 
-oF4-memerrtrm' each oFtKe Bodies acting as sources of h^ v means that each 
body moves along a geodesic of 77 ^ (straight lines in the nearly Lorentz coordinate 
system) rather than along a geodesic of + h^ v . Thus, linearized theory can 

be used to calculate the motion of test particles and fields, using g^ v = q + h ; 
but to include gravitational corrections to the motion of the sources themselves—to 
allow them to satisfy T^’. v = 0 rather than T^ v v = 0 —one must reinsert into the 
field equations the nonlinear terms that linearized theory discards. (See, e.g., Chapter 
20 on conservation laws; §§36.9-36.11 on the generation of gravitational waves and 
radiation reaction; and Chapter 39 on the post-Newtonian approximation.) 


Limit on validity of linearized 
theory: gravity must not 
affect motions of sources 
significantly 
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The energy, momentum, and angular momentum radiated by gravitational waves 
in linearized theory can be calculated by special-relativistic methods analogous to 
those used in electromagnetic theory for electromagnetic waves [Fiertz and Pauli 
(1939)], but it will be more informative and powerful to use a fully gravitational 
approach (Chapters 35 and 36). 


EXERCISES 


Exercise 18.4. SPACETIME CURVATURE FOR A PLANE 
GRAVITATIONAL WAVE 

Calculate the components of the Riemann curvature tensor [equations (18.9)] for the gravita¬ 
tional plane wave (18.11). [Answer: 

RtQtO ~ ^yOyO = — ^xOxz = + RyOyz ~ + ^xzxz = — ^yzys ~ “ ~K^xx ~ ^ w\tt ’ 


RxOyO — —RxOyg ~~ + ^xzyz ~~ ~ ^xzyO ~ “ 2 


all other components vanish except those obtainable from the above by the symmetries 

RctpyS = R[afi][y&\ = RySap- 

Exercise 18.5. A PRIMITIVE GRAVITATIONAL-WAVE DETECTOR 
(see Figure 18.1) 

Two beads slide almost freely on a smooth stick; only slight friction impedes their sliding. 
The stick falls freely through spacetime, with its center moving along a geodesic and its ends 
attached to gyroscopes, so they do not rotate. The beads are positioned equidistant (distance 
| X) from the stick’s center. Plane gravitational waves [equation (18.11) and exercise 18.4]. 
impinging on the stick, push the beads back and forth (“geodesic deviation”; “tidal gravita¬ 
tional forces”). The resultant friction of beads on stick heats the stick; and the passage of 
the waves is detected by measuring the rise in stick temperature.* (Of course, this is not the 
best of all conceivable designs!) Neglecting the effect of friction on the beads’ motion, 
calculate the proper distance separating them as a function of time. [Hints: Let f be the 
separation between the beads; and let n = f/|f| be a unit vector that points along the stick 
in the stick’s own rest frame. Then their separation has magnitude 1 = f ■ n. The fact that 
the stick is nonrotating is embodied in a parallel-transport law for n, V u o = 0. (“Fermi- 
Walker transport” of §§6.5,6.6, and 13.6 reduces to parallel transport, because the stick moves 
along a geodesic with a = V u i/ = 0.) Thus, 

di/dr = V„(f-«) = (V w f)-/J^ - 

d 2 i/dr 2 = V„ V„(f • n) = (▼„ ) * n. 


where t is the stick’s proper time. But V w V u f is produced by the Riemann curvature of 
the wave (geodesic deviation): 

= projection along n of [ — Riemann (...,«, f,«)]. 


(The geodesic-deviation forces perpendicular to the stick, i.e., perpendicular to n, are coun- 


*This thought experiment was devised by Bondi [1957, 1965; Bondi and McCrea (i960)] as a means 
for convincing skeptics of the reality of gravitational waves. 



§18.4. NEARLY NEWTONIAN GRAVITATIONAL FIELDS 


445 



Figure 18.1. 

A primitive detector for gravitational waves, consisting of a 
beaded stick with gyroscopes on its ends [Bondi (1957)]. See 
exercise 18.5 for discussion. 


terbalanced by the stick’s pushing back on the beads to stop them from passing through 
it—no penetration of matter by matter!) Thus, 

d 2 l/dr 2 = —Riemann (..., i/, f, u) • #i = — Riemann (n, i/, f, u). 

Evaluate this acceleration in the stick’s local Lorentz frame. Orient the coordinates so the 
waves propagate in the z-direction and the stick’s direction has components n z — cos 9 y 
n x = sin 9 cos n y = sin 9 sin Solve the resulting differential equation for X(r).] [Answer: 

1 = l ° [ 1 + \ - K«) sm2e cos 2< ?> + j K sin2 0 sin 2< ?>]. 

where h jk are evaluated on the stick’s world line (x = y — z — 0). Notice that, if the stick 
is oriented along the direction of wave propagation (if 9 = 0), the beads do not move. In 
this sense, the effect of the waves (geodesic deviation) is purely transverse. For further 
discussion, see §§35.4 to 35.6.] 
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The general solution to the linearized field equations in Lorentz gauge [equations 
(18.8a, b)] lends itself to expression as a retarded integral of the form familiar from 
electromagnetic theory: 


'ILV 


(,,*) = / 4 V'—I*-*'!•*'> 


d 3 x\ 


X - X' 


(18.14) 


The gravitational-wave aspects of this solution will be studied in Chapter 36. Here 
focus attention on a nearly Newtonian source: Too ^ ^oo ^ I T jk l and velocities 
slow enough that retardation is negligible. In this case, (18.14) reduces to 


*00 = -4 <P, 


K, = h ,k = o- 


<P{t,x) = — f — — — - d 3 x' = Newtonian potential. 
J \x - x'| 


(18.15a) 

(18.15b) 


The corresponding metric (18.8c) is 

ds 2 = -(1 + 20) dt 2 + (1 - 2 0)(dx 2 + df + dz 2 ) (18.15c) 

— (1 — 2 M/r)dt 2 + (1 4- 2M/r)(dx 2 + dy 2 + dz 2 ) far from source. 


Retarded-integral solution of 
linearized field equation 


Newtonian gravity as a limit 
of linearized theory 
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Bending of light and 
gravitational redshift 
predicted by linearized theory 


EXERCISES 


The errors in this metric are: (1) missing corrections of order <P 2 due to nonlinearities 
of which linearized theory is oblivious; (2) missing corrections due to setting h 0i = 0 
(these are of order h 0j - <Pv , where v ~~ \T 0 j \/T 00 is a typical velocity in the source); 
(3) missing corrections due to setting h jk — 0 [these are of order hj k ~~ ^QT^I/Tqq)]. 
In the solar system all these errors are ~~ 10~ 12 , whereas <P ■ 10~ 6 . 

Passive correspondence with Newtonian theory demanded only that g 00 = 
_(1 + 2 <P)\ see equation (17.19). However, linearized theory determines all the 
metric coefficients, up to errors of and ~~<P(\T ik \/T 00 ). This is sufficient 

accuracy to predict correctly (fractional errors ~10~ 6 ) the bending of light and the 
gravitational redshift in the solar system, but not perihelion shifts. 


Exercise 18.6. BENDING OF LIGHT BY THE SUN 

To high precision, the sun is static and spherical, so its external line element is (18.15c) with 
0 =z —M/r\ i.e., 

ds 2 = -(1 - 2M/r) dt 2 + (1 + 2M/r)(dx 2 + dy 2 + dz 2 ) everywhere outside sun. (18,16) 

A photon moving in the equatorial plane (z = 0) of this curved spacetime gets deflected 
very slightly from the world line 

x — t , y — b = “impact parameter,” z = 0. (18.17) 

Calculate the amount of deflection as follows. 

(a) Write down the geodesic equation (16.4a) for the photon’s world line, 

^l + fVV = ^ < 18 - 18 > 


[Here p = d/dX* = (4-momentum of photon) = (tangent vector to photon’s null geodesic).] 

(b) By evaluating the connection coefficients in the equatorial plane, and by using the 
approximate values, \p v \ p° zz p x , of the 4-momentum components corresponding to the 
approximate world line (18.17), show that 


dp v —2Mb x dx 

lx* - (x 2 + b-f 2 P d\*’ 


= /[ 1 + o(^)] = const [l +0(^). 


(c) Integrate this equation for p v , assuming p v = 0 at x = — oo (photon moving precisely 
in x-direction initially); thereby obtain 


py(x = + oo) = 



(d) Show that this corresponds to deflection of light through the angle 

A<t> - 4M/b = V'15 ( R 0 /b ), (18.19) 

where R Q is the radius of the sun. For a comparison of this prediction with experiment, 
see Box 40.1. 
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Exercise 18.7. GRAVITATIONAL REDSHIFT 

/ 

(a) Use the geodesic equation for a photon, written in the form 

dpjdx* - r a ^p a p< i = 0, 

to prove that any photon moving freely in the sun’s gravitational field [line element (18.16)] 
has d pjdkl ^ 0; i.e^- -- 

po = constant along photon’s world line. (18.20) 

(b) An atom at rest on the sun’s surface emits a photon of wavelength X e , as seen in its 
orthonormal frame. [Note: 

hv e = h/X e = (energy atom measures) = — p • i/ e , (18.21) 

where p is the photon’s 4-momentum and u e is the emitter’s 4-velocity.] An atom at rest 
far from the sun receives the photon, and measures its wavelength to be \ r . [Note: h/\ r = 
—p • u r ] Show that the photon is redshifted by the amount 

z = Xr 7 — = -^2- = 2 X 10~ 6 . (18.22) 

^ 

[Hint: u r = 0/0/; u e = (1 — 2M/r)~ l/2 0/0/. Why?] For further discussion of the gravitational 
redshift and experimental results, see §§7.4 and 38.5; also Figures 38.1 and 38.2. 
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MASS AND ANGULAR MOMENTUM 
OF A GRAVITATING SYSTEM 


Metric far from a weakly 
gravitating system, as a 
power series in 1 fr\ 


(1) derivation 


§19.1. EXTERNAL FIELD OF A WEAKLY 
GRAVITATING SOURCE 

Consider an isolated system with gravity so weak that in calculating its structure 
and motion one can completely ignore self-gravitational effects. (This is true of an 
asteroid, and of a nebula with high-energy electrons and protons spiraling in a 
magnetic field; it is not true of the Earth or the sun.) Assume nothing else about 
the system—for example, by contrast with Newtonian theory, allow velocities to be 
arbitrarily close to the speed of light, and allow stresses V k and momentum densities 
T°> to be comparable to the mass-energy density T 00 . 

Calculate the weak gravitational field, 


h/Av — hfiv J* 


guv — V' + 

r4TJf - |x - *1,*') 
J lx - x'\ 


(19.1) 

(19.2) 


produced by such a system [see “baired” version of equation (18.14)]. Restrict 
attention to the spacetime region far outside the system, and expand h^ in powers 
of Jt'/r = jc'/|jc|, using the relations 


T uv (t -\x- x'\ 


\x — x'\ r + r 2 r + 2 r 3 


r,Aol(r- Ix-x 1 ]f, 

(19.3a) 


(19.3b) 

’-r' 2 S jk ) 

2 ' 

(19.3c) 
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Perform the calculation in the system’s rest frame, where 

pi = J T °i d 3 x = 0, 

with origin of coordinates at the system’s center of mass 

f x’T°°d 3 x = 0. 

The result, after a change of gauge to simplify h 00 and h 0j , is 

+ <>(£)]* 2 - + °( Jf )]<* 


(19.4a) 


(19.4b) 


( j 2M\ ^ / gravitational 

r / \that die out 


radiation terms 
as 0(1 /r) 


^ dx’dx k . 


(19.5) 


(see exercise 19.1 for derivation.) Here M and S k are the body’s mass and intrinsic 
angular momentum. 

M = J r°°d 3 x, (19.6a) 

„x l T m0 d 3 x. (19.6b) 


W 1 




The corresponding Newtonian potential is 

* = - j (goo - Voo) = - y + 0 (4f)- (19.6c) 

Conclusion: With an appropriate choice of gauge, <P and g 00 far from any weak 
source are time-independent and are determined uniquely by the source’s mass M; 
g 0j is time-independent and is fixed by the source’s intrinsic angular momentum 
S>; but g jk has time-dependent terms (gravitational waves!) of 0(l/r). 

The rest of this chapter focuses on the “imprints” of the mass and angular mo¬ 
mentum in the gravitational field; the gravitational waves will be ignored, or almost 
so, until Chapter 35. 


Exercise 19.1. DERIVATION OF METRIC FAR OUTSIDE A WEAKLY 
GRAVITATING BODY 

(a) Derive equation (19.5). [Hints: (1) Follow the procedure outlined in the text. (2) When 
calculating h 00 , write out explicitly the n = 0 and n = 1 terms of (19.2), to precision 0(l/r 2 ), 
and simplify the n — 0 term using the identities 

T’ k = y (7' 00 .x*.x*) 00 + (T l >x k + T ,k x>), - j(T lmxi x k ), tm , (19.7a) 

T ll x m = (r 0l x l x m - j T° m r 2 } + ^T lk x k x m - 1 T lm r-\ . (19.7b) 


(2) result 


How metric depends on 
system's mass M and 
angular momentum S 


EXERCISE 
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(Verify that these identities follow from T a/3 ^ = 0.) (3) When calculating h 0m , write out 
explicitly the n = 0 term of (19.2), to precision 0(l/r 2 ), and simplify it using the identity 

p>k x i + p>i x k _ (T°°x>x k ) 0 + (T° l x>x k )j. (19.7c) 

(Verify that this follows from T tt/3 ^ = 0.) (4) Simplify h ou and h 0m by the gauge transforma¬ 
tion generated by 


Co = ~ f T°°'r' 2 d 3 x' +^f ^T ok ‘x k 'x>' - ±T*r ,a ) d 3 x' 

j d 3 x' 


c r x j x j 

+ J (V + hi) p2 




r xix? . (3x J 'x r - r ,2 8 jk )x i x h 
~ ' 2 r 4 

(r - |jc - x'\) n 


\x — x'\ 


d 3 x\ 


= - 73 -/ *»*'*“’+ 4 2 r ° 

ti — 1 


/ (r - lx - xp" 
71 |x — x'| 


</ 3 x' 


+ ^Co - 7(7) / 7-ooV' 2 * 3 *' - (jr) m f {t** t * v ~ 7 r ° fc> ' 2 )***' 
V 1 en " 2 f(7 * . j „r (r-lx-x1)» 1 

(00 +r “ ) L Ix-x'l J. 


Here TJ denotes T^{t - r, x').] 

(b) Prove that the system’s mass and angular momentum are conserved. [Note: Because 
T a & 0 = 0 (self-gravity has negligible influence), the proof is no different here than in flat 
spacetime (Chapter 5).] 


§19.2. MEASUREMENT OF THE MASS AND 
ANGULAR MOMENTUM 


For a weakly gravitating 
system: 


(1) total mass Mean be 
measured by applying 
Kepler's "1 -2-3" law to 
orbiting particles 


The values of a system’s mass and angular momentum can be measured by probing 
the imprint they leave in its external gravitational field. Of all tools one might use 
to probe, the simplest is a test particle in a gravitationally bound orbit. If the particle 
is sufficiently far from the source, its motion is affected hardly at all by the source’s 
angular momentum or by the gravitational waves; only the spherical, Newtonian 
part of the gravitational field has a significant influence. Hence, the particle moves 
in an elliptical Keplerian orbit. To determine the source’s mass M, one need only 
apply Kepler’s third law (perhaps better called “Kepler’s 1-2-3 law”): 


M = 


_277_\ 2 /Semi-major axis\ 3 

orbital period / \of ellipse / 


i.e., M 1 = <o 2 a 3 . (19.8) 


The source’s angular momentum is not measured quite so easily. One must use 
a probe that is insensitive to Newtonian gravitational effects, but “feels” the off- 
diagonal term, 


g 0j = — 2e ikl S k x l /r z , 


( 19 . 9 ) 
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in the metric (19.5). One such probe is the precession of the perihelion of a corevolv¬ 
ing satellite, relative to the precession for a counterrevolving satellite. A gyroscope 
is another such probe. Place a gyroscope at rest in the source’s gravitational field. 
By a force applied to its center of mass, prevent it from falling. As time passes, 
the g 0j term in the metric will force the gyroscope to precess relative to the basis 
vectors d/dx*; and since these basis vectors are “tied” to the coordinate system, which 
in turn is tied to the Lorentz frames at infinity, which in turn are tied to the “fixed 
stars” (cf. §39.12), the precession is relative to the “fixed stars.” The angular velocity 
of precession, as derived in exercise 19.2, is 


(2) total angular momentum 
S can be measured by 
examining the precession of 
gyroscopes 



-5 + 


3(5 * x)x~ 

7 s / 


(19.10) 


One sometimes says that the source’s rotation “drags the inertial frames near the 
source,” thereby forcing the gyroscope to precess. For further discussion, see §§21.12, 
40.7, and 33.4. 


Exercise 19.2. GYROSCOPE PRECESSION EXERCISE 

Derive equation (19.10) for the angular velocity of gyroscope precession. [Hints: Place an 
orthonormal tetrad at the gyroscope’s center of mass. Tie the tetrad rigidly to the coordinate 
system, and hence to the “fixed stars”; more particularly, choose the tetrad to be that basis 
i e a) which is dual to the following 1-form basis: 

cu i = [1 - (2M/r)] 1/2 dt + 2€j kt S k (x l /r 3 ) dx\ »*=[! + (2 M/r)] 1/2 dxK (19.11) 

The spatial legs of the tetrad, «j, rotate relative to the gyroscope with an angular velocity 
a given by [see equation (13.69)] 

= r V,b- 

Consequently, the gyroscope’s angular momentum vector L precesses relative to the tetrad 
with angular velocity 12 = — <o: 

~r = * m a'LK = r m . (19.12) 

Calculate f for the given orthonormal frame, and thereby obtain equation (19.10) for 12.] 


§19.3. MASS AND ANGULAR MOMENTUM OF FULLY 
RELATIVISTIC SOURCES 

Abandon, now, the restriction to weakly gravitating sources. Consider an isolated, 
gravitating system inside which spacetime may or may not be highly curved—a black 
hole, a neutron star, the Sun,... But refuse, for now, to analyze the system’s interior 
or the “strong-field region” near the system. Instead, restrict attention to the weak 
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gravitational field far from the source, and analyze it using linearized theory in 
vacuum. Expand A in multipole moments and powers ofl/r; and adjust the gauge, 
the Lorentz frame, and the origin of coordinates to simplify the resulting metric. 
The outcome of such a calculation is a gravitational field identical to that for a weak 
source [equation (19.5)]! (Details of the calculation are not spelled out here because 
of their length; but see exercise 19.3.) 

But before accepting this as the distant field of an arbitrary source, one should 
examine the nonlinear effects in the vacuum field equations. Two types of nonlinear¬ 
ities turn out to be important far from the source: (1) nonlinearities in the static, 
Newtonian part of the metric, which generate metric corrections 

8g 00 = — 2M 2 /r 2 , 8g jk = | (M 2 /r 2 ) 8 )k , 


Metric far from any 
gravitating system, as a 
power series in 1/r 


Failure of volume integrals 
for M and S when source 
has strong internal gravity 


(see exercise 19.3 and §39.8), thereby putting the metric into the form 

** = - [i - v + + 0 (pr)] <* 2 - [>. s * $ + 0 (pr)] * 


[(' 


+ IU + 2* + ^ 


. (19.13) 


(2) a gradual decrease in the source’s mass, gradual changes in its angular momen¬ 
tum, and gradual changes in its “rest frame” to compensate for the mass, angular 
momentum, and linear momentum carried off by gravitational waves (see Box 19.1, 
which is best read only after finishing this section). 

By measuring the distant spacetime geometry (19.13) of a given source, one cannot 
discover whether that source has strong internal gravity, or weak. But when one 
expresses the constants M and Sj, which determine g 00 and g ojy as integrals over 
the interior of the source, one discovers a crucial difference: if the internal gravity 
is weak, then linearized theory is valid throughout the source, and 

M = f T m d 3 x, Sj = f'i jkl x k T l0 d 3 x; (19.14) 

but if the gravity is strong, these formulas fail. Does this failure prevent one, for 
strong gravity, from identifying the constants M and S i of the metric (19.13) as 
the source’s mass and angular momentum? Not at all, according to the following 
argument. 

Consider, first, the mass of the sun. For the sun one expects Newtonian theory 
to be highly accurate (fractional errors ~~ M Q /R Q ~~ 10 -6 ); so one can assert that 
the constant M appearing in the line element (19.13) is, indeed 

M = j p d z x = J Too d z x = total mass. 

But might this assertion be wrong? To gain greater confidence and insight, adopt 
the viewpoint of “controlled ignorance i.e., do not pretend to know more than what 
is needed. (This style of physical argument goes back to Newton’s famous “Hypoth¬ 
eses non fingo,” i.e. “I do not feign hypotheses.”) In evaluating the volume integral 
of Tqq (usual Newtonian definition of M ), one needs a theory of the internal structure 
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of the sun. For example, one must know that the visible surface layers of the sun 
do not hide a massive central core, so dense and large that relativistic gravitational 
fields \0\ ~~ 1 exist there. If one makes use in the analysis of a fluid-type stress-energy 
tensor T fiy , one needs to know equations of state, opacities, and theories of energy 
generation and transport. One needs to justify the fluid description as an adequate 
approximation to the atomic constitution of matter. One needs to assume that an 
ultimate theory of matter explaining the rest masses of protons and electrons will 
not assign an important fraction of this mass to strong (nonlinear) gravitational fields 
on a submicroscopic scale. It is plausible that one could do all this, but it is also 
obvious that this is not the way the mass of the sun is, in fact, determined by 
astronomers! Theories of stellar structure are adjusted to give the observed mass; 
they are not constructed to let one deduce the mass from nongravitational observa¬ 
tions. The mass of the sun is measured in practice by studying the orbits of planets 
in its external gravitational field, a procedure equivalent to reading the mass M off 
the line element (19.13), rather than evaluating the volume integral fT 00 d 3 x. 

To avoid all the above uncertainties, and to make theory correspond as closely 
as possible to experiment, one defines the “total mass-energy 99 M of the sun or any 
other body to be the constant that appears in the line element (19.13) for its distant 
external spacetime geometry. Similarly , one defines the body's intrinsic angular mo¬ 
mentum as the constant 3-vector S appearing in its line element (19.13). Operationally, 
the total mass-energy Mis measured via Kepler’s third law; the angular momentum 
S is measured via its influence on the precession of a gyroscope or a planetary orbit. 
This is as true when the body is a black hole or a neutron star as when it is the 
sun. 

What kind of a geometric object is the intrinsic angular momentum 5? It is defined 
by measurements made far from the source, where, with receding distance, spacetime 
is becoming flatter and flatter (asymptotically flat). Thus, it can be regarded as a 
3-vector in the “asymptotically flat spacetime ” that surrounds the source. But in what 
Lorentz frame is 5 a 3-vector? Clearly, in the asymptotic Lorentz frame where the 
line element (19.13) is valid; i.e., in the asymptotic Lorentz frame where the source’s 
distant “coulomb” (“M/r”) field is static; i.e., in the “asymptotic rest frame ” of the 
source. Alternatively, one can regard S as a 4-vector, S, which is purely spatial 
(S° = 0) in the asymptotic rest frame. If one denotes the 4-velocity of the asymptotic 
rest frame by U, then the fact that S is purely spatial can be restated geometrically 
as S* U — 0, or 

SP = 0, (19.15) 

where 

P = MU = “total 4-momentum of source” (19.16) 

is still another vector residing in the asymptotically flat region of spacetime. 

The total 4-momentum P and intrinsic angular momentum S satisfy conservation 
laws that are summarized in Box 19.1. These conservation laws are valuable tools 
in gravitation theory and relativistic astrophysics, but the derivation of these laws 
(Chapter 20) does not compare in priority to topics such as neutron stars and basic 
cosmology; so most readers will wish to skip it on a first reading of this book. 

(continued on page 456 ) 


Definition of "total 
mass-energy" M and 
"angular momentum" S in 
terms of external gravitational 
field 


5 as a geometric object in an 
asymptotically flat region far 
outside source 


"Asymptotic rest frame" and 
"total 4-momentum" 


Conservation laws for total 
4-momentum and angular 
momentum 



Box 19.1 TOTAL MASS-ENERGY, 4-MOMENTUM, AND ANGULAR 
MOMENTUM OF AN ISOLATED SYSTEM 

A. Spacetime is divided into (1) the source’s inte¬ 
rior; which is surrounded by (2) a strong-field } 
vacuum region; which in turn is surrounded by I 
(3) a weak-field, asymptotically flat near-zone re- j 
gion; which in turn is surrounded by (4) a weak- { 
field, asymptotically flat, radiation-zone region. | 

This box and this chapter treat only the asympto- } 
tically flat regions. The interior and strong-field j 

regions are treated in the next chapter. j 

V 


B. The asymptotic rest frame of the source is that 
global, asymptotically Lorentz frame (coordinates 
r, x, y 9 z) in which the distant, “coulomb” part of 
the source’s field is at rest (see diagram). The 
asymptotic rest frame does not extend into the 
strong-field region; any such extension of it would 
necessarily be forced by the curvature into a highly 
non-Lorentz, curvilinear form. The spatial origin 
of the asymptotic rest frame is so adjusted that 
the source is centered on it—i.e., that the distant 
Newtonian potential is <P = — M/(x 2 + y 2 + 
z 2 ) 1/2 + 0(1/r 3 ); i.e., that <P has no dipole term, 
D • x/r 3 , such as would originate from an offset 
of the coordinates. 

C. To the source one can attribute a total mass-energy M, a 4-velocity U , a total 
4-momentum P, and an intrinsic angular momentum vector, S. The 4-vectors U 9 P, 
and S reside in the asymptotically flat region of spacetime and can be moved about 
freely there (negligible curvature => parallel transport around closed curves does not 
change U , P, or S). The source’s 4-velocity U is defined to equal the 4-velocity of 
the asymptotic rest frame (f/° = 1, t/ = 0 in rest frame). The total mass-energy M 
is measured via Kepler’s third (“1-2-3”) law [equation (19.8)]. The total 4-momentum 
is defined by P = MU. The intrinsic angular momentum S is orthogonal to the 
4-velocity U 9 S • U = 0 (so S° = 0; S ^ 0 in general in asymptotic rest frame); S 
is measured via gyroscope precession or differential perihelion precession (§19.2). 

In the asymptotic rest frame, with an appropriate choice of gauge (i.e., of ripples 
in the coordinates), the slight deviations from flat-spacetime geometry are described 
by the line element 



t 



Interior 









ds 2 


_ _ [i _ 2A 

r 


2M + 2Mi + 0 
r 2 

3 M 2 


(^)j dt 2 “ [Sti 5 * ff + 0 (jz)] dt dxi 


+ ^1 + + (gravitational radiation terms) dx i dx k . 


( 1 ) 


D. Conservation of 4-momentum and angular mo¬ 
mentum: Suppose that particles fall into a source 
or are ejected from it; suppose that electromag¬ 
netic waves flow in and out; suppose the source 
emits gravitational waves. All such processes break 
the source’s isolation and can change its total 4- 
momentum P, its intrinsic angular momentum S, 
and its asymptotic rest frame. Surround the source 
with a spherical shell 5, which is far enough out 
to be in the asymptotically flat region. Keep this 
shell always at rest in the source’s momentary 
asymptotic rest frame. By probing the source’s 
gravitational field near 5, measure its 4-momen- 
tum P and intrinsic angular momentum S as func¬ 
tions of the shell’s proper time r. An analysis given 
in the next chapter reveals that the 4-momentum 
is conserved, in the sense that 



Interstellar debris falls into a black hole, 
and gravitational waves emerge. 


dP a 

dr 



d(area) = 


rate at which 4-momentum 
flows inward through shell, 


( 2 ) 


where n is the unit outward normal to 5 and the integral is performed in the shell’s 
momentary rest frame. In words: the rate at which 4-momentum flows through the 
shell, cis measured in the standard special relativistic manner , equals the rate of change 
of the source's gravitationally measured 4-momentum. Similarly , the angular momen¬ 
tum is conserved in the sense that 


dS j rate at which angular \ 

—- = — I (c i j k x i T kl )n 1 d( area) = momentum flows inward I , (3a) 

5 \ through the shell j 


dS 0 

dr 


dU a ( change required to keep S orthogonal to U, \ 
dr a VTermi-Walker-transport law”: cf. §§6.5, 13.6/ 


(3b) 


In these conservation laws T Q P is the total stress-energy tensor at the shell, including 
contributions from matter, electromagnetic fields, and gravitational waves. The 
gravitational-wave contribution, called is treated in Chapter 35. 

Note: The conservation laws in the form stated above contain fractional errors 
of order M/r (contributions from “gravitational potential energy” of infalling mate¬ 
rial), but such errors go to zero in the limit of a very large shell (r —*- oo). 

Note: The formulation of these conservation laws given in the next chapter is 
more precise and more rigorous, but less physically enlightening than the one here. 
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EXERCISE 


Exercise 19.3. GRAVITATIONAL FIELD FAR FROM A STATIONARY, 

FULLY RELATIVISTIC SOURCE 

Derive the line element (19.13) for the special case of a source that is time-independent 
(gitvj = 0). This can be a difficult problem, if one does not proceed carefully along the 
following outlined route. (1) Initially ignore all nonlinearities in the Einstein field equations. 
The field is weak far from the source. These nonlinearities will be absent from the dominant 
terms. (2) Calculate the dominant terms using linearized theory in the Lorentz gauge [equa¬ 
tions (18.8)]. (3) In particular, write the general solution to the vacuum, time-independent 
wave equation (18.8b) in the following form involving n* = x*/r ^ (unit vector in radial 
direction): 



A jk = A^ k) , B* kl = B^ k)l . 


(Round brackets denote symmetrization.) (4) Then impose the Lorentz gauge conditions 
V\/8 = 0 on this general solution, thereby learning 


A* ~ 0, A* = 0, 

B’ k (8 jk - 3n>n k ) = 0, (19.18) 

B’ kl (8 ki — 3 n k n L ) = 0. 

(5) Write B’ k as the sum of its trace 3 B 9 its traceless symmetric part S ik , and its traceless 
antisymmetric part (these are its “irreducible parts”): 

B’ k = B8> k + S’ k + V kl F\ S” = 0. (19.19) 


Show that any tensor B* k can be put into such a form. Then show that the gauge conditions 
(19.18) imply S jk = 0. (6) Similarly show that any tensor B jkl that is symmetric on its first 
two indices can be put into the form 

B’ kl = 8’ k A l + C { ’8 k)l -f ^ m ^E k)m _|_ $jki 

E km symmetric and traceless, i.e,, E km = E ikm \ E kk = 0, (19.20) 

S ikl symmetric and traceless, i.e., S ikl = S iikl \ 

S m = S ikk = S iki = 0. 

Then show that the gauge conditions (19.18) imply O — —2A j and E km = S jkl = 0. 
(7) Combining all these results, conclude that 


r _ A 0 , Bin* , n / 1 \ 

h oo- — + —pr + 0 (. 73 )’ 


€ jkl n kpl g n j 


K - -p. + 

r _ S’ k A l n l - / 

fl — 


Bn’ „ / 1 \ 

r 2 0 (7^)’ 


(19.21) 


- _ S’ k A l n l — A’n k — A k n> , „ ( 1 \ 

ik ~ P + ° \?J- 
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Then use gauge transformations, which stay within Lorentz gauge, to eliminate B and A’ 
from h oj and h jk ; so 



(B> + A*)n> 
r 2 



S = £ ^ + o(i). C9-22) 


(8) Translate the origin of coordinates sox^ new = x’ old — (B j + A’)/A°; in the new coordinate 
system h ajQ has the same form as (19.22), but with B j + A j removed. From the resultant 
h a p, construct the metric and redefine the constants A 0 and F l to agree with equation (19.13). 

(9) All linear terms in the metric are now accounted for. The dominant nonlinear terms 
must be proportional to the square, (M/r) 2 , of the dominant linear term. The easiest way 
to get the proportionality constant is to take the Schwarzschild geometry for a fully relativistic, 
static, spherical source [equation (31.1)], by a change of coordinates put it in the form 


ds2=z ~ (' ! + m % )' d ’ 2 + ( 1 + ! r ) 4 (dx 2 + dy 2 + dz2) (19 - 23) 


(exercise 25.8), and expand it in powers of M/r. 


§19.4. MASS AND ANGULAR MOMENTUM OF 
A CLOSED UNIVERSE 

“There are no snakes in Ireland. " 

Statement of St. Patrick 
after driving the snakes 
out of Ireland (legend*) 


There is no such thing as “the energy (or angular momentum, or charge) of a closed 
universe,” according to general relativity, and this for a simple reason. To weigh 
something one needs a platform on which to stand to do the weighing. 

To weigh the sun, one measures the periods and semimajor axes of planetary orbits, 
and applies Kepler’s “1-2-3” law, M ~ cj 2 a 3 . To measure the angular momentum, 
S , of the sun (a task for space technology in the 1970’s or 1980’sl), one measures 
the precession of a gyroscope in a near orbit about the sun, or one examines some 
other aspect of the “dragging of inertial frames.” To determine the electric charge 


For a closed universe the 
total mass-energy M and 
angular momentum S are 
undefined and undefinable 


* Stokes (1887) and other standard references deny this legend. In part I of Stokes the basic manuscript 
references are listed, including especially codex manuscript Rawlinson B.512 in 154 folios, in double 
columns, written by various hands in the fourteenth and fifteenth centuries (cf Catalogi codicum 
manuscriptorum Bibliothecae Bodleianae Partis Quintae Fasciculus Primus ; Oxford, 1862, col. 728-732). 
In this manuscript, folio 97b. 1. line 14, reads in the translation of Stokes. Part 1. p. xxx: “as Paradise 
is without beasts, without a snake, without a lion, without a dragon, without a scorpion, without a mouse, 
without a frog, so is Ireland in the same manner without any harmful animal, save only the wolf.. .” 
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of a body, one surrounds it by a large sphere, evaluates the electric field normal 
to the surface at each point on this sphere, integrates over the sphere, and applies 
the theorem of Gauss. But within any closed model universe with the topology of 
a 3-sphere, a Gaussian 2-sphere that is expanded widely enough from one point 
finds itself collapsing to nothingness at the antipodal point. Also collapsed to 
nothingness is the attempt to acquire useful information about the “charge of the 
universe”: the charge is trivially zero. By the same token, every “surface integral” 
(see details in Chapter 20) to determine mass-energy or angular momentum collapses 
to nothingness. To make the same point in another way: around a closed universe 
there is no place to put a test object or gyroscope into Keplerian orbit to determine 
either any so-called “total mass” or “rest frame” or “4-momentum” or “angular 
momentum” of the system. These terms are undefined and undefinable. Words, yes; 
meaning, no. 

Not having a defined 4-momentum for a closed universe may seem at first sight 
disturbing; but it would be far more disturbing to be given four numbers and to 
be told authoritatively that they represent the components of some purported 
“total energy-momentum 4-vector of the universe.” Components with respect to what 
local Lorentz frame? At what point? And what about the change in this vector on 
parallel transport around a closed path leading back to that strangely preferred 
point? It is a happy salvation from these embarrassments that the issue does not 
and cannot arise! 

Imagine a fantastically precise measurement of the energy of a y-ray. The experi¬ 
menter wishes to know how much this y-ray contributes to the total mass-energy 
of the universe. Having measured its energy in the laboratory, he then corrects it 
for the negative gravitational energy by which it is bound to the Earth. The result, 


Asymptotic flatness as the 
key to the definability of M 
and S 


^corrected hv{\ 

is the energy the photon will have after it climbs out of the Earth’s gravitational 
field. But this is only the first in a long chain of corrections for energy losses (redshifts) 
as the photon climbs out of the gravitational fields of the solar system, the galaxy, 
the local cluster of galaxies, the supercluster, and then what? These corrections show 
no sign of converging, unless to £ corrected = 0. 

Quite in contrast to the charge-energy-angular-momentum facelessness of a closed 
universe are the attractive possibilities of defining and measuring all three quantities 
in any space that is asymptotically flat. One does not have to revolutionize present- 
day views of cosmology to talk of asymptotically flat space. It is enough to note 
how small is the departure from flatness, as measured by the departure of ( —goo) 1/2 
from unity, in cases of astronomical or astrophysical interest (Box 19.2). Surrounding 
a region where any dynamics, however complicated, is going on, whenever the 
geometry is asymptotically flat to some specified degree of precision, then to that 
degree of precision it makes sense to speak of the total energy-momentum 4-vector 
of the dynamic region, P, and its total intrinsic angular momentum, S. Parallel 
transport of either around any closed curve in the flat region brings it back to its 
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( ->1 

J Box 19.2 METRIC CORRECTION TERM NEAR SELECTED HEAVENLY BODIES 



m 

m 

r 

™ = 1 - 
r 

- i-9 oo) 1/2 

At shoulder of Venus 
de Milo 

2 X 10 5 g = 

E5 X 10-23 cm 

30 cm 

5 X 

10- 25 

At surface of Earth 

6 X 10 27 g = 

4 x 10 _1 cm 

6.4 x 10 8 cm 

6 X 

10 -10 

At Earth’s distance 
from sun 

2 X 10^3 g = 

1.5 X 10 3 cm 

1.5 x 10 13 cm 

1 X 

10" 8 

At sun’s distance from 
center of galaxy 

2 x 10 -“ g = 

1.5 X 10 16 cm 

2.5 x 10 22 cm 

6 X 

10- 7 

At distance of galaxy 
from center of Virgo 

6 X 10 47 g = 

4 X 10 19 cm 

3 X 10 23 cm 

1 X 

10“ 6 


_ _ I 


starting point unchanged. Moreover, it makes no difference how enormous are the 
departures from flatness in the dynamic region (black holes, collapsing stars, intense 
gravitational waves, etc.); far away the curvature will be weak, and the 4-momentum 
and angular momentum will reveal themselves by their imprints on the spacetime 
geometry. 


CHAPTER 


CONSERVATION LAWS FOR 
4-MOMENTUM AND 
ANGULAR MOMENTUM 


We denote as energy of a materia / system in a certain state 
the contribution of all effects (measured in mechanical units of 
work) produced outside the system when it passes in an arbitrary 
manner from its stete to e reference state which has been 

defined ad hoc. 

WILLIAM THOMPSON {later Lord Kelvin), 
as quoted by Max von Laue in Schilpp (1949), p. 514. 


All forms of energy possess inertia. 

ALBERT EINSTEIN, conclusion 
from his paper of September 26, 1905, 
as summarized by von Laue in Schilpp (1949), p. 523. 


V 


§20.1. OVERVIEW 


Chapter 5 (stress-energy 
tensor) is needed as 
preparation for this chapter, 
which in turn is needed as 
preparation for the Track-2 
portion of Chapter 36 
(generation of gravitational 
waves) and will be useful in 
understanding Chapter 35 
(propagation of gravitational 
waves). 

_ / 


Chapter 19 expounded the key features of total 4-momentum P and total angular 
momentum S for an arbitrary, gravitating system. But one crucial feature was left 
unproved: the conservation laws for P and S (Box 19.1). To prove those conservation 
laws is the chief purpose of this chapter. But other interesting, if less important, 
aspects of P and S will be encountered along the route to the proof—Gaussian flux 
integrals for 4-momentum and angular momentum; a stress-energy “pseudotensor” 
for the gravitational field, which is a tool in constructing volume integrals for P 
and S; and the nonlocalizability of the energy of the gravitational field. 
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§20.2. GAUSSIAN FLUX INTEGRALS FOR 

4-MOMENTUM AND ANGULAR MOMENTUM 


In electromagnetic theory one can determine the conserved total charge of a source 
by adding up the number of electric field lines emanating from it—i.e., by performing 
a Gaussian flux integral over a closed 2-surface surrounding it: 

° = < 20 » 

Similarly, in Newtonian theory one can determine the mass of a source by evaluating 
the Gaussian flux integral 

M = (20.2) 


These flux integrals work because the charge and mass of a source place indelible 
imprints on the electromagnetic and gravitational fields that envelop it. 

The external gravitational field (spacetime geometry) in general relativity possesses 
similar imprints, imprints not only of the source’s total mass-energy M, but also of 
its total 4-momentum P and its intrinsic angular momentum S (see Box 19.1). Hence, 
it is reasonable to search for Gaussian flux integrals that represent the 4-momentum 
and angular momentum of the source. 

To simplify the search, carry it out initially in linearized theory, and use Maxwell 
electrodynamics as a guide. In electrodynamics the Gaussian flux integral for charge 
follows from Maxwell’s equations F^ v y = AtiJ* 1 , plus the crucial fact that F ,fiV is 
antisymmetric, so that F = F°F: 


Q= f J° d 3 x = 4~ f F 0 ' , d 3 x = F f F ° 2 ,tPx = ‘f f 0 * d 2 Sf 

J J 4ft J 4ft J ' 


4fl- < 

[Gauss’s theorem 


477 « 

ni 


To find analogous flux integrals in linearized theory, rewrite the linearized field 
equations (18.7) in an analogous form involving an entity with analogous crucial 
symmetries. The entity needed turns out to be 

H* av £ = + t - h av i]^ - h^if v ). (20.3) 


As one readily verifies from this expression, it has the same symmetries as the 
Riemann tensor 


__ JjrvPfia __ jjlfiaMvP] 
ffulavp] _ 0 


(20.4) 


This entity, like h tiv , transforms as a tensor under the Lorentz transformations of 
linearized theory; but it is not gauge-invariant, so it is not a tensor in the general 
relativistic sense. 


Gaussian flux integrals for 
charge and Newtonian mass 


defined 
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Linearized field equations in 
terms of H mi ‘^ 


Gaussian flux integrals in 
linearized theory: (1) for 
4-momentum 


(2) for angular momentum 


Generalization of Gaussian 
flux integrals to full general 
relativity 


In terms of ///“»"£, the linearized field equations (18.7) take on the much simplified 
form 

2 = I677 (20.5) 

and from these, by antisymmetry of H fiav ^ in v and /?, follow the source conservation 
laws of linearized theory. 


7 >" — —- ///“*'/ 3 „ — 0 
■' I677 ’ 


which were discussed back in §18.3. The same antisymmetry as yields these equations 
of motion also produces a Gaussian flux integral for the source’s total 4-momentum: 

p ‘=f T,0 ‘ px =-£/ ** = jtf 

—[Gauss’s theorem] (20.6) 

Here the closed 2-surface of integration 5 must completely surround the source and 
must lie in a 3-surface of constant time x; 0 . The integral (20.6) for the source’s energy 
P°, which is used more frequently than the integrals for P\ reduces to an especially 
simple form in terms of gafi = Va(3 + h af3 : 

P ° = lhj(20-7) 

(see exercise 20.1). 

A calculation similar to (20.6), but more lengthy (exercise 20.2), yields a flux 
integral for total angular momentum about the origin of coordinates: 

J» = f (x M T y0 - xT* 0 ) d 3 x 

= TZ — f <x^H ya0 ’ - x'H>*<» a + H» 0v - d 2 S,. (20.8) 

1077 Jg ’ * 1 


To evaluate the flux integrals in (20.6) to (20.8) (by contrast with the volume 
integrals), one need utilize only the gravitational field far outside the source. Since 
that gravitational field has the same form in full general relativity for strong sources 
as in linearized theory for weak sources, the flux integrals can be used to calculate 
and J^ v for any isolated source whatsoever, weak or strong: 

in full general relativity 
theory, for any isolated 
source, when the closed 
surface of integration <3 
is in the asymptotically 
flat region surrounding (20.9) 
the source, and when 
asymptotically Minkows- 
kian coordinates are used. 


pll __ 

1 

I677 • 

f H^\ a d 2 S p 

p° = 

1 

1677 « 

( Sjk,k &kk,j) d Sj, 

5 

Tfiv — 

1 

£ {x^H va0 \ - x v H^ a 
J s 

- 

1677 - 

+ H^ v - d 2 S y 
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Knowing P* and J* 9 % one can calculate the source’s total mass-energy M and intrinsic 
angular momentum S** by the standard procedure of Box 5.6: 

M — 2 , (20.10) 

( vector by which the source’s asymptotic, \ 

“Af/r”, spherical field is displaced from | (20.11) 
being centered on the origin of coordinates/ 

Sp = - W + Y V P»)P°/M. (20.12) 

Note especially that the integrands of the flux integrals (20.9) are not gauge-invari¬ 
ant. In any local inertial frame at an event Ig^o) = v> W*o> = °J the y 
vanish, since 

e = h =0 =0- s —t] => — 0 

5imv,cl — n ixv,at 11 t a > Bfiv l \fiv * 11 

This is reasonable behavior; their Newtonian analog, the integrand <Pj = (gravita¬ 
tional acceleration) of the Newtonian flux integral (20.2), similarly vanishes in local 
inertial frames. 

Although the integrands of the flux integrals are not gauge-invariant, the total 
integrals P M (4-momentum) and J* v (angular momentum) most assuredly are! They 
have meaning and significance independent of any coordinate system and gauge. 
They are tensors in the asymptotically flat region surrounding the source. 

The spacetime must be asymptotically flat if there is to be any possibility of 
defining energy and angular momentum. Only then can linearized theory be applied; 
and only on the principle that linearized theory applies far away can one justify 
using the flux integrals (20.9) in the full nonlinear theory. Nobody can compel a 
physicist to move in close to define energy and angular momentum. He has no need 
to move in close; and he may have compelling motives not to: the internal structure 
of the sources may be inaccessible, incomprehensible, uninteresting, dangerous, 
expensively distant, or frightening. This requirement for far-away flatness is a 
remarkable feature of the flux integrals (20.9); it is also a decisive feature. Even 
the coordinates must be asymptotically Minkowskian; otherwise most formulas in 
this chapter fail or require modification. In particular, when evaluating the 4-momen- 
tum and angular momentum of a localized system , one must apply the flux integrals 
(20.9) only in asymptotically Minkowskian coordinates. If such coordinates do not exist 
(spacetime not flat at infinity ), one must completely abandon the flux integrals , and 
the quantities that rely on them for definition: the total mass , momentum , and angular 
momentum of the gravitating source. In this connection, recall the discussion of §19.4. 
It described, in physical terms, why “total mass-energy” is a limited concept, useful 
only when one adopts a limited viewpoint that ignores cosmology. (Compare “light 
ray” or “particle,” concepts of enormous value, but concepts that break down when 
wave optics or wave mechanics enter significantly.) 

Summary: Attempts to use formulas (20.9) in ways that lose sight of the Minkowski 
boundary conditions (and especially simply adopting them unmodified in curvilinear 
coordinates) easily and unavoidably produce nonsense. 


Total mass-energy, center of 
mass, and intrinsic angular 
momentum 


Gaussian flux integrals valid 
only in asymptotically flat 
region of spacetime and in 
asymptotically Minkowskian 
coordinates 
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EXERCISES 


The full Einstein field 
equations in terms of H 


Volume integrals for 
4-momentum and angular 
momentum in full general 
relativity 


Exercise 20.1. FLUX INTEGRAL FOR TOTAL MASS-ENERGY IN 
LINEARIZED THEORY 

Show that the flux integral (20.6) for P° reduces to (20.7). Then show that, when applied 
to a nearly Newtonian source [line element (18.15c)], it reduces further to the familiar 
Newtonian flux integral (20.2). 

Exercise 20.2. FLUX INTEGRAL FOR ANGULAR MOMENTUM IN 
LINEARIZED THEORY 

Derive the Gaussian flux integral (20.8) for J**. [Hint: use the field equations (20.5) to show 
16'Tx /i 7' , ’° = (x*H”** %a \ k - - //" 00 %; (20.13) 

and then use Gauss’s theorem to evaluate the volume integral of equation (20.8)]. 

Exercise 20.3. FLUX INTEGRALS FOR AN ARBITRARY STATIONARY SOURCE 

(a) Use the flux integrals (20.9) to calculate P* and J^ v for an arbitrary stationary source. 
For the asymptotically flat metric around the source, use (19.13), with the gravitational 
radiation terms set to zero. 

(b) Verify that the “auxiliary equations” (20.10) to (20.12) give the correct answer for this 
source’s total mass-energy M and intrinsic angular momentum S ti . 


§20.3. VOLUME INTEGRALS FOR 4-MOMENTUM 
AND ANGULAR MOMENTUM 

It is easy, in linearized theory, to convert the surface integrals for P M and J^ into 
volume integrals over the source; one can simply trace backward the steps that led 
to the surface integrals in the first place [equation (20.6); exercise 20.2]. How, in 
full general relativity, can one similarly convert from the surface integrals to volume 
integrals? The answer is rather easy, if one thinks in the right direction. One need 
only put the full Einstein field equations into the form 

= 1677 rjg (20.14) 

analogous to equations (20.5) of linearized theory. Here H^P is to be defined in 

terms of — %\ fJLV by equation (20.3), even deep inside the source where 

might be >1. This form of the Einstein equations then permits a conversion of the 
Gaussian flux integrals into volume integrals, just as in linearized theory: 

P M = 77— f H^ d 2 Sj = “ 4 “ f H*** 0 * . d 3 x = — f aB d 3 x 
I077V * I677V » J 1677*^ * H 

= f T$ t d 3 x. (20.15) 

Similarly, 

juv _ f ( x »rj f - x'Ttft) d 3 x. (20.16) 
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[Crucial to the conversion is the use of partial derivatives rather than covariant 
derivatives in equations (20.14).] In these volume integrals, as throughout the pre- 
ceeding discussion, the coordinates must become asymptotically Lorentz (g —> t 
far from the source. 

The form of can be calculated by recalling that H* 1 *^ aP is a linearized 
approximation to the Einstein curvature tensor (20.5). Define the nonlinear correc¬ 
tions by 


16t^ = - 2G (20.17) 

(To calculate them in terms of g^ v or h^ v — — tj mj , is straightforward but lengthy. 

The precise form of these corrections will never be needed in this book.) Then 
Einstein’s equations read 

- 16 m* 9 + 2 G» v = 1677(1^ -f 7^), 

so that 


T% - T» v + (20.18) 

The quantity t* v is sometimes called a “stress-energy pseudotensor for the gravita¬ 
tional field.” The Einstein field equations (20.14) imply, because A p is anti¬ 
symmetric in v and /?, that 


= (TV + t^\ 9 = 0. (20.19) 

These equations are equivalent to — 0, but they are written with partial 
derivatives rather than covariant derivatives—a fact that permits conversions back 
and forth between volume integrals and surface integrals. 

All the quantities H /xav P, and t tiV depend for their definition and existence 
on the choice of coordinates; they have no existence independent of coordinates; 
they are not components of tensors or of any other geometric object. Correspond¬ 
ingly, the equations (20.14) to (20.19) involving T%£ { and t^ v have no geometric, 
coordinate-free significance; they are not “covariant tensor equations.” There is, 
nevertheless, adequate invariance under general coordinate transformations to give 
the values and J^ v of the volume integrals (20.15) and (20.16) geometric, coor¬ 
dinate-free significance in the asymptotically flat region far outside the source. 
Although this invariance is hard to see in the volume integrals themselves, it is clear 
from the surface-integral forms (20.9) that no coordinate transformation which 
changes the coordinates only inside some spatially bounded region can influence 
the values of the integrals. For coordinate changes in the distant, asymptotically 
flat regions, linearized theory guarantees that under Lorentz transformations the 
integrals fo rT 71 and will transform like special relativistic tensors, and that under 
infinitesimal coordinate transformations (gauge changes) they will be invariant. 

Because t^ v are not tensor components, they can vanish at a point in one coordinate 
system but not in another. The resultant ambiguity in defining a localized energy 
density t 00 for the gravitational field has a counterpart in ambiguities that exist in 


f("stress-energy 
pseudotensor") defined 


defined 


Conservation law for T^ { 


t» p . and T% t are 
coordinate-dependent objects 
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Other, equally good versions 

of W"""' 3 , t*’. TK.: 


the formal definition of /*\ It is clear that any quantities which agree with 

the original H in the asymptotic weak-field region will give the same values as 
does for the and surface integrals (20.9). One especially convenient 
choice has been given by Landau and Lifshitz (1962; §100), who define 


(1) Hgtf H!£{? = - B" , ’B'‘^, (20.20) 

where = (—g) 1/2 g' u '. Landau and Lifshitz show that Einstein’s equations can 
be written in the form 


H i a -l p ap = IM-SKT*' + (20.21) 

W ? l-l where the Landau-Lifshitz pseudotensor components 

<-*> = ^(s^xS x % - S a \xS^ + yrtxMS X ",pl*, 

- (rt^'.pSr.x + B",pB'‘ p ,x) + SxpS"'’S^S^.p 

+ - g**g*)&g n g' r - g„g, r ) S" t ,xS pct , m ) (20.22) 

are precisely quadratic in the first derivatives of the metric. (Einstein also gave a 
pseudotensor t^,, with this property, but it was not symmetric and so did not lead 
to an integral for «/*“'.) Because has the same symmetries as H mv $ and equals 
far from the source (exercise 20.4), and because the field equations (20.21) 
(3) r L-Leff in terms of Hg[f have the same form as in terms of H^, it follows that 


T’t-Letr = + /£-l) (20.23a) 

has all the properties of the introduced earlier in this section: 

= 0, (20.23b) 

P» = f 7'£° Leff d 3 x, (20.23c) 

= f (x»rg Lett - x'T(£ hett ) d*x. (20.23d) 


EXERCISE Exercise 20.4. FORM OF Mg*? FAR FROM SOURCE 

Show that the entities of equations (20.20) reduce to H**”* (20.3) in the weak-field 

region far outside the source. 


§20.4. WHY THE ENERGY OF THE GRAVITATIONAL FIELD 
CANNOT BE LOCALIZED 

Consider an element of 3-volume dS v and evaluate the contribution of the “gravita¬ 
tional field” in that element of 3-volume to the energy-momentum 4-vector, using 
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in the calculation either the pseudotensor t< tr or the pseudotensor t^ L discussed in 
the last section. Thereby obtain 


p = e / 1 V lv dS v 


or 


p = •J&dZ,. 

Right? No, the question is wrong. The motivation is wrong. The result is wrong. 
The idea is wrong. 

To ask for the amount of electromagnetic energy and momentum in an element 
of 3-volume makes sense. First, there is one and only one formula for this quantity. 
Second, and more important, this energy-momentum in principle “has weight.” It 
curves space. It serves as a source term on the righthand side of Einstein’s field 
equations. It produces a relative geodesic deviation of two nearby world lines that 
pass through the region of space in question. It is observable. Not one of these 
properties does “local gravitational energy-momentum” possess. There is no unique 
formula for it, but a multitude of quite distinct formulas. The two cited are only 
two among an infinity. Moreover, “local gravitational energy-momentum” has no 
weight. It does not curve space. It does not serve as a source term on the righthand 
side of Einstein’s field equations. It does not produce any relative geodesic deviation 
of two nearby world lines that pass through the region of space in question. It is 
not observable. 

Anybody who looks for a magic formula for “local gravitational energy-momen¬ 
tum” is looking for the right answer to the wrong question. Unhappily, enormous 
time and effort were devoted in the past to trying to “answer this question” before 
investigators realized the futility of the enterprise. Toward the end, above all mathe¬ 
matical arguments, one came to appreciate the quiet but rock-like strength of 
Einstein’s equivalence principle. One can always find in any given locality a frame 
of reference in which all local “gravitational fields” (all Christoffel symbols; all r* ) 
disappear. No T’s means no “gravitational field” and no local gravitational field 
means no “local gravitational energy-momentum.” 

Nobody can deny or wants to deny that gravitational forces make a contribution 
to the mass-energy of a gravitationally interacting system. The mass-energy of the 
Earth-moon system is less than the mass-energy that the system would have if the 
two objects were at infinite separation. The mass-energy of a neutron star is less 
than the mass-energy of the same number of baryons at infinite separation. Sur¬ 
rounding a region of empty space where there is a concentration of gravitational 
waves, there is a net attraction, betokening a positive net mass-energy in that region 
of space (see Chapter 35). At issue is not the existence of gravitational energy, but 
the localizability of gravitational energy. It is not localizable. The equivalence 
principle forbids. 

Look at an old-fashioned potato, replete with warts and bumps. With an orange 
marking pen, mark on it a “North Pole” and an “equator”. The length of the equator 
is very far from being equal to 2k times the distance from the North Pole to the 


Why one cannot define a 
localized energy-momentum 
for the gravitational field 
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equator. The explanation, “curvature,” is simple, just as the explanation, “gravita¬ 
tion”, for the deficit in mass of the earth-moon system (or deficit for the neutron 
star, or surplus for the region of space occupied by the gravitational waves) is simple. 
Yet it is not possible to ascribe the deficit in the length of the equator in the one 
case, or in mass in the other case, in any uniquely right way to different elements 
of the manifold (2-dimensional in the one case, 3-dimensional in the other). Look 
at a small region on the surface of the potato. The geometry there is locally flat. 
Look at any small region of space in any of the three gravitating systems. In an 
appropriate coordinate system it is free of gravitational field. The over-all effect one 
is looking at is a global effect, not a local effect. That is what the mathematics cries 
out. That is the lesson of the nonuniqueness of the t** v \ 


§20.5. CONSERVATION LAWS FOR TOTAL 4-MOMENTUM 
AND ANGULAR MOMENTUM 


Consider a system such as our galaxy or the solar system, which is made up of many 
gravitating bodies. Some of the bodies may be highly relativistic (black holes; neutron 
stars), while others are not. However, insist that in the regions between the bodies 
spacetime be nearly flat (gravity be weak)—so flat, in fact, that one can cover the 
entire system with coordinates which are (almost) globally inertial, except in a small 
neighborhood of each body where gravity may be strong. Such coordinates can exist 
only if the Newtonian gravitational potential, 0 ~ |(t) 00 — go o)> * n interbody 
region is small: 


interbody ~~ (Mass of system)/(radius of system) < 1. 


Total 4-momentum and 
angular momentum for a 
system of gravitating bodies 


The solar system certainly satisfies this condition (0 interbody — 10~ 7 ), as does the 
Galaxy (0 interbody - ICC 6 ), as do clusters of galaxies (0 interbody - ICC 6 ); but the 
universe as a whole does not (^ interbody —l)! 

In evaluating volume integrals for the system’s total 4-momentum, split its volume 
into a region containing each body (denoted ‘VI”) plus an interbody region; and 
neglect the pseudotensor contribution from the almost-flat interbody region: 


a stem = Zf T £d*X + f T$ t d*X 

A •'A ^interbody 

region 


= 2 p a + f Tli0 d 3 x. 

A ^interbody 

A region 


(20.24a) 


Because spacetime is asymptotically flat around each body, P/ is the 4-momentum 
of body A as measured gravitationally by an experimenter near it. The integral of 
I 7 * 0 over the interbody region is the contribution of any gas, particles, or magnetic 



§20.5. CONSERVATION LAWS FOR 4-MOMENTUM AND ANGULAR MOMENTUM 


469 


fields out there to the total 4-momentum. A similar breakup of the angular momen- 
turn reads 

Astern = 2 J !t + f - x"7>°) d 3 x. (20.24b) 

A ‘'interbody 

region 

In operational terms, these breakups show that the total 4-momentum and angular 
momentum of the system, as measured gravitationally by an experimenter outside it, 
are sums of P* and for each individual body, as measured gravitationally by an 
experimenter near it, plus contributions of the usual special-relativistic type from the 
interbody matter and fields. This is true even if some of the bodies are hurtling through 
the system with speeds near that of light; their gravitationally measured Pa and 
contribute, on an equal footing with anyone else’s, to the system’s total P* and 

/A"! 

Surround this asymptotically flat system by a two-dimensional surface S that is 
at rest in some asymptotic Lorentz frame. Then the 4-momentum and angular 
momentum inside S change at a rate (as measured in 5’s rest frame) given by 

^T = if T & d3x = f 1*0 d 3 x=-f T$ tf d 3 x 

= -f T ek d2 Sj, (20.25) 

and similarly 

djv- v r 

sr “ - t wm, - *' 7 %) <ps f ,20^6) 

o<> 

Although the pseudotensor t»\ in the interbody region and outside the system, 
contributes negligibly to the total 4-momentum and angular momentum (by as¬ 
sumption), its contribution via gravitational waves to the time derivatives dP»/dt 
and djw/dt can be important when added up over astronomical periods of time. 
Thus, one must not ignore it in the flux integrals (20.25), (20.26). 

In evaluating these flux integrals, it is especially convenient to use the Landau- 

Lifshitz form of Tfi t , since that form contains no second derivatives of the metric 
Thus set 

r eff = ( ~g)(T + /£! l ) ~ (Ta- + 

where /a, l are given by equations (20 .22). Only those portions ofthat die out 
as 1 /H or 1/r 3 at large r can contribute to the flux integrals (20 25) (20 26) For 
static solutions [ B#ir ~ const. + 0{\/r)\. tf^ dies out as 1/r*. Hence, the only contri¬ 
butions come from dynamic parts of the metric, which, at these large distances are 
entirely in the form of gravitational waves. The study of gravitational waves in 
Chapter 35 will reveal that when /£' L is averaged over several wavelengths it 
becomes a stress-energy tensor T (GW ^ for the waves, which has all the properties 
one ever requires of any stress-energy tensor. (For example, via Einstein’s equations 


Rates of change of total 
4-momentum and angular 
momentum: 


(1) expressed as flux integrals 
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G { b)a" = 877 7 < GW) /*«' j it contributes to the “background” curvature of the spacetime 
through which the waves propagate.) Moreover, averaging /j^ L over several wave¬ 
lengths before evaluating the flux integrals (20.25), (20.26) cannot affect the values 
of the integrals. Therefore, one can freely make in these integrals the replacement 

_ J'/J.v _|_ jiQW)nv 

thereby obtaining 

dP* 

~~~dT 

dJ* v r 

——— = j> lx»(r j + pew*’) - x'(Tri + 7 Hgw)w)] d 2 S . (20.28) 

These are tensor equations in the asymptotically flat spacetime surrounding the 
system. All reference to pseudotensors and other nontensorial entities has disap¬ 
peared. 

Equations (20.27) and (20.28) say that the rate of loss of 4-momentum and angular 
momentum from the system , as measured gravitationally, is precisely equal to the rate 
at which matter , fields , and gravitational waves carry off 4-momentum and angular 
momentum . 

This theorem is extremely useful in thought experiments where one imagines 
changing the 4-momentum or angular momentum of a highly relativistic body (e.g., 
a rotating neutron star) by throwing particles onto it from far away [see, e.g., Hartle 
(1970)]. — 


£ (7*+ F GW >*)d% (20.27) 


(2) expressed as flux integrals 
of t* v + r< Gw >^ 


EXERCISE Exercise 20.5. TOTAL MASS-ENERGY IN NEWTONIAN LIMIT 

(a) Calculate tff L for the nearly Newtonian metric 

ds 2 = -(1 + 20) dt 2 + (1 - 2d>) 8 jk dx* dx k 

(see §18.4). Assume the source is slowly changing, so that time derivatives of 0 can be 
neglected compared to space derivatives. [Answer: 

/?°, - -— $ <p . 

L L 8ir 

= o. 

‘tu = £(*,*» - j i* i>-l (20.29) 

(Note: as given here is the “stress tensor for a Newtonian gravitational field”; cf. exercises 

39.5 and 39.6.) 

(b) Let the source of the gravitational field be a perfect fluid with 


T“" = (p + p)u*u r + pg*', p/p ~ V 2 = (dx/dtf ~ |#|. 
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Let the Newtonian potential satisfy the source equation 


0,„ = 4wp. 

Show that the energy of the source is 


P° = f (T°°+ t°°)(-g)d 3 x 

= f [P /( 1 - y2 ) 1/2 + Y P* 2 + Y p^]fe,g ra g a j 1/2 dy dz 


Lorentz 

contraction 

factor 


kinetic 

energy 


+ higher-order corrections. 


potential 


J 

"proper " 

energy 


volume 


(20.30) 


(c) Show that the “equations of motion” T £ hett = 0 reduce to the standard equations 
(16.3) of Newtonian hydrodynamics. 


§20.6. EQUATIONS OF MOTION DERIVED 
FROM THE FIELD EQUATION 

Consider the Einstein field equation 


G = 8t tT (20.31) 

under conditions where space is empty of everything except a source-free electro¬ 
magnetic field: 

^ = ~ (20.32) 

( c f the expression for stress-energy tensor of the electromagnetic field in §5.6). To 
predict from (20.31) how the geometry changes with time, one has to know how 
the electromagnetic field changes with time. The field is expressed as the “exterior 
derivative” of the 4-potential, 

F = dA (language of forms) 


or 


„ 3 A v 3 A u , 

F v.v = g^T ~ (language of components), 


(20.33) 


and the time rate of change of the field is governed by the Maxwell equation 

d*F = 0 


or 


F*' = 0 . 


( 20 . 34 ) 
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Vacuum Maxwell equations 
derived from Einstein field 
equation 


If it seems a fair division of labor for the Maxwell equation to predict the develop¬ 
ment in time of the Maxwell field and the Einstein equation to do the same for 
the Einstein field, then it may come as a fresh surprise to discover that the Einstein 
equation (20.31), plus expression (20.32) for the Maxwell stress-energy, can do both 
jobs. One does not have to be given the Maxwell “equation of motion” (20.34). One 
can derive it fresh from (20.31) plus (20.32). The proof proceeds in five steps (see 
also exercise 3.18 and §5.10). Step one: The Bianchi identity V • G =0 implies 
conservation of energy-momentum V • T — 0. Step two: Conservation expresses 
itself in the language of components in the form 

0 = 8*7*'., = 2 + 2 F**g afi F* ; , 

~ S^ar,^ (20.35) 

Step three: Leaving the middle term unchanged, rearrange the first term so that, 
like the last term, it carries a factor F aT . Thus in that first term let the indices vfi 
of F p P be replaced in turn by or and by to , to subdivide that term into 

Ffia o P* r 1 Ffia Q F TU 
1 ;<j5ar A ' ;rSa<j* 

= - F\. t )F” 

= g^(F„ r;a + F„. r )F". (20.36) 

Step four: Combine the first and the last terms in (20.35) to give 

g^(F,. T . a + F ap , r + F„.,)F". (20.37) 

The indices on the derivatives of the field quantities stand in cyclic order. This 
circumstance annuls all the terms in the connection coefficients r a p y when one writes 
out the covariant derivatives explicitly. Thus one can replace the covariant derivatives 
by ordinary derivatives. Moreover, these three derivatives annul one another identi¬ 
cally when one substitutes for the fields their expressions (20.33) in terms of the 
potentials. Consequently, nothing remains in the conservation law (20.35) except 
the middle term, giving rise to four statements (ju, = 0, 1,2, 3) 

F^F^. V = 0 (20.38) 

about the four quantities (/? = 0, 1,2, 3) 

(20.39) 

Step five: The determinant of the coefficients in the four equations (20.38) for the 
four unknowns (20.39) has the value 

| / VW* 3 | 


r\f\F 3 2 F\ 


= — (E • B) 2 


(20.40) 
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(see exercise 20.6, part i). In the generic case, this one function of the four variables 
(/, x, y , z) vanishes on one or more hypersurfaces; but off any such hypersurface 
(i.e., at “normal points” in spacetime) it differs from zero. At all normal points, the 
solution of the four linear equations (20.38) with their nonvanishing determinant 
gives identically zero for the four unknowns (20.39); that is to say, Maxwell’s 
“equations of motion” 


F* 9 ., = 0 

are fulfilled and must be fulfilled as a straight consequence of Einstein’s field 
equation (20.31)—plus expression 20.32 for the stress-energy tensor. Special cases 
admit counterexamples (see exercise 20.8); but in the generic case one need not 
invoke Maxwell’s equations of motion; one can deduce them from the Einstein field 
equation. 

Turn from the dynamics of the Maxwell field itself to the dynamics of a charged 
particle moving under the influence of the Maxwell field. Make no more appeal 
to outside providence for the Lorentz equation of motion than for the Maxwell 
equation of motion. Instead, to generate the Lorentz equation call once more on 
the Einstein field equation or, more directly, on its consequence, the principle of 
the local conservation of energy-momentum. 

Keep track of the world line of the particle from t = (to t = t + At (Figure 20.1). 
Generate a “world tube” around this world line. Thus, at each value of the time 
coordinate /, take the location of the particle as center; construct a sphere of radius 
c around this center; and note how the successive spheres sweep out the desired 
world tube. Construct “caps” on this tube at times t and t + At. The two caps, 
together with the world tube proper, bound a region of spacetime in which energy 
and momentum can be neither created nor destroyed (“no creation of moment of 
rotation,” in the language of the Bianchi identities, Chapter 15). Therefore the 
energy-momentum emerging out of the “top” cap has to equal the energy-momen¬ 
tum entering the “bottom” cap, supplemented by the amount of energy-momentum 
carried in across the world-tube by the Maxwell field. Out of such an analysis, as 
performed in flat spacetime, one ends up with the Lorentz equation of motion in 
its elementary form (see Chapters 3 and 4), 



p + jp 


Figure 20.1. 

“World tube.” The change in the 4-momentum of the 
particle is governed by the flow of 4-momentum across 
the boundary of the world tube. 


Lorentz force equation 
derived from the Einstein 
field equation 
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A particle acted on by its 
own electromagnetic field 
("radiation damping") 


Infinite self-energy of a point 
particle 


dp/dj = e(F, u > (language of forms) 

or in curved spacetime, the Lorentz equation of motion in covariant form, 
V u p = m V u u = e(F, u > (form language) 


or 

m \^T + d ']‘ T ] = eF<X P^~ ( com P onent language). (20.41) 


“One ends up with the Lorentz equation of motion”—but only after hurdling 
problems of principle along the way. One would understand what a particle is if 
one understood how to do the calculation of balance of energy-momentum with 
all rigor! Few calculations in all of physics have been done in so many ways by 
so many leading investigators, from Lorentz and his predecessors to Dirac and 
Rohrlich [see Teitelboim (1970, 1971) for still further insights]. Among the issues 
that develop are two that never cease to compel attention. (1) The particle responds 
according to the Lorentz force law (20.41) to a field. This field is the sum of a 
contribution from external sources and from the particle itself. How is the field 
exerted by the particle on itself to be calculated? Insofar as it is not already included 
in its effects in the “experimental mass” m in (20.41), this force is to be calculated 
as half the difference between the retarded field and the advanced field caused by 
that particle (see §36.11 for a more detailed discussion of the corresponding point 
for an emitter of gravitational radiation). This difference is singularity-free. On the 
world line, it has the following simple value [valid in general for point particles; 
valid for finite-sized particles when and only when the particle changes its velocity 
negligibly compared to the speed of light during the light-travel time across itself— 
see, e.g., Burke (1970)] 


y(^ret - F ^Y‘ 


— h e ( d** 1 ^ 

~ 3 \ dr dr 3 


d^x^_dx^\ 
dr 3 dr / 


(20.42) 


Every acceptable line of reasoning has always led to expression (20.42). It also 
represents the field required to reproduce the long-known and thoroughly tested 
law of radiation damping. (2) “Infinite self-energy.” Around a particle at rest, or 
close to a particle in an arbitrary state of motion, the field is e/r 2 and the field energy 
is 

(1/8,7) (e/r 2 )H^r 2 dr = (e 2 /2- e' 1 ). (20.43) 

*"min 


This expression diverges as r min is allowed to go to zero. To hurdle this difficulty, 
one arranges the calculation of energy balance in such a way that there always 
appears the sum of this “self-energy” and the “bare mass.” The two terms individ¬ 
ually are envisaged as “going to infinity” as r min goes to zero; but the sum is identified 
with the “experimental mass” and is required to remain finite. Of course, no particle 
is a classical object. A proper calculation of the energy has to be conducted at the 
quantum level. There it is easier to hide from sight the separate infinities—but they 
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are still present, and promise to remain until the structure of a particle is understood. 

Before one turns from the Maxwell and Lorentz equations of motion to a final 
example (deriving the geodesic equations of motion for an uncharged particle), is 
it not time to object to the whole program of “deriving an equation of motion from 
Einstein’s field equation”? First, is it not a pretensious parade of pomposity to say 
it comes “from Einstein’s field equation” (and even more, “from Einstein’s field 
equations”) when it really comes from a principle so elementary and long established 
as the law of conservation of 4-momentum? It cannot be contested that this conser¬ 
vation principle, in historical fact, came before geometrodynamics, just as it came 
before electrodynamics and before the theories of all other established fields. How¬ 
ever, in no theory but Einstein’s is this principle incorporated as an identity. Only 
here does the conservation of energy-momentum appear as a fully automatic conse¬ 
quence of the inner working of the machinery of the world (energy density tied to 
moment of rotation, and moment of rotation automatically conserved; see Chapter 
17). Out of Einstein’s theory one can derive the equation of motion of a particle. 
Out of Maxwell’s one cannot. Thus, nothing prevents one from acting on a charge 
with an “external” force, over and above the Lorentz force, nor from tailoring this 
force in such a way that the charge follows some prescribed world line (“engine- 
driven source”). It makes no difficulties whatsoever for Maxwell’s equations that 
one has shifted attention from a world line that follows the Lorentz equation of 
motion to one that does not. Quite the contrary is true in general relativity. To shift 
from right world line (geodesic) to wrong world line makes the difference between 
satisfying Einstein’s field equation in the vicinity of that world line and being unable 
to satisfy Einstein’s field equation. 

The Maxwell field equations are so constructed that they automatically fulfill and 
demand the conservation of charge; but not everything has charge. The Einstein 
field equation is so constructed that it automatically fulfills and demands the conser¬ 
vation of momentum-energy; and everything does have energy. The Maxwell field 
equations are indifferent to the interposition of an “external” force, because that 
force in no way threatens the principle of conservation of charge. The Einstein field 
equation cares about every force, because every force is a medium for the exchange 
of energy. 

Electromagnetism has the motto, “I count all the electric charge that’s here.” All 
that bears no charge escapes its gaze. 

“I weigh all that’s here” is the motto of spacetime curvature. No physical entity 
escapes this surveillance. 

Why, then, is the derivation of the geodesic equation of motion of an object said 
to be based on “Einstein’s geometrodynamic field equation” rather than on “the 
principle of conservation of 4-momentum”? Because geometry responds by its 
curvature to mass-energy in every form. Most of all, because geometry outside tells 
about mass-energy inside, free of all concern about issues of internal structure 
(violent motions, unknown forces, tortuously curved and even multiply-connected 
geometry). 

If one objection to the plan to derive the equation of motion of a particle “from 
the field equation” has been disposed of, then the moment has come to deal with 


Why one is justified to regard 
equations of motion as 
consequences of the Einstein 
field equation 
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How one can avoid 
complexities of particle 
structure when deriving 
equations of motion: the 
"external viewpoint" 


Derivation of geodesic 
motion from Einstein field 
equation: 


(1) derivation in brief 


(2) derivation with care 


Coupling of curvature to 
particle moments produces 
deviations from geodesic 
motion 


the other natural objection: Is there not an inner contradiction in trying to apply 
to a “particle” (implying idealization to a point) a field equation that deals with 
the continuum? Answer: There is a contradiction in dealing with a point. Therefore 
do not deal with a point. Do not deal with internal structure at all. Analyze the 
motion by looking at the geometry outside the object. That geometry provides all 
the handle one needs to follow the motion. 

Already here one sees the difference from the derivation of the Lorentz equation 
of motion as sketched out above. There (1) no advantage was taken of geometry 
outside as indicator of motion inside; (2) a detailed bookkeeping was envisaged of 
the localization in space of the electromagnetic energy; and (3) this bookkeeping 
brought up the issue of the internal structure of the particle, which could not be 
satisfactorily resolved. 

Now begin the analysis in the new geometrodynamic spirit. Surrounding “the 
Schwarzschild zone of influence” of the object, mark out a “buffer zone” (Figure 
20.2) that extends out to the region where the “background geometry” begins to 
depart substantially from flatness. Idealize the geometry in the buffer zone as that 
of an unchanging source merging asymptotically (“boundary $ of buffer zone”) into 
flat space. It suffices to recall the properties of the spacetime geometry far outside 
an unchanging (i.e., nonradiating) source (exercise 19.3) to draw the key conclusion: 
relative to this flat spacetime and regardless of its internal structure, the object 
remains at rest, or continues to move in a straight line at uniform velocity (conserva¬ 
tion of total 4-momentum; §20.5). In other words, it obeys the geodesic equation 
of motion. If this is the result in a flash, then it is appropriate to go back a step 
to review it, to find out what it means and what it demands. 

When the object is absent and the background geometry alone has to be consid¬ 
ered, then the geodesic is a well-defined mathematical construct. Moreover, Fermi- 
Walker transport along this geodesic gives a well-defined way to construct a comoving 
local inertial frame (see §13.6). Relative to this frame, the representative point of 
the geodesic remains for all time at rest at the origin. 

In what way does the presence of the object change this picture? The object 
possesses an angular momentum, mass quadrupole moments, and higher multipole 
moments. They interact with the tide-producing accelerations (Riemann curvature) 
of the background geometry. Depending on the orientation in space of these mo¬ 
ments, the interactions drive the object off its geodesic course in one direction or 
another (see §40.9). These anomalies in the motion go hand in hand with anomalies 
in the geometry. On and near the ideal mathematical geodesic the metric is Min- 
kowskian. At a point removed from this geodesic by a displacement with Riemann 
normal coordinates £ 2 , £ 3 (see §11.6), the metric components differ from their 
canonical values (—1, 1, 1 ,1) by amounts proportional (1) to the squares and 
products of the £ m and (2) to the components of the Riemann curvature tensor 
(tide-producing acceleration) of the background geometry. These second-order terms 
produce departures from ideality in the buffer zone, departures that may be described 
symbolically as of order 


S(metric) ~~ r 2 • R • (spherical harmonic of order two). 


(20.44) 
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Figure 20.2. 

“Buffer zone”: the shell of space between d and £0, where the geometry' is appropriately idealized as 
the spherically symmetric “Schwarzschild geometry” of a localized center of attraction (the object under 
study) in an asymptotically flat space. Inside d\ the “zone of influence” of the object. In the general 
case where this object lacks all symmetry, the metric is found to depart more and more from ideal 
“Schwarzschild character” as the exploration of the geometry is carried inward from d (effect of angular 
momentum of the object on the metric; effect of quadrupole moment; effect of higher moments). Outside 
the “background geometry.” As this geometry is explored at greater and greater distances outside 
it is found to depart more and more from flatness (effect of concentradons of mass, gravitational 
waves, and other geometrodynamics). 


Here r is the distance from the geodesic and R is the magnitude of the significant 
components of the curvature tensor. The object produces not only the standard 
“Schwarzschild” departure from flatness, 


S(metric) — m/r , (20.45) 

which by itself (in a flat background) would bring about no departure from geodesic 
motion, but also correction terms which may be symbolized as 
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^(metric) — ( S/r 2 ) (spherical harmonic of order one) (20.46) 

and 

^(metric) — (7/r 3 ) (spherical harmonic of order two) (20.47) 

and higher-order terms. Here S(cm 2 ) is a typical component of the angular momen¬ 
tum vector or “spin”; 7(cm 3 ) is a representative component of the moment of inertia 
or quadrupole tensor (see Chapter 36 for details), and higher terms have higher-order 
coefficients. 

Coupling of spin to curvature The tide-producing acceleration generated by the surroundings of the object 

(“background geometry”) acts on the spin of the object with a force of order RS 
and pulls it away from geodesic motion with an acceleration of the order 

acceleration (cm-) ~ (20.48) 

m(cm) 

(see exercise 40.8). Otherwise stated, the surrounding and the spin both put warps 
in the geometry, and these warps conspire together to push the object off track. 

The sum of the relevant two perturbations in the metric is qualitatively of the 
form 



Sg ~ r 2 R + S/r 2 . 

(20.49) 

The sum is least where r has a 

value of the order 



r ~~ (S/R)''*, 

(20.50) 

and there it has the magnitude 

Sg - (SR) 1/2 . 

(20.51) 


To “derive the geodesic equation of motion with some preassigned accuracy c” 
may be defined to mean that the metric in the buffer zone is Minkowskian within 
the latitude €. In the illustrative example, this means that ( SR ) V2 is required to 
be of the order of c or less. Nothing can be done about the value of R because 
the background curvature R is a feature of the background geometry. One can meet 
the requirement only by imposing limits on the mass and moments of the object. 
In the example, where the dominating moment is the angular momentum, one must 
require that this parameter of the object be less in order of magnitude than the 
limit 


S~~t 2 /R. (20.52) 

Evidently this and similar conditions on the higher moments are most easily satisfied 
by demanding that the object have spherical symmetry (S = 0, i — 0, higher 
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moments = 0). Then the perturbation in the metric, again disregarding angle factors 
and indices, is qualitatively of the form 

Sg-r 2 R + m/r 9 (20.53) 

and the buffer zone is best designed to bracket the minimizing value of r, 

r a <[r~~ (m/R) 1 '*] < r*. (20.54) 

The departure of the metric from Minkowskian perfection in the buffer zone is of 
the order 


8g ~~ (m 2 R)V*. (20.55) 

To achieve any preassigned accuracy € for <5g, one must demand that the mass be 
less than a limit of the order 


m ~~ e 3/2 /R 1/2 , 


(20.56) 


No object of finite mass moving under the influence of a complex background 
will admit a buffer zone where the geometry approaches Minkowskian values with 
arbitrary precision. Therefore it is incorrect to say that such an object follows a 
geodesic world line. It is meaningless to say that an object of finite rest mass follows 
a geodesic world line. World line of what? If the object is a black hole, there is 
no point inside its “horizon” (capture surface; one-way membrane; see Chapters 
33 and 34) that is relevant to the physics going on outside. Geodesic world line within 
what background geometry? It has no sense to speak of a geometry that “lies behind” 
or is “background to” a black hole. 

Turn from one motion of one object in one spacetime to a continuous one-param¬ 
eter family of spacetimes, with the mass m of the object being the parameter that 
distinguishes one of these solutions of Einstein’s field equation from another. Go 
to the limit m = 0. Then the size of the buffer zone shrinks to zero and the departure 
of the metric from Minkowskian perfection in the buffer zone also goes to zero. 
In this limit (“test particle”), it makes sense to say that the object moves in a straight 
line with uniform velocity in the local inertial frame or, otherwise stated, it pursues 
a geodesic in the background geometry. Moreover, this background geometry is 
well-defined: it is the limit of the spacetime geometry as the parameter m goes to 
zero [see Infeld and Schild (1949)]. In this sense, the geodesic equation of motion 
follows as an inescapable consequence of Einstein’s field equation. 

The concept of “background” as limit of a one-parameter family of spacetimes 
extends itself to the case where the object bears charge as well as mass, and where 
the surrounding space is endowed with an electromagnetic field. This time the 
one-parameter family consists of solutions of the combined Einstein-Maxwell equa¬ 
tions. The charge-to-mass ratio e/m is fixed. The mass m is again the adjustable 
parameter. In the limit when m goes to zero, one is left with (1) a background 
geometry, (2) a background electromagnetic field, and (3) a world line that obeys 


The sense in which no body 
can move on a geodesic of 
spacetime 


The sense in which test 
particles do move on 
geodesics of a background 
geometry 


Motion of a charged test 
particle in curved spacetime 
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References on derivation of 
equations of motion from 
Einstein field equation 


Quantum mechanical 
limitations on the derivation 


EXERCISES 


the general-relativity version of the Lorentz equation of motion in this background 
as a consequence of the field equations [Chase (1954)]. In contrast, a so-called 
“unified field theory of gravitation and electromagnetism” that Einstein tentatively 
put forward at one stage of his thinking, as a conceivable alternative to the combina¬ 
tion of his standard 1915 geometrodynamics with Maxwell’s standard electrody¬ 
namics, has been shown [Callaway (1953)] to lead to the wrong equation of motion 
for a charged particle. It moves as if uncharged no matter how much charge is piled 
on its back. If that theory were correct, no cyclotron could operate, no atom could 
exist, and life itself would be impossible. 

Thus the ability to yield the correct equation of motion of a particle has today 
become an added ace in the hand of general relativity. The idea for such a treatment 
dates back to Einstein and Grommer (1927). Corrections to the geodesic equation 
of motion arising from interaction between the spin of the object (when it has finite 
dimensions) and the curvature of the background geometry are treated by Papa- 
petrou (1951) and more completely by Pirani (1956) (see exercise 40.8). A book on 
the subject exists [Infeld and Plebanski (I960)]. Section 40.9 describes how corrections 
to geodesic motion affect lunar and planetary orbits. Some of the problems that 
arise when the object under study fragments or emits a directional stream of radia¬ 
tion, and unresolved issues of principle, are discussed by Wheeler (1961). 

When one turns from the limit of infinitesimal mass to an object of finite mass, 
no simpler situation presents itself then a system of uncharged black holes (Chapter 
33). Everything about the motion of these objects follows from an application of 
the source-free Einstein equation G = 0 to the region of spacetime outside the 
horizons (see Chapter 34) of the several objects. The theory of motion is then 
geometrodynamics and nothing but geometrodynamics. 

It has to be emphasized that all the considerations on motion in this section are 
carried out in the context of classical theory. In the real world of quantum physics, 
the geometry everywhere experiences unavoidable, natural, zero-point fluctuations 
(Chapter 43). The calculated local curvatures associated with these fluctuations at 
the Planck scale of distances [L = (fiG/c 3 ) 1/2 — 1.6 X 10“ 33 cm] are enormous 
[R — 1 /L 2 ~~ 0.4 X 10 66 cm -2 ] compared to the curvature produced on much larger 
scales by any familiar object (electron or star). No detailed analysis of the interaction 
of these two curvatures has ever been made. Such an analysis would define a 
smoothed-out average of the geometry over regions larger than the local quantum 
fluctuations. With respect to this average geometry, the object will follow geodesic 
motion: this is the expectation that no one has ever seen any reason to question—but 
that no one has proved. 


Exercise 20.6. SIMPLE FEATURES OF THE ELECTROMAGNETIC FIELD 
AND ITS STRESS-ENERGY TENSOR 

(a) Show that the “scalar” — \/2F aB F ap (invariant with respect to coordinate transforma¬ 
tions) and the “pseudoscalar” 1/4 F aB *F afi (reproduces itself under a coordinate transfor¬ 
mation up to a ± sign, according as the sign of the Jacobian of the transformation is positive 
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or negative) have in any local inertial frame the values E 2 - B 2 and E * B, respectively (“the 
two Lorentz invariaiits” of the electromagnetic field). 

(b) Show that the Poynting flux ( E X B)/ 4tt is less in magnitude than the energy density 
(■ E2 + £ 2 )/8?r, save for the exceptional case where both Lorentz invariants of the field vanish 
(case where the field is locally “null”). 

(c) A charged pith ball is located a small distance from the North Pole of a bar magnet. 
Draw the pattern of electric and magnetic lines of force, indicating where the electromagnetic 
field is “null” in character. Is it legitimate to say that a “null field” is a “radiation field”? 

(d) A plane wave is traveling in the z-direction. Show that the corresponding electromag¬ 
netic field is everywhere null. 

(e) Show that the superposition of two monochromatic plane waves traveling in different 
directions is null on at most a set of points of measure zero. 

(f) In the “generic case” where the field (E, B) at the point of interest is not null, show 
that the Poynting flux is reduced to zero by viewing the field from a local inertial frame 
that is traveling in the direction of E X B with a velocity 

v = tanh a, (20.57) 

where the velocity parameter a is given by the formula 


tanh 2a = ( p °y ntin g flux ) = 2\E X B\ 
(energy density) E 2 + B 2 ' 


(20.58) 


(g) Show that all components of the electric and magnetic field in this new frame can 
be taken to be zero except E x and B x . 

(h) Show that the 4 x 4 determinant built out of the components of the field in mixed 
representation, F a &, is invariant with respect to general coordinate transformations. (Hint: 
Use the theorem that the determinant of the product of three matrices is equal to the product 
of the determinants of those three matrices.) 

(i) Show that this determinant has the value — (E • B) 2 by evaluating it in the special local 
inertial frame of (f). 

(j) Show that in this special frame the Maxwell stress-energy tensor has the form 



-1 

0 

0 

0 

£ x 2 + */-’ 

0 

-1 

0 

0 

8.7 

0 

0 

+ 1 

0 


0 

0 

0 

+ 1 


(20.59) 


(Faraday tension along the lines of force; Faraday pressure at right angles to the lines of 
force). 


(k) In the other case, where the field is locally null, show that one can always find a local 
inertial frame in which the field has the form E = (0, F y 0), B = (0, 0, F) and the stress-energy 
tensor has the value 


H P‘,11 



-1 

1 

0 

0 

F 2 

-1 

1 

0 

0 

4tt 

0 

0 

0 

0 


0 

0 

0 

0 


(fi for row, v for column). 


(20.60) 


(1) Regardless of whether the electromagnetic field is or is not null, show that the Maxwell 
stress-energy tensor has zero trace, = 0, and that its square is a multiple of the unit 
tensor, 

[(E 2 - B 2 ) 2 + (2E-B) 2 ) 

= -^[( £2 + £2 > 2 - < 2£ x £ ) 2 1 - 


( 20 . 61 ) 
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Exercise 20.7. THE STRESS-ENERGY TENSOR DETERMINES THE 

ELECTROMAGNETIC FIELD EXCEPT FOR ITS COMPLEXION 

(a) Given a non-zero symmetric 4x4 tensor which has zero trace T - 0 and whose 
square is a multiple, A/ 4 /(M 2 « of the unit matrix, show that, according as this multiple is 
zero (“null case”) or positive, the tensor can be transformed to the form (20.60) or (20.59) 
by a suitable rotation in 3-space or by a suitable choice of local inertial frame, respectively. 

(b) In the generic (non-null) case in the frame in question, show that T* 9 is the Maxwell 
tensor of the “extremal electromagnetic field” ( MP with components 

£<extremal) _ Q, 0), 

^(extremal) _ (Q, Q). (20.62) 

Show that it is also the Maxwell tensor of the “dual extremal field” with components 

♦^extremal) _ ^ 0, 0), 

*/^extremal) _ (A/, 0, 0). (20.63) 

(c) Recalling that the duality operation * applied twice to an antisymmetric second-rank 
tensor (2-form) in four-dimensional space leads back to the negative of that tensor, show 
that the operator e* a (“duality rotation”) has the value 

e’ a = (cos a) + (sin a)*. (20.64) 

(d) Show that the most general electromagnetic field which will reproduce the non-null 
tensor T^ in the frame in question, and therefore in any coordinate system, is 

F„ = e'%,. (20.65) 

(e) Derive a corresponding result for the null case. [The field F 7 fiv defined in the one frame 
and therefore in every coordinate system by (d) and (e) is known as the “Maxwell square 
root” of T**\ is known as the “extremal Maxwell square root” of T and the angle 
a is called the “complexion of the electromagnetic field.” See Misner and Wheeler (1957); 
see also Boxes 20.1 and 20.2, adapted from that paper.] 


r Box 20.1 CONTRAST BETWEEN PROPER LORENTZ TRANSFORMATION 
AND DUALITY ROTATION 


"\ 


Quantity 

Genera / proper 

Duality 

Lorentz transformation 

rotation 

Components of the Maxwell stress-energy tensor or 
the “Maxwell square” of the field F 

Transformed 

Unchanged 

The invariants E 2 — B 2 and (E • B ) 2 

Unchanged 

Transformed 

The combination [( E 2 — B 2 ) 2 4- (2£• B) 2 ] = 

KE 2 + B 2 ) 2 - (IE X Bf] 

Unchanged 

Unchanged 



§20.6. EQUATIONS OF MOTION DERIVED FROM FIELD EQUATION 


483 


- ^ 

Box 20.2 TRANSFORMATION OF THE GENERIC (NON-NULL) 

ELECTROMAGNETIC FIELD TENSOR F = (E. B) IN 
A LOCAL INERTIAL FRAME 


Field values 


At start 


After simplifying 
duality rotation 


At start 


£, B 


After simplifying Loren tz 
transformation 


v 


E and B parallel to each other 
and parallel to x-axis 


E and B perpendicular, and E 
greater than B 

E parallel to jt-axis and 5 = 0 




Exercise 20.8. THE MAXWELL EQUATIONS CANNOT BE DERIVED FROM 
THE LAW OF CONSERVATION OF STRESS-ENERGY WHEN 
(E-B) = 0 OVER AN EXTENDED REGION 

Supply a counter-example to the idea that the Maxwell equations, 

F*' ;9 = 0, 

follow from the Einstein equation; or, more precisely, show that (1) the condition that the 
Maxwell stress-energy tensor should have a vanishing divergence plus (2) the condition that 
this Maxwell field is the curl of a 4-potential A M can both be satisfied, while yet the stated 
Maxwell equations are violated. [Hint: It simplifies the analysis without obscuring the main 
point to consider the problem in the context of flat spacetime. Refer to the paper of Teitelboim 
(1970) for the decomposition of the retarded field of an arbitrarily accelerated charge into 
two parts, of which the second, there called F^jj, meets the stated requirements, and has 
everywhere off the worldline (E-B) = 0, but does not satisfy the cited Maxwell equations.] 


Exercise 20.9. EQUATION OF MOTION OF A SCALAR FIELD AS 

CONSEQUENCE OF THE EINSTEIN FIELD EQUATION 

Tire stress-energy tensor of a massless scalar field is taken to be 

= (l/4wX^, r - 1/2 

Derive the equation of motion of this scalar field from Einstein’s field equation. 


( 20 . 66 ) 
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Whenever any action occurs in nature, the quantity of action 
employed by this change is the least possible. 

PIERRE MOREAU DE MAUPERTUIS (1746) 

In the theory of gravitation, as in all other branches of 
theoretical physics, a mathematically correct statement of a 
problem must be determinate to the extent allowed by the nature 
of the problem ; if possible, it must ensure the 
uniqueness of its solution. 
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Things are es they are because they were as they were. 
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Cafculemus 
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This chapter is entirely Track 2. 
No earlier Track-2 material is 
needed as preparation for it, 
but Chapters 9-11 and 13-15 
will be helpful. It is needed as 
preparation for Box 30.1 
(mixmaster universe) and for 
Chapters 42 and 43. 


§21.1. DYNAMICS REQUIRES INITIAL-VALUE DATA 

No plan for predicting the dynamics of geometry could be at the same time more 
mistaken and more right than this: “Give the distribution of mass-energy; then solve 
Einstein’s second-order equation, 

G = 8tt7; (21.1) 

for the geometry.” Give the distribution of mass-energy in spacetime and solve for 
the spacetime geometry? No. Give the fields that generate mass-energy, and their 


To Karel Kuchar, Claudio Teitelboim, and James York go warm thanks for their collaboration in 
the preparation of this chapter, and for permission to draw on the lecture notes of K. K. and to quote 
results of K. K. [especially exercise 21.10] and of J. Y. [especially equations (21.87), (21.88), and (21.152)] 
prior to publication elsewhere. 
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time-rates of change, and give 3-geometry of space and its time-rate of change, all 
at one time, and solve for the 4-geometry of spacetime at that one time? Yes. And 
only then let one’s equations for geometrodynamics and field dynamics go on to 
predict for all time, in and by themselves, needing no further prescriptions from 
outside (needing only work!), both the spacetime geometry and the flow of mass- 
energy throughout this spacetime. This, in brief, is the built-in “plan” of geometro¬ 
dynamics, the plan spelled out in more detail in this chapter. 

Contest the plan. Point out that the art of solving any coupled set of equations 
lies in separating the unknowns from what is known or to be prescribed. Insist that 
this separation is already made in (21.1). On the right already stands the source 
of curvature. On the left already stands the receptacle of curvature in the form of 
what one wants to know, the metric coefficients, twice differentiated. Claim therefore 
that one has nothing to do except to go ahead and solve these equations for the 
metric coefficients. However, in analyzing the structure of the equations to greater 
depth [see Cartan (1922a) for the rationale of analyzing a coupled set of partial 
differential equations], one discovers that one can only make the split between “the 
source and the receptacle” in the right way when one has first recognized the still 
more important split between “the initial-value data and the future.” Thus—to 
summarize the results before doing the analysis—four of the ten components of 
Einstein’s law connect the curvature of space here and now with the distribution 
of mass-energy here and now, and the other six equations tell how the geometry 
as thus determined then proceeds to evolve. 

In determining what are appropriate initial-value data to give, one discovers no 
guide more useful than the Hilbert variational principle, 

/ = J £ d 4 x — J L( — g) 1/2 d 4 x — J L ^(proper 4-volume) = extremum (21.2) 

(-{exercise 8.16] 


or the Amowitt-Deser-Misner (“ADM”) variant of it (§21.6) and generalizations 
thereof by Kuchar (§21.9). Out of this principle one can recognize most directly 
what one must hold fixed at the limits (on an initial spacelike hypersurface and 
on a final spacelike hypersurface) as one varies the geometry (§21.2) throughout 
the spacetime “filling of this sandwich,” if one is to have a well-defined extremum 
problem. 

The Lagrange function L (scalar function) or the Lagrangian density £ — 
(— g) 1/2 L (quantity to be integrated over coordinate volume) is built of geometry 
alone, when one deals with curved empty space, but normally fields are present as 
well, and contribute also to the Lagrangian; thus. 


e = 


geom 

L = 


4” A field 
^geom ^ 


(~g) l/2 L; 

field* 


(213) 


The variation of the field Lagrangian with respect to the typical metric coefficient 
proves to be, of all ways, the one most convenient for generating (that is, for 
calculating) the corresponding component of the symmetric stress-energy tensor of 
the field (§21.3). 


Give initial data, predict 
geometry 


Four of ten components of 
Einstein equation are 
conditions on initial-value 
data 


New view of stress-energy 
tensor 
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A computer, allowing for the effect of this field on the geometry and computing 
ahead from instant to instant the evolution of the metric with time, imposes its own 
ordering on the events of spacetime. In effect, it slices spacetime into a great number 
of spacelike slices. It finds it most convenient (§21.4) to do separate bookkeeping 
on (1) the 3-geometry of the individual slices and (2) the relation between one such 
slice and the next, as expressed in a “lapse function” N and a 3-vector “shift func¬ 
tion” N t . 

The 3-geometry internal to the individual slice or “simultaneity” defines in and 
by itself the three-dimensional Riemannian curvature intrinsic to this hypersurface; 
but for a complete account of this hypersurface one must know also the extrinsic 
curvature (§21.5) telling how this hypersurface is curved with respect to the envelop¬ 
ing four-dimensional spacetime manifold. 

In terms of the space-plus-time split of the 4-geometry, the action principle of 
Hilbert takes a simple and useful form (§21.6). 

In the most elementary example of the application of an action principle in 
mechanics, where one writes 


/ X t t 

L(dx/dt, x, t ) dt 

x',t' 


(21.4) 


and extremizes the integral, one already knows that the resultant “dynamic path 
length” or “dynamic phase” or “action,” 


S(x, t) = / extremU m> (21.5) 

is an important quantity, not least because it gives (up to a factor h) the phase of 
the quantum-mechanical wave function. Moreover, the rate of change of this action 
function with position is what one calls momentum, 


p = dS(x, t)/dx; (21.6) 

and the (negative of the) rate of change with time gives energy (Figure 21.1), 

E= -a S(x,t)/dt; (21.7) 

and the relation between these two features of a system of wave crests, 

Hamiltonian as a dispersion p _ pr( n v \ _ 

relation £ - H{p,x), (21.8) 

call it dispersion relation” or call it what one will, is the central topic of mechanics. 

When .dealing with the dynamics of geometry in the Arnowitt-Deser-Misner 
formulation,* one finds it convenient to think of the specified quantities as being 

Historical remark. No one knew until recently what coordinate-free geometric-physical quantity really 
is fixed at limits in the Hilbert-Palatini variational principle. In his pioneering work on the Hamiltonian 
formulation of general relativity, Dirac paid no particular attention to any variational principle. He had 
to generalize the Hamiltonian formalism to accommodate it to general relativity, introducing “first- and 
second-class constraints” and generalizations of the Poisson brackets of classical mechanics. The work 
of Amowitt, Deser, and Misner, by contrast, took the variational principle as the foundation for the 
whole treatment, even though they too did not ask what it is that is fixed at limits in the sense of 



Figure 21.1. 

Momentum and (the negative of the) energy viewed as rate of change 
of “dynamic phase” or “action,” 


c/ f / ^ /extremum\ r xt r/ . . . 

S x ' ^ ~ l) -{ value of )j[,,, L X ' X ' ^ dl ’ 


( 1 ) 


with respect to position and time; thus, 

55 = p Sx - E 8t . (2) 

The variation of the integral I with respect to changes of the history 
along the way, 8x(t ), is already zero by reason of the optimization 
of the history; so the only change that takes place is 


8S = 8I t 


8Ldt 


im = 5/ + J 

x\l 

x + Ax,t i £L ZL V 

= L 8t + I [ — Si q-‘Sx) dt 

V dx dx I 

SL r *+s*-t / dL d dL\ 

= L 8t H- Ax + I (-1 Sx dl. 

dx Jj', V ax dt dx l 


(3) 


[ zero by reason "1 T 
of extremizatiory^ 


When one contemplates only a change Sx in the coordinates (x, 0 
of the end point (change of history from 6$ to 6%\ one has Ax = Sx. 
When one makes only a change 5/ in the end point (change of history 
from 69 to <95), one has Ax = (indicator of change from 9 to 9) — 
—x 5/. For the general variation of the final point, one thus has 
Ax ~ Sx — x 5/ and 


55 


9 L ( 9 L \ 

=-Sx — [ x- L ) 5/. 

9x \ 9x / 



One concludes that the “dispersion relation” is obtained by taking the relations [compare (2) and 
/ rate of change of\ 


dynamic phase 
i with position 


= (momentum) = p = 


cL(x, x, t) 

Tx 


and 


( rate of change of ^ 
dvnamic phase 
with time 


cL 

— (energy) — E — x —— L, 


and eliminating x from them [solve (5) for x and substitute that value of x into (6)]; thus 

E — H(p , x, /) 


(4) 

(4) ] 

(5) 

( 6 ) 

(7) 


it \dx / 

Every feature of this elementary analysis has its analog in geometrodynamics. 


( 8 ) 


a coordinate-free geometric-physical quantity. The great payoff of this work was recognition of the lapse 
and shift functions of equation (21.40) as Lagrange multipliers, the coefficients of which gave directly 
and simply Dirac’s constraints. They did not succeed in arriving at a natural and simple time-coordinate, 
but that goal has in the meantime been achieved in the “extrinsic time” of Kuchar and York (§21.11). 
However, the Arnowitt-Deser Misner approach opened the door to the “intrinsic time” of Sharp, 
Baierlein, and Wheeler, where 3-geometry is fixed at limits, and 3-geometry is the carrier of information 
about time; and this led directly to Wheeler’s “superspace version” of the treatment of Arnowitt. Deser, 
and Misner. 
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Action viewed as dependent 
on 3-geometry 


Hamiltonian versus 
super-Hamiltonian 


the 3-geometry <3, £ of the initial spacelike hypersurface and the 3-geometry <3) -?j of 
the final spacelike hypersurface. One envisages the action integral as extremized 
with respect to the choice of the spacetime that fills the “sandwich” between these 
two faces. If one has thus determined the spacetime, one has automatically by that 
very act determined the separation in proper time of the two hypersurfaces. There 
is no additional time-variable to be brought in or considered. The one concept <3) & 
thus takes the place in geometrodynamics of the two quantities x, / of particle 
dynamics. The action S that there depended on x and t here depends on the 
3-geometry of the face of the sandwich; thus, 

5 = S(< 3 >£). ( 21.9) 

A change in the 3-geometry changes the action. The amount of the change in action 
per elementary change in 3-geometry defines the “field momentum” wj> ue conjugate 
to the geometrodynamic field coordinate g ijt according to the formula 6 

SS = f " J^ue $g,j d 3 X. (21.10) 

Comparing this equation out of the Arnowitt, Deser, and Misner (ADM) canonical 
formulation of geometrodynamics (§21.7) with the expression for change of action 
with change of endpoint in elementary mechanics, 

SS = pSx - E8t, (21.11) 

one might at first think that something is awry, there being no obvious reference 
to time in (21.10). However, the 3-geometry is itself automatically the carrier of 
information about time; and (21.10) is complete. Moreover, with no “time” variable 
other than the information that <3, £ itself already carries about time, there is also 
no “energy.” Thus the “dispersion relation” that connects the rates of change of 
action with respect to the several changes that one can make in the “field coordinates” 
or 3-geometry takes the form 


= 0 , ( 21 . 12 ) 

with the E-term of (21.8) equal to zero (details in §21.7). All the content of Einstein’s 
general relativity can be extracted from this one Hamiltonian, or “super-Hamilton- 
lan,” to give it a more appropriate name [see DeWitt (1967a), pp. 1113-1118, for 
an account of the contributions of Dirac, of Arnowitt, Deser, and Misner, and of 
others to the Hamiltonian formulation of geometrodynamics; and see §21.7 and 
subsequent sections of this chapter for the meaning and payoffs of this formulation]. 
The difference between a Hamiltonian and a super-Hamiltonian [see, for example, 
Kramers (1957)] shows nowhere more clearly than in the problem of a charged 
particle moving in flat space under the influence of the field derived from the 
electromagnetic 4-potential, A^x"). The Hamiltonian treatment derives the equation 
of motion from the action principle, 

0 = 81 = S f [Pi - H (Pi - 0] dt 



§21.1. DYNAMICS REQUIRES INITIAL-VALUE DATA 489 

with 

H= + [ mZ + rf’iPi + ^ A ^P> + 7'^)] ‘ 

The super-Hamiltonian analysis gets the equations of motion from the action 
principle 

0 = SI' = s f \p^ - 0C{p a , *0)] d\. 

Here the super-Hamiltonian is given by the expression 

XiPa,* 1 *) ~ j [ w2 + V"(Pt + f + f^)]' 

The variational principle gives Hamilton’s equations for the rates of change 

dx a /dX — dDC/dp a 


and 

dp^/dX = -dSC/dx*. 

From these equations, one discovers that % itself must be a constant, independent 
of the time-like parameter X. The value of this constant has to be imposed as an 
initial condition, % = 0 (“specification of particle mass”), thereafter maintained by 
the Hamiltonian equations themselves. This vanishing of % in no way kills the partial 
derivatives, 


dOC/dp a and — dX/dx^ y 
that enter Hamilton’s equations for the rates of change, 

dx a /dX and dp^/dX. 

Whether derived in the one formalism or the other, the equations of motion are 
equivalent, but the covariance shows more clearly in the formalism of the super- 
Hamiltonian, and similarly in general relativity. 

Granted values of the “field coordinates” g^x^y^z) ( (3) £>) and field momenta 
^trueC- x >y > 1 ) = SS/Sgy compatible with (21.12), one has what are called “compatible 
initial-value data on an initial spacelike hypersurface.” One can proceed as described 
in §21.8 to integrate ahead in time step by step from one spacelike hypersurface 
to another and another, and construct the whole 4-geometry. Here one is dealing 
with what in mathematical terminology are hyperbolic differential equations that 
have the character of a wave equation. 

In contrast, one deals with elliptic differential equations that have the character 
of a Poisson potential equation when one undertakes in the first place to construct 
the needed initial-value data (§21.9). In the analysis of these elliptic equations, it 


Dynamic evolution 
geometry 
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Another choice of what to fix 
at boundary hypersurface: 
conformal part of 3-geometry 
plus extrinsic time 


Mach updated: mass-energy 
there governs inertia here 


proves helpful to distinguish in the 3-geometry between (1) the part of the metric 
that determines relative lengths at a point, which is to say angles (“the conformal 
part of the metric”) and (2) the common multiplicative factor that enters all the 
components of the g tj at a point to determine the absolute scale of lengths at that 
point. This breakdown of the 3-geometry into two parts provides a particularly simple 
way to deal with two special initial-value problems known as the time-symmetric 
and time-antisymmetric initial-value problems (§21.10). 

The ADM formalism is today in course of development as summarized in §21.11. 
In Wheeler’s (1968a) “superspace” form, the ADM treatment takes the 3-geometry 
to be fixed on each of the bounding spacelike hypersurfaces. In contrast, York 
(§21.11) goes back to the original Hilbert action principle, and discovers what it 
takes to be fixed on each of the bounding spacelike hypersurfaces. The appropriate 
data turn out to be the “conformal part of the 3-geometry” plus something closely 
related to what Kuchaf (1971a and 1972) calls the “extrinsic time.” The contrast 
between Wheeler’s approach and the Kuchaf-York approach shows particularly 
clearly when one (1) deals with a flat spacetime manifold, (2) takes a flat spacelike 
section through this spacetime, and then (3) introduces a slight bump on this slice, 
of height e. The 3-geometry intrinsic to this deformed slice differs from Euclidean 
geometry only to the second order in e. Therefore to read back from the full 3-geom¬ 
etry to the time (“the forward advance of the bump”) requires in this case an 
operation something like extracting a square root. In contrast, the Kuchaf-York 
treatment deals with the “extrinsic curvature” of the slice, something proportional 
to the first power of e, and therefore provides what is in some ways a more convenient 
measure of time [see especially Kuchaf (1971) for the construction of “extrinsic time” 
for arbitrarily strong cylindrical gravitational waves; see-also Box 30.1 on “time” 
as variously defined in “mixmaster cosmology”]. York shows that the time-variable 
is most conveniently identified with the variable “dynamically conjugate to the 
conformal factor in the 3-geometry.” ' ' " 

The initial-value problem of geometrodynamics can be formulated either in the 
language of Wheeler or m the language of Kuchaf and York. In either formulation 
(§21.9 or §21.11) it throws light on what one ought properly today to understand 
by Mach’s principle (§21.12). That principle meant to Mach that the “acceleration” 
dealt with in Newtonian mechanics could have a meaning only if it was acceleration 
with respect to the fixed stars or to something equally well-defined. It guided Einstein 
to general relativity. Today it is summarized in the principle that “mass-energy there 

governs inertia here,” and is given mathematical expression in the initial-value 
equations. 

The analysis of the initial-value problem connected past and future across a 
spacelike hypersurface. In contrast, one encounters a hypersurface that accommo¬ 
dates a timelike vector when one deals (§21.13) with the junction conditions between 
one solution of Einstein’s field equation (say, the Friedmann geometry interior to 
a spherical cloud of dust of uniform density) and another (say, the Schwarzschild 
geometry exterior to this cloud of dust). Section 21.13, and the chapter, terminate 
with notes on gravitational shock waves and the characteristic initial-value problem 
(the statement of initial-value data on a light cone, for example). 
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§21.2. THE HILBERT ACTION PRINCIPLE AND 
THE PALATINI METHOD OF VARIATION 


Five days before Einstein presented his geometrodynamic law in its final and 
now standard form, Hilbert, animated by Einstein’s earlier work, independently 
discovered (1915a) how to formulate this law as the consequence of the simplest 
action principle of the form (21.2-21.3) that one can imagine: 

^geom = (1/16 7 t) (4, R. (21.13) 

(Replace 1/1677 by c 3 /16ttG when going from the present geometric units to conven¬ 
tional units; or divide by h ~ L * 2 to convert from dynamic phase, with the units 
of action, to actual phase of a wave function, with the units of radians). Here (4) R 
is the four-dimensional scalar curvature invariant, as spelled out in Box 8.4. 

This action principle contains second derivatives of the metric coefficients. In 
contrast, the action principle for mechanics contains only first derivatives of the 
dynamic variables; and similarly only derivatives of the type 3 AJdx 13 appear in 
the action principle for electrodynamics. Therefore one might also have expected 
only first derivatives, of the form 3 g llv /dx y , in the action principle here. However, 
no scalar invariant lets itself be constructed out of these first derivatives. Thus, to 
be an invariant, L geom has to have a value independent of the choice of coordinate 
system. But in the neighborhood of a point, one can always so choose a coordinate 
system that all first derivatives of the g^ vanish. Apart from a constant, there is 
no scalar invariant that can be built homogeneously out of the metric coefficients 
and their first derivatives. 

When one turns from first derivatives to second derivatives, one has all twenty 
distinct components of the curvature tensor to work with. Expressed in a local inertial 
frame, these twenty components are arbitrary to the extent of the six parameters 
of a local Lorentz transformation. There are thus 20 - 6 = 14 independent local 
features of the curvature (“curvature invariants”) that are coordinate-independent, 
any one of which one could imagine employing in the action principle. However, 
(4, R is the only one of these 14 quantities that is linear in the second derivatives 
of the metric coefficients. Any choice of invariant other than Hilbert’s complicates 
the geometrodynamic law, and destroys the simple correspondence with the Newto¬ 
nian theory of gravity (Chapter 17). 

Hilbert originally conceived of the independently adjustable functions of x,y, z, t 
in the variational principle as being the ten distinct components of the metric tensor 
in contravariant representation, g**. Later Palatini (1919) discovered a simpler and 
more instructive listing of the independently adjustable functions: not the ten g 
alone, but the ten g*" plus the forty r*„ of the affine connection. 

To give up the standard formula for the connection r in terms of the metric g 
and let r “flap in the breeze” is not a new kind of enterprise in mathematical physics. 
Even in the simplest problem of mechanics, one can give up the standard formula 
for the momentum p in terms of a time-derivative of the coordinate x and also let 


Variational principle the 
simplest route to Einstein 
equation 


Scalar curvature invariant 
only natural choice 
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Idea of varying coordinate 
and momentum 
independently 


Variation of connection is a 
tensor 


p “flap in the breeze.” Then x(t) and p{t) become two independently adjustable 
functions in a new variational principle, 

r dx(t) 1 

1 ~ J L P ^~dt - H(p(t\x{t)j)^dt ~ extremum. (21.14) 

Happily, out of the extremization with respect to choice of the function p(t\ one 
recovers the standard formula for the momentum in terms of the velocity. The 
extremization with respect to choice of the other function, x(t\ gives the equation 
of motion just as does the more elementary variational analysis of Euler and La¬ 
grange, where x(t) is the sole adjustable function. A further analysis of this equiva¬ 
lence between the two kinds of variational principles in particle mechanics appears 
in Box 21.1. In that box, one also sees the two kinds of variational principle as applied 
to electrodynamics. 

To express the Hilbert variational principle in terms of the r* p and g a P regarded 
as the primordial functions of x , y , z, note that the Lagrangian density is 

4eo m (-<?) 1/2 = (1/167 = (1/167 r)g« (21.15) 

Here, as in any spacetime manifold with an affine connection, one has (Chap¬ 
ter 14) 


where 


R — /? x 


(21.16) 


= s/’V 3 *'* - w'V 3 ** + r h r h' - ( 2L1? ) 

and every r is given in advance (in a coordinate frame) as symmetric in its two 
lower indices. In order that the integral I of (21.2-21.3) should be an extremum, 
one requires that the variation in I caused by changes both in the g^ v and in the 
r$ should vanish; thus, 

0 = 81 = (I/I 677 ) f B[ S a PR afi (-g)^)d*x + f 8[L neid (—g) 1/2 ] d 4 x. (21.18) 

Consider now the variations of the individual factors in the first and second 
integrals in (21.18). The variation of the first factor is trivial, 8g a P. In the variation 
of the second factor, changes in the g af3 play no part; only changes in the r *s 
appear. Moreover, the variation 8r X p is a tensor even though r X p itself is not. Thus 
in the transformation formula 


rr_ Irx d* dx T , d 2 x x 1 dx y 

1 *0 — OT~= --=-*-, 

L dx a dx$ dx a dx$ldx x 


(21.19) 


the last term destroys the tensor character of any set of r x T individually, but subtracts 
out in the difference Sr x T between two alternative sets of l" s. Note that the variation 
the typical component of the curvature tensor consists of two terms of 

{continued on page 500) 
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Box 21.1 RATE OF CHANGE OF ACTION WITH DYNAMIC COORDINATE 
(= "MOMENTUM") AND WITH TIME, AND THE DISPERSION 
RELATION (= "HAMILTONIAN") THAT CONNECTS THEM 
IN PARTICLE MECHANICS AND IN ELECTRODYNAMICS 


A. PROLOG ON THE PARTICLE-MECHANICS ANALOG 
OF THE PALATINI METHOD 

In particle mechanics, one considers the history x = x(t) to be adjustable be¬ 
tween the end points ( x ', t’) and (x, t) and varies it to extremize the integral I = 
JL(x, x, t ) dt taken between these two limits. 

Expressed in terms of coordinates and momenta 
(see Figure 21.1), the integral has the form 


1 = f [px - H(p, x, 0] < 


where x(t) is again the function to be varied and 
p is only an abbreviation for a certain function of 
x and x; thus, p = dL(x, x, t)/dx. Viewed in this 
way, the variation, 6p(t), of the momentum is gov¬ 
erned by, and is only a reflection of, the variation 
Sx(t). 


Momentum Treated as Independently 
Variable 


There miraculously exists, however, quite another 
way to view the problem (see inset). One can re- 
gard x(t) and p(t) as two quite uncorrelated and 

independently adjustable functions. One abandons 

the formula p = dL(x, x, t)/dx, only to recover it, 
or the equivalent of it, from the new “independ- 
ent-coordinate-and-momentum version” of the 
variation principle. 

The variation of (1), as defined and calculated in this new way, becomes 

Demand that the coefficient of Sp vanish and have the sought-for new version, 

_ dH(p, x , t) 


Box 21.1 (continued) 

of the old relation, p = dL(x, x, t)/dx, between momentum and velocity. The van¬ 
ishing of the coefficient of Sx gives the other Hamilton equation, 


P = ~ 


dH(p, x, t) 


equivalent in content to the original Lagrange equation of motion, 

d dL dL n 

dt dx dx ~ °‘ (4) 

That p(t) in this double variable conception is—before the extremization!—a 
function of time quite separate from and independent of the function x(t) shows 
nowhere more clearly than in the circumstance that p(t) has no end point conditions 
imposed on it, whereas x' and x" are specified. Thus not only is the shape of the 
history subject to adjustment in x, p, t space in the course of achieving the extremum, 
but even the end points are subject to being slid along the two indicated lines in’ 
the inset, like beads on a wire. 


2. Action as Tool for Finding Dispersion Relation 

Denote by 5(x, t) the “action,” or extremal value of /, for the classical history that 
starts with (x\ P) and ends at (x, t) (= h times phase of de Broglie wave). To change 
the end points to (x -|- Sx, t ) makes the change in action 

SS = p Sx. ( 5 ) 

Thus momentum is “rate of change of action with dynamic coordinate.” 

To change the end point to 

(x -I- Sx, t + St) = ([x + x 6/] + [Sx — x 5/], t + St) (6) 

makes the change in action 

SS = p[Sx — x St] + L St = p Sx — H St. (I) 

Thus the Hamiltonian is the negative of “the rate of change of action with time.” 

In terms of the Hamiltonian H = H(p, x), the “dispersion relation” for de Broglie 
waves becomes 


dS rj( dS \ 

~it =H \u- x y 


In the derivation of this dispersion relation, one can profitably short-cut all talk 
of/?(t) and x(t) as independently variable quantities, and derive the result in hardly 



more than one step from the definition I = JL(x , x , r) Similarly in electrody¬ 
namics. 

The remainder of this box best follows a first perusal of Chapter 21. 

B. ANALOG OF THE PALATINI METHOD IN ELECTRODYNAMICS 

In source-free electrodynamics, one considers as given two spacelike hypersurfaces 
S' and S" 9 and the magnetic fields-as-a-function-of-position in each, B' and B" (this 
second field will later be written without the " superscript to simplify the notation). 
To be varied is an integral extended over the region of spacetime between the two 
hypersurfaces, 

Well = / ^Maxwell d 4 X = - ~ f F^'FJ-g)" 2 d 4 X. (9) 


1. Variation of Field on Hypersurface and Variation of Location 
of Hypersurface are Cleanly Separated Concepts 
in Electromagnetism 

The electromagnetic field F is the physically relevant quantity in electromagnetism 
(compare the 3-geometry in geometrodynamics). By contrast, the 4-potential A has 
no direct physical significance. A change of gauge in the potentials, 

- A^ 4- d\/dx* 

leaves unchanged the field components 

= dAJdx* — dA fl /dx v 

(compare the coordinate transformation that changes the g while leaving un¬ 
changed the (3) £). The variation of the fields within the body of the sandwich is 
nevertheless expressed most conveniently in terms of the effect of changes SA in 
the potentials. 

One also wants to see how the action integral is influenced by changes in the 
location of the upper spacelike hypersurface (“many-fingered time”). Think of the 
point of the hypersurface that is presently endowed with coordinates x,y, z, t(x,y, z) 
as being displaced to x, y 9 r, / 4- 8t(x,y 9 z). Now renounce this use of a privi¬ 
leged coordinate system. Describe the displacement of the simultaneity in terms 
of a 4-vector 8n (not a unit 4-vector) normal to the hypersurface 2. The ele¬ 
ment of 4-volume 612 included between the original upper face of the sandwich 
and the new upper face, that had in the privileged coordinate system the form 
(~g) 1/2 8t(x 9 y 9 z)d 3 x, in the notation of Chapter 20 becomes 

612 = 8n* d = (6/i * d 3 I) 9 (10) 

where the element of surface d 3 2^ already includes the previously listed factor 
(~£) 1/2 . 
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Box 21.1 ( continued) 


Counting together the influence of changes in the field values on the upper 
hypersurface and changes in the location of that hypersurface, one has 

SS = 81 extremal = —(1/16 w) J F^F^Sn • d 3 Z) 

upper-S’ 

+ (1/4 tt) J F* v AAd 3 2 v (11) 

upper I . 

t 

replace by 1 
its equivalent 

,(H - g ” a ^ ;a ) 

+ (1/477) f F*'., g) 1/2 d+c. 

4-volume 7 ' 


has to vanish 1 
because integral has 
t been extremized 

Simplify this expression by arranging the coordinates so that the hypersurface 
shall be a hypersurface of constant t, and so that lines of constant jc, y, z shall be 
normal to this hypersurface. Then it follows that the element of volume on that 
hypersurface contains a single nonvanishing component, d 3 Z 0 = (~g) 1/2 d 3 x. The 
antisymmetry of the field quantity F° 1 ' in its two indices requires that v be a space like 
label, / = 1, 2, 3. The variation of the action becomes 


8S 


_ J[(~g) 1/2 ^' 


4tt 


iO f 

-H-{ 


(~ g) V2JW 
477 


A i;0 ^ £ 


Maxwell 


J St j d 3 x. 


add and subtract 


(_£)l/2 F iO 


477 


i 0 8t) 

'A 


( 12 ) 


2. Meaning of Field "Momentum” in Electrodynamics 

Identify this expression with the quantity 

8S = f vm 8 A d 3 x - J DC 5 FI, ( 13 ) 

where 


* em — 


8S 


( “density of electromagnetic 1 

momentum dynamically canon¬ 
ically conjugate to A” i 


4t7 


£1 

4t7 


(14) 
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is a simple multiple of the electric field and where 


/“density of \ 

X = - — = I electromagnetic = (1/16 -n^F^F^, + 4 F i0 (A i]0 - ^ 0;j )] (15) 

\ Hamiltonian” / 

= (1/M(^ 2 + B 2 ). 

The concept of dynamic Hamiltonian density agrees with the usual concept of density 
of electromagnetic energy, despite the very different context in which the two 
quantities are derived and used. However, the canonical momentum 77 ^ has nothing 
directly whatsoever to do with the density of electromagnetic momentum as defined, 
for example, by the Poynting vector, despite the confusing similarity in the standard 
names for the two quantities. Note that there is no term SA 0 in (13); that is, 7 t° em ~ 0. 


3. Bubble Differentiation 


The “bubble differentiation” with respect to 
“many-fingered time” that appears in (15) was first 
introduced by Tomonaga (1946). One thinks of a 
spacelike hypersurface Z v a magnetic field B de¬ 
fined as a function of position on this hypersurface 
(by an observer on a world line normal to this 
hypersurface), and a prescription S that carries one 
from this information to a single number, the ac¬ 
tion. (Divided by ti , this action gives the phase of 
the “wave function” or “probability amplitude” 
for the occurrence of this particular distribution 
of field values over this particular hypersurface.) 
One goes to a second hypersurface I 2 (see inset), 
which is identical with except in the immediate 
vicinity of a given point. Take a distribution of 
field values over S 2 that is identical with the origi¬ 
nal distribution over I v “identity of location” 
being defined by means of the normal. Evaluate 
the difference, 8S, in the value of the dynamic 
phase or action in the two cases. Divide this differ¬ 
ence by the amount of proper 4-volume 8Q = 
j(8n * d 3 Z) contained in the “bubble” between the 
two hypersurfaces. Take the quotient, evaluate it 
in the limit in which the size of the bubble goes 
to zero, and in this way get the “bubble-time de¬ 
rivative,” SS/Sf?, of the action. 
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Box 21.1 (continued) 


What does it mean to say that the action, S, besides depending on the hypersurface, 
2, depends also on the distribution of the magnetic field, B, over that hypersurface? 
The action depends on the physical quantity, B = V x A, not on the prephysical 
quantity, A. Thus a change in gauge 8A { — d\/dx\ cannot make any change in S. 

On the other hand, the calculated value of the change in S for this alteration in 
A is 


8A: d 3 x 


S(action) = 8S = f- 

J 8A t 

= = ~ / (If), 2) d3x - 


(16) 


In order that there shall be no dependence of action on gauge, it follows that this 
expression must vanish for arbitrary A (x,y,z), a result only possible if S(2, B) = 
5(hypersurface, field on hypersurface) satisfies the identity 



~ ^Eu.i — ~ (I/4ir)^* { = 0. 


(17) 


4. Hamilton-Jacobi Propagation Law" for Electrodynamics 

The dispersion relation” or “Hamilton-Jacobi equation” for electromagnetism 
relates ( 1 ) the changes of the “dynamic phase” or “action” brought about by altera¬ 
tions in the dynamic variables A i (the generalization of the x of particle dynamics) 
with ( 2 ) the changes brought about by alterations in many-fingered time (the gener¬ 
alization of the single time t of particle dynamics); thus (15) translates into 


SS 

Jq 


(4tt) 2 /35 \ 2 
877 \S,4 / 


1 

(8tt) 


(V xA) 2 


(18) 


C. DISPERSION RELATIONS FOR GEOMETRODYNAMICS AND 
ELECTRODYNAMICS COMPARED AND CONTRASTED 

Geometrodynamics possesses a direct analog of equation (17) (“action depends 
on no information carried by the vector potential A except the magnetic field 
B = V x A ”), in an equation that says the action depends on no information carried 
by the metric g tj on the “upper face of the sandwich” except the 3 -geometry there, 

-o. It also possesses a direct analog of equation (18) (“dynamic equation for the 
propagation of the action”) with this one difference: in electrodynamics the field 
variable B and the many-fingered time are distinct in character, whereas in geo¬ 
metrodynamics the “field” and the “many-fingered time” can be regarded as two 
aspects of one and the same (3) h: 

J 
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r 

D. ACTION PRINCIPLE AND DISPERSION RELATION ARE 
ROOTED IN THE QUANTUM PRINCIPLE; FEYNMAN'S 
PRINCIPLE OF THE DEMOCRATIC EQUALITY 
OF ALL HISTORIES 

For more on action principles in physics, see for example Mercier (1953), Lanczos 
(1970), and Yourgrau and Mandelstam (1968). 

Newton (1687) in the first page of the preface to the first edition of his Principia 
notes that “The description of right lines..., upon which geometry is founded, 
belongs to mechanics. Geometry does not teach us to draw these lines, but requires 
them to be drawn.” 

Newton s remark is also a question. Mechanics moves a particle along a straight 
line, but what is the machinery by which mechanics accomplishes this miracle? The 
quantum principle gives the answer. The particle moves along the straight line only 
by not moving along the straight line. In effect it “feels out” every conceivable world 
line that leads from the start, (x', f), to the point of detection, (x", t"), “compares” 

one with another, and takes the extremal world line. How does it accomplish this 
miracle? 

The particle is governed by a “probability amplitude to transit from (x\ t') to 
(x'\ O” This amplitude or “propagator,” <*", t"\x\ t'), is the democratic sum with 
equal weight of contributions from every world line that leads from start to finish; 
thus, 


(x", t"\x', t') z= N J e u « /n Tbc. (15) 

Here TV is a normalization factor, the same for all histories. 

Tbc is the “volume element” for the sum over histories. For a “skeleton history” 
defined by giving x n at t n = t 0 + n At, one has ^x equal, up to a multiplicative 
constant, to dx x dx 2 ... dx N . When the history is defined by the Fourier coefficients 
in such an expression as 


x(t) = 


*'(r - Q + x"(t - g) 
0 " - f) 


+ 2 a n sin m 
n 


(t - n 

it" - f) ’ 


(16) 


the volume element, again up to a multiplicative factor, is da x da 0 . 

Destructive interference in effect wipes out the contribution to the transition 
probability from histories that differ significantly from the “extremal history” or 
“classical history.” Histories that are near that extremal history, on the other hand, 
contribute constructively, and for a simple reason: a small departure of the first order 
from the classical history brings about a change in phase which is only of the second 
order in the departure. 

In this elementary example, one sees illustrated why it is that extremal principles 
play such a large part in classical dynamics. They remind one that all classical physics 
rests on a foundation of quantum physics. The central ideas are (1) the principle 
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Box 21.1 (continued) 


of superposition of probability amplitudes, (2) constructive and destructive interfer¬ 
ence, (3) the democracy of all histories,” and (4) the probability amplitude associ¬ 
ated with a history //is e ilH/n , apart from a normalizing factor that is a multiplicative 
constant. 

For more on the democracy of histories and the sum over histories see Feynman 
(1942, 1948, 1949, 1951, and 1955), and the book of Feynman and Hibbs (1965); 
also Hibbs (1951), Morette (1951), Choquard (1955), Polkinghorne (1955), Fujiwara 
(1962), and the survey and literature references in Kursunoglu (1962); also reports 
of Dempster (1963) and Symanzik (1963). This outlook has been applied by many 
workers to discuss the quantum formulation of geometrodynamics, the first being 
Misner (1957) and one of the latest being Faddeev (1971). 

---- J 


the form 5/' x a|S ^ and four terms of the form r Sr (indices being dropped for simplic¬ 
ity). One coordinate system is as good as another in dealing with a tensor. Therefore 
pick a coordinate system in which all the T’s vanish at the point under study. The 
terms f dr drop out. In this coordinate system, the variation of the curvature is 
expressed in terms of first derivatives of quantities like Sr One then need only 
replace the ordinary derivatives by covariant derivatives to obtain a formula correct 
in any coordinate system, 


- sr l;p, ( 21 . 20 ) 

along with its contraction, 

SR afi = - Sr^.0. ( 21 . 21 ) 

The third factor that appears in the variation principle is (— g) 1 ^. Its variation 
(exercise 21.1) is , 

s (~g) 1/2 = - J (-g) 1/2 g A „ Sgv. (21.22) 

The other integrand, the Lagrange density L field , will depend on the fields present 
and their derivatives, but will be assumed to contain the metric only as g^ v itself, 
never in the form of any derivatives of g**. 

In order for an extremum to exist, the following expression has to vanish: 

(1/16W )J[(^ - + £“W„;X - Sn^i-g^cPx 

■ + /(^pf - J g«0 L Beld) S S afi (~S) 1/2 d 4 X = 0 


(21.23) 
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Focus attention on the term in (21.23) that contains the variations of /*, 

0 / 16 *) f g“0(sr^ - sr> X;/} )(-g)V2 d* x , 

a nd integrate by parts to eliminate the derivatives of the S/"\ To prepare the way 
for this integration, introduce the concept of tensor density, a notational device widely 
applied in general relativity. The concept of tensor density aims at economy. Without 
this concept, one will treat the tensor 

W y = (-g) 1/2 [M«/?Y] 

(see exercise 3.13) as having 4 4 = 256 components, and its covariant derivative as 
having 4 5 = 1,024 components, of which one is 

e 0123;p ~ d( — g) /2 /3x p £ [0 i 23] — ^"op £ <rl23 — ^lo c 0ff23 

__ r° € _ r<* * 

2p t 01ff3 1 3p e 012a 

= [(-s) 1/2 , P - n P (-g) 1/2 )[om). 

The symbol [n/fyS], with values (0, —-1, +1), introduces what is largely excess bag¬ 
gage, doing mere bookkeeping on alternating indices. Drop this unhandiness. Intro¬ 
duce instead the non-tensor (—g) 1/2 and define for it the law of covariant differ¬ 
entiation, 

(~s) 1/2 ;p = (-s) 1/2 ,p - r° ap {-g?'*. (21.24) 

These four components take the place of the 1,024 components and communicate 
all the important information that was in them. 

Associated with the vector j is the vector density 

L = i-i) 1 / 2 j P > 

with the tensor T pv , the tensor density 

= (-s) 1/2 t;,; 

and so on; the German gothic letter is a standard indicator for the presence of the 
factor ( g) 1/2 - On some occasions (see, for example, §21.11) it is convenient to 
multiply the components of a tensor with a power of (— g) 1/2 other than 1. According 
to the value of the exponent, the resulting assemblage of components is then called 
a tensor density of this or that weight. 

The law of differentiation of an ordinary or standard tensor density formed from 
a tensor of arbitrary order, 


a::- = {-gf'*A::, 


is 


(3„ ).p _ (3- ) p + (standard r.. terms of a standard covariant 
derivative multiplied into 97) — (3")/'J p . 

The covariant derivative of a product is the sum of two terms: the covariant deriva- 


Concept of tensor density 
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tive of the first, times the second, plus the first times the covariant derivative of 
the second. 

Now return to the integral to be evaluated. Combine the factors g a & and ( — g) 1/2 
into the tensor density Integrate covariantly by parts, as justified by the rule 
for the covariant derivative of a product. Get a “term at limits,” plus the integral 

-(1/1677) J - 8£tf*y. y ) 8r k d*x. 

This integral is the only term in the action integral that contains the variations of 
the rs at the “interior points” of interest here. For the integral to be an extremum, 
the symmetrized coefficient of &T^ must vanish, 

8^;x-yW Y ; y-yS^" Y ;Y = 0. 

This set of forty equations for the forty covariant derivative g a ^. x has only the zero 
solution, 

B** ; x = 0- (21.25) 

Thus the “density formed from the reciprocal metric tensor” is covariantly constant. 

This simple result (1) brings many simple results in its train: the covariant con¬ 
stancy of (2) ( —g) 1/2 , (3) (4) g a p, and (5) g a/8 . Of these, (4) is of special interest 

here, and (2) is needed in proving it, as follows. Take definition (21.24) for the 
covariant derivative of (—g) 1/2 , and calculate the ordinary derivative that appears 
in the first term from exercise 21.1. One encounters in this calculation terms of the 
form dq a $/dx x . Use (21,25) to evaluate them, and end up with the result 

(^) 1/2 ;X = 0. 

From this result it follows that the covariant derivative of the (})-tensor density 
(—g) 1/2 i s also zero. But this tensor density is the product of the tensor density 
by the ordinary metric tensor g^ y . In the covariant derivative of this product 
by x x , one already knows that the derivative of the first factor is zero. Therefore 
the first factor times the derivative of the second must be zero, 


S^yjX = 0, 

and from this it follows that 

&P Y;X = 

as was to be proven; or, explicitly. 


d Sp y 
dx x 


~~ Syo^pX — Spa^yX — 0 - 


(21.26) 


Solve these equations for the r 9 s, which up to now have been independent of the 
gp y , and end up with the standard equation for the connection coefficients, 


^ & p (gjiCT,p "b &<rv,pL 


as required for Riemannian geometry. 


(21.27) 
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Similarly, equate to zero the coefficient of Sg a # in the variation (21.23), and find 
all ten components of Einstein’s field equation, in the form 


A* = 8-2^). 


_ 

"M" identified in §21.3 with "I 
I the stress-energy tensor T afj \ 

Among variations of the metric, one of the simplest is the change 

Snewfiv &)iv &iiv ~b T 

brought about by the infinitesimal coordinate transformation 


y-M = v/1 _ 
■*new A S • 


( 2128 ) 

(21.29) 

(21.30) 


Although the metric changes, the 3-geometry does not. It does not matter whether 
the spacetime geometry that one is dealing with extremizes the action principle or 
not, whether it is a solution of Einstein’s equations or not; the action integral I is 

a scalar invariant, a number, the value of which depends on the physics but not Action unaffected by mere 
at all on the system of coordinates in which that physics is expressed. This invariance chan 9 e in coordinatization 
even obtains for both parts of the action principle individually (/ geom and 7 fields ). 

Therefore neither part will be affected in value by the variation (21.29). In other 
words, the quantity 


SI. 


geom 


= (1/1677) f + £!>■.«)(- g)V 2 d * x 

= -(I/877) f G afi 'H a (-g) 1/2 d*x 
H[“covariant integration by parts”] 


(21.31) 


must vanish whatever the 4-geometry and whatever the change In this way, one 
sees from a new angle the contracted Bianchi identities of Chapter 15, 


<V* = 0. (21.32) 

The “neutrality” of the action principle with respect to a mere coordinate trans¬ 
formation such as (21.29) shows once again that the variational principle—and with 
it Einstein’s equation—cannot determine the coordinates or the metric, but only the 
4-geometry itself 


Exercise 21.1. VARIATION OF THE DETERMINANT OF THE METRIC TENSOR EXERCISE 

Recalling that the change in the value of any determinant is given by multiplying the change 
in each element of that determinant by its cofactor and adding the resulting products (exer¬ 
cise 5.5) prove that 

K~S) V2 = y {-g) V2 g^ Sg ur and 8(-g)W = - j(-g)\'%, Sg". 

Also show that 

^ = det IIb'“'M and 8(-g) V2 = + 5^'. 


f> c 
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Lagrangian generates 
stress-energy tensor 


Electromagnetism as an 
example 


Contrast to stress-energy 
tensor of "canonical field 
theory" 


§21.3. MATTER LAGRANGIAN AND STRESS-ENERGY TENSOR 

The derivation of Einstein’s geometrodynamic law from Hilbert’s action principle 
puts on the righthand side a source term that is derived from the field Lagrangian. 
In contrast, the derivation of Chapter 17 identified the source term with the stress- 
energy tensor of the field. For the two derivations to be compatible, the stress-energy 
tensor must be given by the expression 


T — field l r T 

field* 

(21.33a) 

or 


JvtB — fcaB _ o ^ field 

S ga/3 ' 

(21.33 b) 

What are the consequences of this identification? 



By the term Lagrange function of the field” as employed here, one means the 
Lagrange function of the classical theory as formulated in flat spacetime, with the 
flat-spacetime metric replaced wherever it appears by the actual metric, and with 
the “comma-goes-to-semicolon rule” of Chapter 16 applied to all derivatives. 

Were one dealing with a general tensorial field, the comma-goes-to-semicolon rule 
would introduce, in addition to the derivative of the tensorial field with all its indices, 
a number of C’s equal to the number of indices. The presence of these r *s in the 
field Lagrangian would have unhappy consequences for the Palatini variational 
procedure described in §21.2. No longer would the /*’s end up given in terms of 
the metric coefficients by the standard formula (21.27). No longer would the geom¬ 
etry, as derived from the Hilbert-Palatini variation principle, be Riemannian. Then 
what? 

These troublesome issues do not arise in two well-known simple cases, a scalar 
field and an electromagnetic field. In the one case, the field Lagrangian becomes 

Afield = (1/8 n)[—g a P(d<l>/dx a )(d<l>/dxP) - m 2 4> 2 ]. (21.34) 

No connection coefficient comes in; the quantity being differentiated is a scalar. In 
the other case, the field Lagrangian is built on first derivatives of the 4-potential 
Therefore r *s should appear, according to the standard rules for covariant 
differentiation (Box 8.4). However, the derivatives of the ^4’s appear, never alone, 
but always in an antisymmetric combination where the r*s cancel, making covariant 
derivatives equivalent to ordinary derivatives: 

F/tv A v - t n A^; V — A Vfl A^y. (21.35) 

In both cases, the differentiations of (21.33) to generate the stress-energy tensor 
are easily carried out (exercises 21.2 and 21.3) and give the standard expressions 
already seen [(5.22) and (5.23)] for T^ v in one of these two cases in an earlier chapter. 

Field theory provides a quite other method to generate a so-called canonical 
expression for the stress-energy tensor of a field [see, for example, Wentzel (1949)]. 
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By the very manner of construction, such an expression is guaranteed also to satisfy 
the law of conservation of momentum and energy, and by this circumstance it too 
becomes useful in certain contexts. However, the canonical tensor is often not 
symmetric in its two indices, and in such cases violates the law of conservation of 
angular momentum (see discussion in §5.7). Even when symmetric, it may give a 
quite different localization of stress and energy than that given by (21.33). Field 
theory in and by itself is unable to decide between these different pictures of where 
the field energy is localized. However, direct measurements of the pull of gravitation 
provide in principle [see, for example, Feynman (1964)] a means to distinguish 
between alternative prescriptions for the localization of stress-energy, because gravi¬ 
tation responds directly to density of mass-energy and momentum. It is therefore 
a happy circumstance that the theory of gravity in the variational formulation gives 
a unique prescription for fixing the stress-energy tensor, a prescription that, besides 
being symmetric, also automatically satisfies the laws of conservation of momentum 
and energy (exercises 21.2 and 21.3). [For an early discussion of the symmetrization 
of the stress-energy tensor, see Rosenfeld (1940) and Belinfante (1940). A more 
extensive discussion is given by Corson (1953) and Davis (1970), along with extensive 
references to the literature.] 

When one deals with a spinor field, one finds it convenient to take as the quantities 
to be varied, not the metric coefficients themselves, but the components of a tetrad 
of orthonormal vectors defined as a tetrad field over all space [see Davis (1970) for 
discussion and references]. 


Exercise 21.2. STRESS-ENERGY TENSOR FOR A SCALAR FIELD 

Given the Lagrange function (21.34) of a scalar field, derive the stress-energy tensor for this 
field. Also write down the field equation for the scalar field that one derives from this 
Lagrange function (in the general case where the field executes its dynamics within the arena 
of a curved spacetime). Show that as a consequence of this field equation, the stress-energy 
tensor satisfies the conservation law, T a ^ = 0. 

Exercise 21.3. FARADAY-MAXWELL STRESS-ENERGY TENSOR 

Given the Lagrangian density —F IIV F 11 '’/ 16ir, reexpress it in terms of the variables and 
8 and by use of (21.33) derive the stress-energy tensor as discussed in §5.6. Also derive 
from the Lagrange variation principle the field equation = 0 (curved spacetime, 
but—for simplicity—a charge-free region of space). As a consequence of this field equation, 
show that the Faraday-Maxwell stress-energy tensor satisfies the conservation law, T a S& = 0. 
For a more ambitious project, show that any stress-energy tensor derived from a field 
Lagrangian by the prescription of equation (21.33) will automatically satisfy the conservation 
law T a ^ = 0. 


§21.4. SPLITTING SPACETIME INTO SPACE AND TIME 

There are many ways to “push forward” many-fingered time and explore spacetime 
faster here and slower there, or faster there and slower here. However, a computer 
is most efficiently programmed only when it follows one definite prescription. The 


EXERCISES 
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Slice spacetime to compute 
spacetime 


Thin sandwich 4-geometry 



Building two 3-geometries into a thin sandwich 4-geometry, by interposing perpendicular connectors 
between the two, with preassigned lengths and shifts. What would otherwise be flexible thereupon 
becomes rigid. The flagged point illustrates equation (21.40). * 


successive hypersurfaces on which it gives the geometry are most conveniently 
described by successive values of a time-parameter t. One treats on a different footing 
the 3-geometries of these hypersurfaces and the 4-geometry that fills in between these 
laminations. 

The slicing of spacetime into a one-parameter family of spacelike hypersurfaces 
is called for, not only by the analysis of the dynamics along the way, but also by 
the boundary conditions as they pose themselves in any action principle of the form, 
Give the 3-geometries on the two faces of a sandwich of spacetime, and adjust 
the 4-geometry in between to extremize the action/ 5 ' 

There is no simpler sandwich to consider than one of infinitesimal thickness (Figure 
21.2). Choosing coordinates adapted to the (3 -f l)-space-time split, designate the 
“lower 55 (earlier) hypersurface in the diagram as t = constant and the “upper 55 (later) 
one as t + dt — constant (names, only names; no direct measure whatsoever of 
proper time). Compare the two hypersurfaces with two ribbons of steel out of which 
one wants to construct a rigid structure. To give the geometry on the two ribbons 
by no means fixes this structure; for that purpose, one needs cross-connectors between 
the one ribbon and the other. It is not even enough (1) to specify that these connectors 
are to be welded on perpendicular to the lower ribbon; (2) to specify where each 
is to be welded; and (3) to give its length. One must in addition tell where each 
connector joins the upper surface. If the proper distances between tops of the 
connectors are everywhere shorter than the distances between the bases of the 
connectors, the double ribbon will have the curve of the cable of a suspension bridge; 
if everywhere longer, the curve of the arch of a masonry bridge. The data necessary 
for the construction of the sandwich are thus (1) the metric of the 3-geometry of 
the lower hypersurface, 

x , y, z) dx l dx\ (21.36) 

telling the (distance) 2 between one point in that hypersurface and another; (2) the 
metric on the upper hypersurface, 
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gij(j "t" y, z) dx* dx 


(21.37) 


(3) a formula for the proper length, 


/lapse of \ 
proper time 

between lower = function”) * = W, x,y, z) dt, 
and upper 
\hypersurface / 


(21.38) 


of the connector that is based on the point (x,y, z) of the lower hypersurface; and 
(4) a formula for the place on the upper hypersurface, 


r C* m ) = **- N'(t, x,y, z) dt, 


(21.39) 


where this connector is to be welded. Omit part of this information, and find the 
structure deprived of rigidity. 

The rigidity of the structure of the thin sandwich is most immediately revealed 
in the definiteness of the 4-geometry of the spacetime filling of the sandwich. 
Ask for the proper interval ds or di between x a = ( t , x { ) and x a + dx a = 
(/ + dt, x % + dx { ). The Pythagorean theorem in its 4-dimensional form 


Metric of 4-geometry 
depends on lapse and shift of 
connectors of the two 
3-geometries 


/proper distance \ 2 _ /proper time from 

'in base 3-geometry/ \lower to upper 3-geometry, 


yields the result (see Figure 21.2). 

<* 2 = gyidx* + N * dt){dx’ + N’ dt) - (Ndt) 2 (21.40) 

Here as in (21.36) the g if are the metric coefficients of the 3-geometry, distinguished 
by their Latin labels from the Greek-indexed components of the 4-metric, 

<* 2 = U) g al 3 dx« dxP, (21.41) 

labeled here with a suffix (4) to reduce the possibility of confusion. Comparing (21.41) 

and (21.40), one arrives at the following construction of the 4-metric out of the Details of the 4-geometry 
3-metric and the lapse and shift functions [Amowitt, Deser, and Misner (1962)]: 

<4) Soo {4) go k |(A r s A rs - N 2 ) N k 

= • (21.42) 

<4, Sio U) g ik N t g ik 

The welded connectors do the job! 

In (21.42), the quantities N m are the components of the shift in its original primor¬ 
dial contravariant form, whereas the N, = gim N m are the covariant components, as 
calculated within the 3-geometry with the 3-metric. To invert this relation, 


yV m = g ms /Si 


(21.43) 
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is to deal with the reciprocal 3-metric, a quantity that has to be distinguished sharply 
from the reciprocal 4-metric. Thus, the reciprocal 4-metric is 

( 4 ) g 00 ( 4 ) g 0m — (l/N 2 ) ( N m /N 2 ) 

(21.44) 

(4)gfc0 (4 )gkm (N k /N 2 ) ( g km — N k N m / N 2 ) 

a result that one checks by calculating out the product 

“WV* = U \ y 

according to the standard rules for matrix multiplication. 

The volume element has the form 

(— (4) g) 1/2 dx° dx 1 dx 2 dx 3 — Ng 1/2 dt dx 1 dx 2 dx 3 . (21.45) 

Welding the connectors to the two steel ribbons, or adding the lapse and shift 
functions to the 3-metric, by rigidifying the 4-metric, also automatically determines 
the components of the unit timelike normal vector n. The condition of normalization 
on this 4-vector is most easily formulated by saying that there exists a 1-form, also 
called n for the sake of convenience, dual to n, and such that the product of this 
vector by this 1-form has the value 

(n, n) = — 1. (21.46) 

This 1-form has the value 


n — rip dx& — —Ndt -f 0 + 0 + 0. (21.47) 

Only so can this 1-form, this structure of layered surfaces, automatically yield a 
value of unity, one bong of the bell, when pierced as in Figure 2.4 by a vector that 
represents an advance of one unit in proper time, regardless of what x, y, and z 
displacements it also has. Thus the unit timelike normal vector in covariant 1-form 
representation necessarily has the components 

The components of the unit rip = ( —iV,0, 0, 0) (21.48) 

normal 

Raise the indices via (21.44) to obtain the contra variant components of the same 
normal, represented as a tangent vector; thus, 

n a = [(1/N), — (N m /N)]. (21.49) 

This result receives a simple interpretation on inspection of Figure 21.2. Thus the 
typical “perpendicular connector” in the diagram can be said to have the components 

(dt, —N m dt) 

and to have the proper length dr = Ndt; so, ratioed down to a vector n of unit 
proper length, the components are precisely those given by (21.49). 
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§21.5. INTRINSIC AND EXTRINSIC CURVATURE 

The central concept in Einstein’s account of gravity is curvature, so it is appropriate 
to analyze curvature in the language of the (3 + l)-space-time split. The curvature 
intrinsic to the 3-geometry of a spacelike hypersurface may be defined and calculated 
by the same methods described and employed in the calculation of four-dimensional 
curvature in Chapter 14. Of all measures of the intrinsic curvature, one of the 
simplest is the Riemann scalar curvature invariant i3) R (written for simplicity of 
notation in what follows without the prefix, as R); and of all ways to define this 
invariant (see Chapter 14), one of the most compact uses the limit (see exercise 21.4) 

proper area of a surface (approximately 
a 2-sphere) defined as the locus of the 
points at a proper distance c 

under study/ ~ ^ "-ZTJ- 


477£ 2 — 




/at point \ 


T irvs 1 Q 


Scalar curvature as measure 
of area deficit 


(21.50) 

For a more detailed description of the curvature intrinsic to the 3-geometry, 
capitalize on differential geometry as already developed in Chapters 8 through 14,’ 
amending it only as required to distinguish what is three-dimensional from what 
is four-dimensional. Begin by considering a displacement 

d& = e i dx i (21.51) 

within the hypersurface. Here the ej are the basis tangent vectors e { = d/dx i (in 
one notation) or e 4 = dtf/dx* (in another notation) dual to the three coordinate 
1-forms dx\ Any field of tangent vectors A that happens to lie in the hypersurface 
lets itself be expressed in terms of the same basis vectors; 

A = M 4 - (21.52) 

The scalar product of this vector with the base vector e. is 

(A • e,.) = A\e , • e } ) = A% = A } . (21.53) 

Now turn attention from a vector at one point to the parallel transport of the vector 
to a nearby point. 

A vector lying on the equator of the Earth and pointing toward the North Star, 
transported parallel to itself along a meridian to a point still on the Earth’s surface, 
but 1,000 km to the north, will no longer lie in the 2-geometry of the surface of 
the Earth. A telescope located in the northern hemisphere has to raise its tube to 
see the North Star! The generalization to a three-dimensional hypersurface imbedded 
in a 4-geometry is immediate. Take vector A, lying in the hypersurface, and transport 
it along an elementary route lying in the hypersurface, and in the course of this 
transport displace it at each stage parallel to itself, where “parallel” means parallel 
with respect to the geometry of the enveloping 4-manifold. Then A will ordinarily 
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end up no longer lying in the hypersurface. Thus the “covariant derivative” of A 
in the direction of the z-th coordinate direction in the geometry of the enveloping 
spacetime (that is, the A at the new point diminished by the transported A) has 
the form (see §10.4) - 

U)y eA = (4) YiA = (4 >v. (e .^) = e.M- + (21.54) 

A special instance of this formula is the equation for the covariantly measured change 
of the base vector e m itself, 


From parallel transport in 
4-geometry to parallel 
transport in 3-geometry 


A new covariant derivative, 
taken with respect to the 
3-geometry 


<4>V (21.55) 

In both (21.54) and (21.55) the presence of the “out-of-the-hypersurface component” 

(v4 i(4) r9.)(e 0 • n) (21.56) 

is quite evident. Now kill this component. Project i4) VA orthogonally onto the 
hypersurface. In this way arrive at a parallel transport and a covariant derivative 
that are intrinsic to the 3-geometry of the hypersurface. By rights this covariant 
derivative should be written (3) V; but for simplicity of notation it will be written 
as V in the rest of this chapter, except where ambiguity might arise. To get the 
value of the new covariant derivative, one has only to rewrite (21.54) with the suffix 
(4) replaced everywhere by a (3) , or, better, dropped altogether and with the “dummy 
index” of summation /x = (0, 1,2, 3) replaced by m = (1,2, 3). However, it is more 
convenient, following Israel (1966), to turn from an expression dealing with contra- 
variant components A i of A to one dealing with covariant components A i = (A * ej. 
Thus the covariant derivative of A in the direction of the z-th coordinate direction 
in the hypersurface, calculated with respect to the 3-geometry intrinsic to the hyper¬ 
surface itself, has for its /z-th covariant component the quantity [see equation (10.18)] 

dA 

= •»* (3) = e A • - A™r mhi (= A h;i for A in 2). (21.57) 

Here the notation of the vertical stroke distinguishes this covariant derivative from 
the covariant derivative taken with respect to the 4-geometry, as, for example, in 
equations (10.17ff). The connection coefficients here for three dimensions, like those 
dealt with earlier for four dimensions [see the equations leading from (14.14) through 
(14.15)], allow themselves to be expressed in terms of the metric coefficients and 
their first derivatives, and have the interpretation 

(3) r mhi = r mhi = e m -V i e h . (21.58) 

From the connection coefficients in turn, one calculates as in Chapter 14 the full 
Riemann curvature tensor <3) i?^ mn of the 3-geometry intrinsic to the hypersurface. 

Over and above the curvature intrinsic to the simultaneity, one now encounters 
a concept not covered in previous chapters (except fleetingly in Box 14.1), the 
extrinsic curvature of the 3-geometry. This idea has no meaning for a 3-geometry 
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5/i s -/f(S£>) 



9 + 5yP 


Figure 21.3. 

Extrinsic curvature measures the fractional shrinkage and deformation 
of a figure lying in the spacelike hypersurface I that takes place when 
each point in the figure is carried forward a unit interval of proper time 
“normal” to the hypersurface out into the enveloping spacetime. (No 
enveloping spacetime? No extrinsic curvature!) The extrinsic curvature 
tensor is a positive multiple of the unit tensor when elementary displace¬ 
ments 5£\ in whatever direction within the surface they point, all experi- 
encejhejame fractional shrinkage. Thus the extrinsic curvature of the 
hypersurface illustrated in the figure is positive. The dashed arrow repre¬ 
sents the normal vector n at the fiducial point 9 after parallel transport 
to the nearby point & 4- 8&. 


conceived in and by itself. It depends for its existence on this 3-geometry’s being 
imbedded as a well-defined slice in a well-defined enveloping spacetime. It measures 
the curvature of this slice relative to that enveloping 4-geometry (Figure 21.3). 

Take the normal that now stands at the point 9 and, “keeping its base in the 
hypersurface” 2, transport it parallel to itself as a “fiducial vector” to the point 
9 + 89, and there subtract it from the normal vector that already stands at that 
point. The difference, Sn 9 may be regarded in the appropriate approximation as a 
“vector,” the value of which is governed by and depends linearly on the “vector” 
of displacement 89. 

To obviate any appeal to the notion of approximation, go from the finite displace¬ 
ment 89 to the limiting concept of the vector-valued “displacement 1-form” d9 [see 
equation 15,13], Also replace the finite but not rigorously defined vector 8n by the 
limiting concept of a vector-valued 1-form dn. This quantity, regarded as a vector, 
being the change in a vector n that does not change in length, must represent a 
change in direction and thus stand perpendicular to n. Therefore it can be regarded 
as lying in the hypersurface I. Depending linearly on d9 , it can be represented 
in the form 


dn = ~K(d9). 


(21.59) 


Extrinsic curvature as an 
operator 


Here the linear operator K is the extrinsic curvature presented as an abstract coor¬ 
dinate-independent geometric object. The sign of K as defined here is positive when 
the tips of the normals in Figure 21.3 are closer than their bases, as they are, for 
example, during the recontraction of a model universe, in agreement with the 
conventions employed by Eisenhart (1926), Schouten (1954), and Amowitt, Deser 
and Misner (1962), but opposite to the convention of Israel (1966). 

Into the slots in the 1-forms that appear on the lefthand and righthand sides of 
(21.59), insert in place of the general tangent vector [which is to describe the general 
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local displacement, so far left open, as in the discussion following (2.12a)] a very 
special tangent vector, the basis vector e is for a displacement in the i-th coordinate 
direction. Thus find (21.59) reading 

(4) V,n = -*(«,) = (21.60) 

where the Kj are the components of the linear operator K in a coordinate represen¬ 
tation. Take the scalar product of both sides of (21.60) with the basis vector e m . 
Recall (e m * n) = 0. Thus establish the symmetry of the tensor K im , covariantly 
presented, in its two indices: 

K im = K i%m = *V(«j * « m ) = * < 4 >V i« 

= (/,-eo) ( 4 ) r°i = n-^V m e i = /r mi . 

L{see (21.55)] 


Gauss-Weingarten equation 
for 4-transport in terms of 
extrinsic curvature 

Knowing from this equation how each basis vector in 2 changes, one also knows 
how to rewrite (21.54) for the change in any vector field A that lies in 2. The change 
in both cases is expressed relative to a fiducial vector transported from a fiducial 
nearby point. By the term “parallel transport” one now means “parallel with respect 
to the geometry of the enveloping spacetime”: 

(4, v \A = A^e, + (21.63) 

Of special importance is the evaluation of extrinsic curvature when spacetime is 
sliced up into spacelike slices according to the plan of Arnowitt, Deser, and Misner 
as described in §21.4. The 4-geometry of the thin sandwich illustrated in Figure 
21.2, rudimentary though it is, is fully defined by the 3-metric on the two faces of 
the sandwich and by the lapse and shift functions N and N\ The normal in covariant 
representation according to (21.47) has the components 

(n 0 , n v n 2 , n 3 ) = (-AT, 0,0,0). (21.64) 

The change in n relative to “/» transported parallel to itself in the enveloping 
4-geometry,” according to the definition of parallel transport, is 


A knowledge of the tensor of extrinsic curvature assists in revealing the changes 
of the four vectors /i, e v e 2 » e 3 under parallel transport. Equation (21.60) already 
tells how n changes under parallel transport. The change of e m is to be read off 
from (21.55) as a vector. It is adequate identification of this vector to know its scalar 
product with each of four independent vectors: with the basis vectors e v e 2 , and 
e 3 , or, more briefly, with e s , in (21.58); and with the normal vector n in (21.61). 
Thus one arrives, following Israel (1966), at what are known as the equations of 
Gauss and Weingarten, in happy oversight of all change of notation in the interven¬ 
ing century: 

< 4 > V iej = K if -2- + «/>*: (21.62) 

1 ’ /i • n 


= n- (4) 7j 


(21.61) 
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{dn\ = n ik dx k 

= [§• - <4>r &«<,] <tx k (21.65) 

= N^r ( > k dx k 

Compare to the same change as expressed in terms of the extrinsic curvature tensor, 

(dn ) t = -K ik dx k . (21.66) 

Conclude that this tensor has the value 


K ik = ~*i;k = -N H) n k = -N[ i 4 ) g°°^r 0ik + Mg^r pik ], 

or, with the help of equations (21.42) and (21.44), 


K ik = (i /N)rr 0ik - N^r pik ] 


1 

IN 

1 

IN 


ldx k 


dN k _ Hik_ 

dx i a t 


+ N k}i — 


dgjfc l 

a t J 


ir. 


pik 



Extrinsic curvature in terms 
of shift and change of 
(21.67) 3-metric 


This is the extrinsic curvature expressed in terms of the ADM lapse and shift 
functions [Arnowitt, Deser, and Misner (1962)]. 

As an example, let ^ have the geometry of a 3-sphere Extrinsic curvature of 

, « expanding 3-sphere 

ds 2 = a 2 [dg 2 + sm 2 x (d0 2 + sin 2 6 d<j> 2 )]. ( 21 . 68 ) 

Let the nearby spacelike slice in the one-parameter family of slices, the slice with 
the label t + dt (only a label!) have a 3-metric given by the same formula with the 
radius a replaced by a + da. The 4-geometry of the thin sandwich between these 
two slices is completely undetermined until one gives the lapse and shift functions. 

For simplicity, take the shift vector N 1 (see Figure 21.2) to be everywhere zero and 
the lapse function at every point on I to have the same value N. The separation 
in proper time between the two spheres is thus dr = Ndt. Any geometric figure 
located in S expands with time. The fractional increase of any length in this figure 
per unit of proper time is the same in whatever direction that length is oriented, 
and has the value 


ffractional increase \ 

of length per unit j = — _JL 1 (21.69) 

^of proper time / a ^ a2 


The negative of this quantity, multiplied by the (\) unit tensor, 1 = gives the 
extrinsic curvature tensor in (J) representation, 

IN a 2 dt 


(21.70) 
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Basic forms for calculating 
4-curvature 


One confirms this result (exercise 21.5) by direct calculation of the components K{ 
using the ADM formula (21.67) as the starting point. 

The Riemann curvature R a bcd = {3) R a bcd intrinsic to the hypersurface 2, together 
with the extrinsic curvature K\p give one information on the Riemann and Einstein 
curvatures of the 4-geometry. In the calculation, it is not convenient to use the 
coordinate basis, 

basis vectors, basis 1-forms 
e o = 

e { = d i9 dx\ 

because ordinarily the basis vector e 0 does not stand perpendicular to the hypersur¬ 
face (see Figure 21.2). Adopt a different basis but one that is still self-dual, 

basis vectors, basis 1-forms, 

e n = n = N~\d t - N m dJ, w n = N dt = (n ■ n)n ? 

e i = d i9 of 1 ~ dx l + N { dt. I z 

Also use Greek labels a = n, 1, 2, 3, instead of Greek labels a = 0, 1, 2, 3, to list 
components. 

Recall that curvature is measured by the change in a vector on transport around 
a closed route; or, from equation (14.23), 

$(u, v)w - V u Y v w - V v V u w - (21.72) 


Let the vector transported be e i and let the route be defined by e ; and e k . The 
latter two vectors belong to a coordinate basis. Therefore the “route closes automati¬ 
cally”, [e^e k ] = 0, and the final term in (21.72) drops out of consideration. Call 
on (21.62) and (21.60) to find 


(4)y (4)y e _ (4)y 


■■ \ k ^ + 


- v n 

~ K ^J^nj 


— K K m e> 

ik > m (n-n) 


+ {3)r ilje m 


(21.73) 


+ “ rs K<^ + “ >r -> < 4 


Evaluate similarly the term with indices j and k reversed, subtract from (21.73), 
simplify, and find 


Gauss-Codazzi: 4-curvature in 
terms of intrinsic 3-geometry 
and extrinsic curvature 


*(«j. e k)«i = (K ik a ~ K m ) 


m ’ (n • n) 

+ [(/i • nrHKijK^ - K ik Kp + ™R m m ]e n 


(21.74) 


The coefficients give directly the desired components of the curvature tensor 


<4) R m m = (3) R m m + («* nTKKyKf - K ik Kp (21.75) 

and 

<4) *V = (« • n)-^R nijk =-(!!• n)-\K m - K mj ). (2 1.76) 
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Equations (21.75) and (21.76) are known as the equations of Gauss and Codazzi 
[for literature, see Eisenhart (1926)]. It follows from (21.75) that the components 
of the curvature of the 3-geometry will normally only then agree with the corre¬ 
sponding components of the curvature of the 4-geometry when the imbedding 
happens to be accomplished at the point under study with a hypersurface free of 
extrinsic curvature. The directly opposite situation is illustrated by the familiar 
example of a 2-sphere imbedded in a flat 3-space, where the lefthand side of (21.75) 
(with dimensions lowered by one unit throughout!) is zero, and the extrinsic and 
intrinsic curvature on the right exactly cancel. 

Important components of the Einstein curvature let themselves be evaluated from 
the Gauss-Codazzi results. In doing the calculation, it is simplest to think of e it e i 
and e k as being an orthonormal tetrad, n being itself already normalized and 
orthogonal to every vector in the hypersurface. Then, employing (14.7) and (21.75), 
one finds 


- G °0 = (4, * 12 12 + (4, * 23 23 + (4, * 31 31 

= m R 12 12 + ™R 23 23 + (3) /? 31 31 

+ (n • n)~\{K\K\ - K\K\) + {K\K\ - K\K\) 
+ (K\K\ - ATiATl)] 

= 1 R - 1 (II • /i) _1 [(Tr Kf - Tr (AC 2 )]. 


Einstein curvature in terms 
extrinsic curvature 


(21.77) 


Here R is the 3-dimensional scalar curvature invariant and Tr stands for “trace of”; 
thus, 

Tr /f = g*lK i$ = g %$ K}* = K\ (21.78) 

and 

Tr K 2 = {K 2 )\ = K t M Kj = Sjs K™ gmi K». (21.79) 


The result, though obtained in an orthonormal tetrad, plainly is covariant with 
respect to general coordinate transformations within the spacelike hypersurface; and 
it makes no explicit reference whatever to any time coordinate, in this respect 
providing a coordinate-free description of the Einstein curvature. 

The Einstein field equation equates (21.77) to 87 rp, where p is the density of 
mass-energy. Expression (21.77) is the “measure of curvature that is independent 
of how curved one cuts a spacelike slice.” This measure of curvature is central to 
the derivation of Einstein’s field equation that is sketched in Box 17.2, item 3, 
“Physics on a Spacelike Slice.” 

The other component of the Einstein curvature tensor that is easily evaluated by 
(14.7) from the results at hand has the form 


Equation (21.77) is the 
central Einstein equation, 
"mass-energy fixes 
curvature" 


G n _ n2 i2 + MR»\ 3 

~ ~( n * n ) 1 (^i|2 ~ ^2|1 + ^i|3 ~ ^3|l)> 


(21.80) 


when referred to an orthonormal frame. One immediately translates to a form valid 
for any frame e v e 2 , e 3 in the hypersurface, orthonormal or not, 

The other initial-value 

G* = -(#1 • - (Tr#0,J. (21.81) equation 
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The Einstein field equation equates this quantity to 877 times the Mh covariant 
component of the density of momentum carried by matter and fields other than 
gravity. 

The four components of the Einstein field equation so far written down will have 
a central place in what follows as “initial-value equations” of general relativity. The 
other six components will not be written out: ( 1 ) the dynamics lets itself be analyzed 
more simply by Hamiltonian methods; and (2) the calculation takes work. It demands 
that one evaluate the remaining type of object, S(e j 9 iv)e { . One step towards that 
calculation will be found in exercise 21.7. Sachs does the calculation (1964, equation 
10) but only after specializing to Gaussian normal coordinates. These coordinates 
presuppose a very special slicing of spacetime: ( 1 ) geodesics issuing normally from 
the spacelike hypersurface n — 0 cut all subsequent simultaneities n — constant 
normally; and ( 2 ) the n coordinate directly measures lapse of proper time, or proper 
length, whichever is appropriate,* along these geodesics. In coordinates so special 
it is not surprising that the answer looks simple: 


(4)r 


n 

ink 



ZK ik 

dn 


+ K im K” 


( Gaussian normal\ 
coordinates / 


(21.82) 


Additional terms come into (21.82) when one uses, instead of the Gaussian normal 
coordinate system, the coordinate system of Amowitt, Deser, and Misner. The ADM 
coordinates are employed here because they allow one to analyze the dynamics as 
one wants to analyze the dynamics, with freedom to push the spacelike hypersurface 
ahead in time at different rates in different places (“many-fingered time”). Fischer 
(1971) shows how to evaluate and understand the geometric content of such formulas 
in a coordinate-free way by using the concept of Lie derivative of a tensor field, 
an introduction to which is provided by exercise 21 . 8 . 

* Here Sachs’ equation (10) is generalized to the case where the unit normal n is not necessarily timelike. 
Sachs used n = d/dt. 


Exercise 21.4. SCALAR CURVATURE INVARIANT IN TERMS OF AREA DEFICIT 

It being 10,000 km from North Pole to equator, one would have 62,832 km for the length 
of the “equator” if the earth were flat, as contrasted to the actual —40,000 km, a difference 
reflecting the fact that the surface is curved up into closure. Turn from this “pre-problem” 
to the actual problem, a 3-sphere 

ds 2 = a 2 [d\ 2 + sin 2 x(d0 2 -f sin 2 # d<f> 2 )]. 

Measure off from x = 0 a 2-sphere of proper radius e = a\. Determine the proper area of 
this 2-sphere as a function of x- Verify that relation (21.50) on the area deficit gives in the 
limit e —*- 0 the correct result R = 6/a 2 . For a more ambitious exercise: (1) take a general 
(smooth) 3-geometry; (2) express the metric near any chosen point in terms of Riemann’s 
normal coordinates as given in §11.6; (3) determine the locus of the set of points at the 
proper distance e to the lowest interesting power of e in terms of the spherical polar angles 6 
and $ (direction of start of geodesic of length e); (4) determine to the lowest interesting 
power of e the proper area of the figure defined by these points; and thereby establish (21.50) 
[for more on this topic see, for example, Cartan (1946), pp. 252-256]. 
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Exercise 21.5. EXTRINSIC CURVATURE TENSOR FOR SLICE 
OF FRIEDMANN GEOMETRY 

Confirm the result (21.70) for the extrinsic curvature by direct calculation from formula 
(21.67). 

Exercise 21.6. EVALUATION OF «( ej , ejn 

Evaluate this quantity along the model of (21.74) or otherwise. How can it be foreseen that 
the coefficient of n in the result must vanish identically? Comparing coefficients of e m , find 
i4> R m nlk and test for equivalence to equation (21.76). 

Exercise 21.7. EVALUATION OF THE COMMUTATOR [ e> , /»] 

The evaluation of this commutator is a first step toward the calculation of a quantity like 
$(«,•,»)«,. Expressing e, as the differential operator d/dx 1 , use (21.49) to represent n also 
as a differential operator. In this way, show that the commutator in question has the value 
— ( Nj/N)n - (N m JNle m ._ 

Exercise 21.8. LIE DERIVATIVE OF A TENSOR (exercise provided by J. W. York, Jr.) 

Define the Lie derivative of a tensor field and explore some of its properties. The Lie 
derivative along a vector field n is a differential operator that operates on tensor fields T 
of type G)> converting them into tensors £ n T, also of type G). The Lie differentiation process 
obeys the usual chain rule and has additivity properties [compare equations (10.2b, 10.2c, 
10.2d) for the covariant derivative]. For scalar functions/ one has /»[/] =■ fyi*. The 

Lie derivative of a vector field u along a vector field v was defined in exercise 9.11 by 

£ u v = [ u , *]. 

If the action of £„ on 1-forms is defined, the extension to tensors of general type will be 
simple, because the latter can always be decomposed into a sum of tensor products of vectors 
and 1-forms. If a is a 1-form and v is a vector, then one defines £„a to be that 1-form 
satisfying 

= "[(O, I')] ~ <0\ [/>, *]> 

for arbitrary v. 

(a) Show that in a coordinate basis 

£ n a ~ (°a,/3 nli + °/3 n ? a )dX a - 

(b) Show that in a coordinate basis 

£ n T = ( T ap,n nl1 + T nf3 nll ,a + T a l t n> ‘,i 3 ) ® dx 13 

where T is of type (!)). 

(c) Show that in (a) and (b), all partial derivatives can be replaced by covariant derivatives. 
[Observe that Lie differentiation is defined independently of the existence of an affine 
connection. For more information, see, for example. Bishop and Goldberg (1968) and 
Schouten (1954)]. 

Exercise 21.9. EXPRESSION FOR DYNAMIC COMPONENTS OF THE 

CURVATURE TENSOR (exercise provided by J. W. York, Jr.) 

The Gauss-Codazzi equations can be viewed as giving 14 of the 20 algebraically independent 
components of the spacetime curvature tensor in terms of the intrinsic and extrinsic geometry 
of three-dimensional (non-null) hypersurfaces. In order to accomplish a space-plus-time 
splitting of the Hilbert Lagrangian one must express, in addition, the remaining 
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6 components of the curvature tensor in an analogous manner. It is convenient for this 
purpose to express all tensors as spacetime tensors, and to use Lie derivation in the direction 
of the timelike unit normal field of the spacelike hypersurfaces as a generalized notion of 
time differentiation. A number of preliminary results must be proven: 

^ ^uS/iv “h 

(b ) £ u(gn, + «„«,) = 

= W M;r + + V», + a^u,, 

where is the metric of the spacelike hypersurface, expressed in the spacetime coordinate 
basis, and a" = « x V x a" is the curvature vector (4-acceleration) of the timelike normal curves 
whose tangent field is (Recall that i^a" = 0.) 

(c) Prove that the extrinsic curvature tensor is given by 

K - -Ijf 

2 fit'' ___ 

(d) The unit tensor of projection into the hypersurface is defined by 

-L{f = + u*u v . 

In terms of X show that one can write 

U a;p — ~ Kctp °^ a f3 "" 

where 

Kap ~ -I-a -I- 

and 

(c) From the fact that is the unit normal field for a family of spacelike hypersurfaces 
show that w a p = 0. 

(f) The needed tools are now on hand. To obtain the result: 

(i) Write down £ U K^ (see exercise 21.8); 

(ii) Insert this expression into the Ricci identity in the form 

V- + 

(iii) P ro j ec t the two remaining free indices into the hypersurface using X, and show that 
one obtains 

= £ u K a$ + K ay Kl 

+ <3)V (a a /j) + a a a p , 

where (3) V a a jS = X J X £ V^a, can be shown to be the three-dimensional covariant derivative 
of a p . In Gaussian normal coordinates, show that one obtains from this result 

R oio> = ~ t R ij + R ik K k y 

(g) Finally, in the construction of (4) R, one needs to show that 

YH® V„> + «„«,] = £'"’[ (3) V ( „fl, ) + a^] = a x x . 
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Exercise 21.10. EXPRESSION OF <4) /? i I>in IN TERMS OF EXTRINSIC 
CURVATURE, PLUS A COVARIANT DIVERGENCE 
(exercise provided by K. Kuchar) 

Let a' be an arbitrary smooth set of four coordinates, not necessarily coordinated in any 
way with the choice of the 1-parameter family of hypersurfaces. 

(a) Show that 

i4)Ri nin = g a ' YnP ‘(n a -;FY ~ 

(b) Show that the covariant divergences 

and 

can be removed from this expression in such a way that what is left behind contains only 
first derivatives of the unit normal vector n. 

(c) Noting that the basis vectors e { and n form a complete set, justify the formula 

g -f (n • n)~ l n f3 'n fi \ 

where w* is the 1-form dual to 

(d) Noting that n a >.^n a ' = 0 and 


K a = ~ e i< 




show that 


(4) /?* nin = (Tr K) 2 — Tr K 1 plus a covariant divergence. 


§21.6. THE HILBERT ACTION PRINCIPLE AND 

THE ARNOWITT-DESER-MISNER MODIFICATION 
THEREOF IN THE SPACE-PLUS-TIME SPLIT 


For analyzing the dynamics, it happily proves unnecessary to possess the missing 
formula for {4) R n ink . It is essential, however, to have the Lagrangian density, 


16^ geom = (~ i4) g) 1/2(4 % (21.83) 

in the Hilbert action principle as the heart of all the dynamic analysis. In the present 
ADM (1962) notation, this density has the form 


(- (4) g) 1/2(4> * = (- <4, g) 1/2 [ <4) ^\ + 2<4) * in »n] 

= (~ (4> s) 1/2 [* + (#i * n)(Tr K 2 - (Tr Kf) + 2(n • /i)^^]. 


(21.84) 


Kuchar (1971b; see also exercise 21.10) shows how to calculate a sufficient part of 
this quantity without calculating all of it. The difference between the “sufficient part” 
and the “whole” is a time derivative plus a divergence, a quantity of the form 


Drop a complete derivative 
from the Hilbert action 
principle to get the ADM 
principle 


l(~ U) g) 1/2 A% = (-Mg) 


(21.85) 
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When one multiplies (21.83) by dt dx 1 dx 2 dx 3 and integrates to obtain the action 
integral, the term (21.85) integrates out to a surface term. Variations of the geometry 
interior to this surface make no difference in the value of this surface term. Therefore 
it has no influence on the equations of motion to drop the term (21.85). The result 
of the calculation (exercise 21.10) is simple: what is left over after dropping the 
divergence merely changes the sign of the terms in Tr/f 2 and (Tr/f) 2 in (21.84). 
Thus the variation principle becomes 

(extremum) = / modjfied = f £ modifjed d 4 x 

r r (21.86) 

= 0/16ir) J [R + (n • n)((Tr Kf - Tr K 2 )]Ng" 2 dt d 3 x + J £ fieIds d*x. 

This expression, rephrased, is the starting point for Arnowitt, Deser, and Misner’s 
analysis of the dynamics of geometry. 

Two supplements from a paper of York (1972b; see also exercise 21.9) enlarge one’s 
geometric insight into what is going on in the foregoing analysis. First, the tensor 

of extrinsic curvature lets itself be defined [see also Fischer (1971)] most naturally 
in the form 

K =-~* n 9, (21.87) 

where g is the metric tensor of the 3-geometry, n is the timelike unit normal field, 
and £ is the Lie derivative as defined in exercise 21.8. Second, the divergence (21.85), 
which has to be added to the Lagrangian of (21.86) to obtain the full Hilbert 
Lagrangian, is 

~2((— <4) g) 1/2 (/2“'Tr K + a a ')] a ., (21.88) 

where the coordinates are general (see exercise 21.10), and 

a a ' = n a '. p .n^ (21.89) 

is the 4-acceleration of an observer traveling along the timelike normal n to the 
successive slices. 


§21.7. THE ARNOWITT, DESER, AND MISNER FORMULATION 
OF THE DYNAMICS OF GEOMETRY 

Dirac (1959, 1964, and earlier references cited therein) formulated the dynamics of 
geometry in a (3 + 1 )-dimensional form, using generalizations of Poisson brackets 
and of Hamilton equations. Arnowitt, Deser, and Misner instead made the Hilbert- 
Palatini variational principle the foundation for this dynamics. Because of its sim¬ 
plicity, this ADM (1962) approach is followed here. The gravitational part of the 
integrand in the Hilbert-Palatini action principle is rewritten in the condensed but 
standard form (after inserting a 16w that ADM avoid by other units) as 

16w ^geomtrue = ^geomADM = Sij W/dt -NX - N f X* 

- 2 [y>Ar. - y N* Tr n + A^g) 1 ' 2 j 


(21.90) 
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Here each item of abbreviation has its special meaning and will play its special part, 
a part foreshadowed by the name now given it: 


77 


»; — 
true 


S(action) 

hii 


/“geometrodynamic \ 
field momentum” dyn¬ 
amically conjugate to 
the “geometrodynamic 
yield coordinate” I 


= J^ ;irii = S 1/2 (g ii TiK-K i >) 

(21.91) 


(here the of ADM is usually more convenient than 7 rjj ue ); and 


Momenta conjugate to the 
dynamic g {j 


^true = e>&i;) = (“super-Hamiltonian”) = ^'/16w; 

W'.Sy) = g~ 1/2 (Tr n 2 - i(Tr/7) 2 ) - g^R¬ 


(21.92) 


and 


1677 ^{rue = 0C X = OCXn* 1 , g {j ) = (“supermomentum”) = — 2-jr ik ik . (21.93) 

Here the covariant derivative is formed treating as a tensor density, as its 

definition in (21.91) shows it to be (see §21.2). The quantities to be varied to 
extremize the action are the coefficients in the metric of the 4-geometry, as follows: 
the six g tj and the lapse function A and shift function Np, and also the six “geome¬ 
trodynamic momenta,” ir'i To vary these momenta as well as the metric is (1) to 
follow the pattern of elementary Hamiltonian dynamics (Box 21.1), where, by taking 
the momentum p to be as independently variable as the coordinate x, one arrives 
at two Hamilton equations of the first order instead of one Lagrange equation of 
the second order, and (2) to follow in some measure the lead of the Palatini variation 
principle of §21.2. There, however, one had 40 connection coefficients to vary, 
whereas here one has come down to only six tt'K To know these momenta and the 
3-metric is to know the extrinsic curvature. Before carrying out the variation, drop 
the divergence — 2[ ] ^ from (21.90), since it gives rise only to surface integrals 

and therefore in no way affects the equations of motion that will come out of the 
variational principle. Also rewrite the first term in (21.90) in the form 


-(3/30(gyW 1 ») + 7r« dg^/dt, (21.94) 

and drop the complete time-derivative from the variation principle, again because 
it is irrelevant to the resulting equations of motion. The action principle now takes 
the form 

extremum = 4 ue = 7 adm /16t7 

= (1/1 677 ) f [77 Vdgy/dt - NX(v«g v ) - NiX H^gij)] d*x 

+ f f field d 4 x. (21.95) 

The action principle itself, here as always, tells one what must be fixed to make 
the action take on a well-defined value (if and when the action possesses an extre¬ 
mum). Apart from appropriate potentials having to do with fields other than geom- 
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Action principle says, fix 
3-geometry on each face of 
sandwich 


What a 3-geometry is 


Electromagnetism gives 
example of momentum 
conjugate to "field 
coordinate" 


etry, the only quantities that have to be fixed appear at first sight to be the values 
of the six g.. on the initial and final spacelike hypersurfaces. However, the ADM 
action principle is invariant with respect to any change of coordinates x\ x 2 , x 3 
—x 1 , x 2 , x 3 within the successive spacelike slices. Therefore the quantities that 
really have to be fixed on the two faces of the sandwich are the 3-geometries (3) i>' 
(on the initial hypersurface) and {3) h (on the final hypersurface) and nothing more. 

In mathematical terms, a 3-geometry (3) h is the “equivalence class” of a set of 
differentiable manifolds that are isometrically equivalent to each other under diffeo- 
morphisms. In the terms of the everyday physicist, a 3-geometry is the equivalence 
class of 3-metrics g^(x 9 y 9 z) that are equivalent to one another under coordinate 
transformations. In more homely terms, two automobile fenders have one and the 
same 2-geometry if they have the same shape, regardless of how much the coordinate 
rulings painted on the one may differ from the coordinate rulings painted on the 
other. 

To have in equation (21.95) an example of a field Lagrangian that is at the same 
time physically relevant and free of avoidable complications, take the case of a 
source-free electromagnetic field. It would be possible to take the field Lagrangian 
to have the standard Maxwell value, 

(1/8* t)(E 2 - B 2 ) —-(1/16 v)F„F*', (21.96) 

with 

Fnv = dA p /dx* - dAJdx v . (21.97) 

The variation of the Lagrangian with respect to the independent dynamic variables 
of the field, the four potentials A a , would then immediately give the four second-order 
partial differential wave equations for these four potentials. However, to have instead 
a larger number of first-order equations is as convenient for electrodynamics as it 
is for geometrodynamics. One seeks for the analog of the Hamiltonian equations 
of particle dynamics, 


dx/dt = dH(x,p)/dp i 

(21.98) 

dp/dt = —3 H(x,p)/dx. 

One gets those equations by replacing the Lagrange integral JX(x, x) dt by the 
Hamilton integral j[px — H(x,p)]dt. Likewise, here one replaces the action inte¬ 
grand of (21.96) by what in flat spacetime would be 

(1/4*) +1^^]. (21.99) 

In actuality, spacetime is to be regarded as not only curved but also sliced up into 
spacelike hypersurfaces. This (3 + 1) split of the geometry made it desirable to split 
the ten geometrodynamic potentials into the six g i; - and the four lapse and shift 
functions. Here one similarly splits the four A M into the three components A i of the 
vector potential and the scalar potential A 0 = — <£ (with the sign so chosen that, 
in flat spacetime in a Minkowski coordinate system, <f> = A 0 ). In this notation, the 
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Lagrange density function, including the standard density factor (- <4> g) 1/2 but 
dropping a complete time integral (d/dtXAfS*) that has no influence on the equations 
of motion, is given by the formula 


Afield = -* 1 3A { /dt + 

~ J A r g _1/2 g i ,(£ 1 £’ + «*#*) + N'[ijk]S u 3 k . 


( 21 . 100 ) 


Here use is made of the alternating symbol [ijk], defined as changing sign on the 
interchange of any two labels, and normalized so that [123] = 1. Note that the 
3-tensor e tjk and the alternating symbol [ijk] are related much as are the correspond¬ 
ing four-dimensional objects in equation (8.10), so that one can write 

A jk ). (21.101) 

The quantities are the components of the magnetic field in the spacelike slice. 
They are not regarded as independently variable. They are treated as fully fixed 
by the choice of the three potentials The converse is the case for the components 
of the electric field: they are treated like momenta, and as independently variable. 
Extremizing the action with respect to the (exercise 21.11) gives the analog 
of the equation dx/dt = p/m in particle mechanics, and the analog of the equation 

E i = -dAJdt - d<t>/dx l (21.102) 

of flat-spacetime electrodynamics; namely, 

-dAi/dt - - Ng~ 1/2 gij S* - \ijk]N^3 h = 0. (21.103) 

Here the last term containing the shift functions N\ arises from the obliquity of 

the coordinate system. ADM give the following additional but equivalent ways to 
state the result (21.103): 

& = * F ik 

i (i i ( 2U04 ) 

= j [ijk] [y [yv](- (4, s) 1/2, V"(V^) • 

They note that and are not directly the contravariant components of the fields 
in the simultaneity 

E= E% B = B% (21.105) 

but the contravariant densities, 

= g i/2 E i 9 cp = g i/2 B j (21.106) 

Extremizing the action with respect to the three A x (exercise 21.12) gives the 
curved-spacetime analog of the Maxwell equations, 


Lagrange density for 
electromagnetism 


The initial-value equation of 
electromagnetism 
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Divergence relation by 
extremization with respect 
to <> 


The remaining potential, <>, enters the action principle at only one point. Extre- 
mizing with respect to it gives immediately the divergence relation of source-free 
electromagnetism. 


Action principle tells what to 
fix at limits 


At limits, fix not potentials 
but magnetic field itself 


= 0- (21.108) 

If an action principle tells in and by itself what quantities are to be fixed at the 
limits, what lessons does (21.100) give on this score? One can go back to the example 
of particle mechanics in Hamiltonian form, as in Box 21.1, and note that there the 
momentum p could “flap in the breeze." Only the coordinate x had to be fixed at 
the limits. Thus the variation of the action was 

81 = 8 f [px - H(x, p)} dt (2 1.109) 

= /([•*- ZH/dp] dp + ( d/dt){p 8x) + [-p - dH/dx] 8x) dt. 

To arrive at a well-defined extremum of the action integral /, it was not enough 
to annul the coefficients, in square brackets, of 8p and fix; that is, to impose Hamil¬ 
ton’s equations of motion. It was necessary in addition to annul the quantities at 
limits, p 8x; that is, to specify x at the start and at the end of the motion. Similarly 
here. The quantities <p and S' flap in the breeze, but the magnetic field has to be 
specified on the two faces of the sandwich to allow one to speak of a well-defined 
extremum of the action principle. Why the magnetic field, or the three quantities 


dAj/dx' - dAfdx’; (21.110) 

why not the three A { themselves? When one varies (21.100) with respect to the A t , 
and integrates the variation of the first term by parts, as one must to arrive at the 
dynamic equations, one obtains a term at limits 


J f S { 8A { d 3 x — J S i 8A i d 3 x. (21.111) 

final 

One demands that both these terms at limits must vanish in order to have a well- 
defined variational problem. Go from the given vector potential to another vector 
potential, ^4 inew , by the gauge transformation 

A i m = A i + 8A i =A ( + d\/dx\ (21.112) 

The magnetic-field components given by the three differ in no way from those 
listed in (21.110). Moreover the “variation at limits,”” 

fs i 8A i d 3 x = fs i d\/dx i d 3 x = - f\S i ii d 3 x, (21.113) 

is automatically zero by virtue of the divergence condition (21.108), for any arbitrary 
choice of A. Therefore the quantities fixed at limits are not the three A { themselves 
(mere potentials) but the physically significant quantities (21.110), the components 
of the magnetic field. Moreover, the divergence condition S' { = 0 now becomes 
the initial-value equation for the determination of the potential 6. 
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In the preceding paragraph one need only replace “the three A” by “the six 
and “the components of the magnetic field” by “the 3-geometry (3> &” and “the 
potential <£” by “the lapse and shift functions N and jV*” to pass from electro¬ 
dynamics to geometrodynamics. 

With this parallelism in view, turn back to the variational principle (21.95) of 
general relativity in the ADM formulation. With the 3-geometry fixed on the two 
faces of the sandwich, vary conditions in between to extremize the action, varying 
in turn the i7 x \ the g i? -, and the lapse and shift functions. The geometrodynamic 
momenta appear everywhere only algebraically in the action principle, except in 
the term — IN^y. Variation and integration by parts gives Collecting 

coefficients of and annuling the sum of these coefficients, one arrives at one 
of the several conditions required for an extremum, 

d gij /dt = 2 Ngr v * (*« - jgu Tm) + N iii + N f[i . (21.114) 

This result agrees with what one gets from equations (21.91) defining geometrody¬ 
namic momentum in terms of extrinsic curvature, together with expression (21.67) 
for extrinsic curvature in terms of lapse and shift. The result (21.114) here is no 
less useful than the result 


dx/dt — dH(x,p)/dp — p/m 


in the most elementary problem in mechanics: it marks the first step in splitting 
a second-order equation or equations into twice as many first-order equations. 

Now vary the action with respect to the and again, after appropriate integration 
by parts and rearrangement, find the remaining first-order dynamic equations of 
general relativity [simplified by use of equations (21.116) and (21.117)], 


drr^/dt = - 


Ng 1/2 (^RH _ 1^7?) +L Ng -m g a 
2Ng~ V2 — j w’J'Tr n^j 




+ g 1/2 (X {ii ~ + (^ m ) , m 


(21.115) 


- N* 


\m 


• 77 ” 


- N j 


\m 1 


+ 


"source terms arising from fields 
other than geometry, omitted here for 
simplicity, but discussed by ADM (1962) 




Finally extremize the action (21.95) with respect to the lapse function N and the 
shift functions and find the four so-called initial-value equations of general 
relativity, equivalent to (21.77) and (21.81) or to G“ = 8 ttT"; thus, 


ADM principle reproduces 
formula for geometrodynamic 
momentum 


Dynamic and initial-value 
equations out of ADM 
formalism 


-(l/167r)af( W « gij ) = 

— (1/16 ff)3f’(7r y , g if ) = — (1 /4tt )[ijk\S’(R k . 


(21.116) 

(21.117) 
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Exercise 21.11. FIRST EXPLOITATION OF THE ADM VARIATIONAL PRINCIPLE 
FOR THE ELECTROMAGNETIC FIELD 

Extremize the action principle (21.100) with respect to the S 1 and derive the result (21.103). 

Exercise 21.12. SECOND EXPLOITATION OF THE ADM VARIATIONAL 
PRINCIPLE FOR THE ELECTROMAGNETIC FIELD 

Extremize (21.100) with respect to the A i% and verify that the resulting equations in any 
Minkowski-flat region are equivalent to (21.107). 

Exercise 21.13. FARADAY-MAXWELL SOURCE TERM IN THE DYNAMIC 
EQUATIONS OF GENERAL RELATIVITY 

Evaluate the final indicated source terms in (21.115) from the Lagrangian (21.100) of Maxwell 
electrodynamics, regarded as a function of the A- and the 

* ol)’ 

Exercise 21.14. THE CHOICE OF <f> DOESN'T MATTER 

Prove the statement in the text that the dynamic development of the electric and magnetic 
fields themselves is independent of the choice made for the scalar potential x,y, z) in 
the analysis (a) in flat spacetime in Minkowski coordinates and (b) in general relativity 
according to equations (21.103), and (21.107) as generalized in exercise 21.12. 

Exercise 21.15. THE CHOICE OF SLICING OF SPACETIME DOESN'T MATTER 

Given a metric {3) g tj (x,y, z) and an extrinsic curvature K%x,y, z) on a spacelike hypersurface 
T, and given that these quantities satisfy the initial-value equations (21.116) and (21.117), 
and given two alternative choices for the lapse and shift functions (TV, TV,) and (TV+ SN, 
TV, + STV,), show that the curvature itself (as distinguished from its components in these two 
distinct coordinate systems), as calculated at a point 9 a “little way” (first order of small 
quantities) off the hypersurface, by way of the dynamic equations (21.114) and (21.115), is 

independent of this choice of lapse and shift. 


§21.8. INTEGRATING FORWARD IN TIME 

In the Hamiltonian formalism of Arnowitt, Deser, and Misner [see also the many 
papers by many workers on the quantization of general relativity-primarily putting 
Einstein’s theory into Hamiltonian form—cited, for example, in references 1 and 
2 of Wheeler (1968)], the dynamics of geometry takes a form quite similar to the 
Hamiltonian dynamics of geometry. There one gives x and p at a starting time and 
integrates two first-order equations for dx/dt and dp/dt ahead in time to find these 
dynamically conjugate variables at all future times. Here one gives appropriate values 
°f Sa and 177X1 over an initial spacelike hypersurface and integrates the two first-order 
equations (21.114) and (21.115) ahead in time to find the geometry at future times. 
For example, one can rewrite the differential equations as difference equations 
according to the practice by now familiar in modern hydrodynamics, and then carry 
out the integration on an electronic digital computer of substantial memory capacity. 
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Time in general relativity has a many-fingered quality very different from the 
one-parameter nature of time in nonrelativistic particle mechanics [see, however, 
Dirac, Fock, and Podolsky (1932) for a many-time formalism for treating the relati¬ 
vistic dynamics of a system of many interacting particles]. He who is studying the 
geometry is free to push ahead the spacelike hypersurface faster at one place than 
another, so long as he keeps it spacelike. This freedom expresses itself in the lapse 
function N(t,x,y,z) at each stage, t, of the integration. Equations (21.114) and 
(21.115) are not a conduit to feed out information on N to the analyst. They are 
a conduit for the analyst to feed in information on N. The choice of A is to be made, 
not by nature, but by man. The dynamic equations cannot begin to fulfill their 
purpose until this choice is made. The “time parameter” t is only a label to distinguish 
one spacelike hypersurface from another in a one-parameter family of hypersurface; 
but N thus tells the spacing in proper time, as it varies from place to place, between 
the successive slices on which one chooses to record the time-evolution of the 
geometry. A cinema camera can record what happens only one frame at a time, 
but the operator can make a great difference in what that camera sees by his choice 
of angle for the filming of the scene. So here, with the choice of slicing. 

Another choice is of concern to the analyst, especially one doing his analysis on 
a digital computer. He is in the course of determining, via (21.114-21.115) written 
as difference equations, what happens on a lattice work of points, typified by 
x = ... , 73,74, 75, 76, 77,..., etc. He finds that the curvatures are developing most 
strongly in a localized region in the range around x = 83 to x = 89. He wants to 
increase the density of coverage of his tracer points in this region. He does so by 
causing points at lesser and greater x values to drift into this region moment by 
moment as t increases: t — ..., 122, 123, 124,.... He makes the tracer points at 
lesser x-values start to move to the right (N x positive) and points at greater x-values 
move to the left (A t negative). In other words, the choice of the three shift functions 
Ni(t,x,y, z) is just as much the responsibility of the analyst as is the choice of the 
lapse function N. The equations will never tell him what to pick. He has to tell the 
equations. 

These options, far from complicating dynamic equations (21.114-21.115), make 
them flexible and responsive to the wishes of the analyst in following the course 
of whatever geometrodynamic process is in his hands for study. 

The freedom that exists in general relativity in the choice of the four functions 
N, N t , is illuminated from another side by comparing it with the freedom one has 
in electrodynamics to pick the one function <j>(t, x,y, z), the scalar potential. In no 
way do the dynamic Maxwell equations (21.103) and (21.107), as generalized in 
exercise 21.12 determine Instead they demand that it be determined (by the 
analyst) as the price for predicting the time-development of the vector potential A t . 
An altered choice of x,y, z) in its dependence on position and time means altered 
results from the dynamic equations for the development of the three A i in time and 
space. However, the physically significant quantities, the electric and magnetic fields 
themselves on successive hypersurfaces, come out the same (exercise 21.14) regardless 
of this choice of <£. Similarly in geometrodynamics: an altered choice for the four 


Lapse and shift chosen to. 
push forward the integration 
in time as one finds most 
convenient 


Same 4-geometry regardless 
of lapse and shift options 
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F 


tz I-=g l 




Figure 21.4. 

Some of the many ways to make distinct spacelike slices through 
one and the same (4) £), the complete Schwarzschild 4-geometry, 


functions N, means (a) an altered laying down of coordinates in spacetime, and 
therefore (b) altered results for the intrinsic metric (3) g i; . and extrinsic curvature K ij 
of successive spacelike hypersurfaces, but yields the same 4-geometry {4) h (Figure 
21.4) regardless of this choice of coordinatization (exercise 21.15). 


§21.9. THE INITIAL-VALUE PROBLEM IN 

THE THIN-SANDWICH FORMULATION 


Initial-value data: what is 
freely disposable? and what 
is thereby fixed? 


Given appropriate initial-value data, one can integrate the dynamic equations ahead 
in time and determine the evolution of the geometry; but what are “appropriate 
initial-value data”? They are six functions {3) g {j (x,y, z) plus six more functions 
?r tJ (x, 7 , z ) or that together satisfy the four initial-value equations (21.116) 

and (21.117). To be required to give coordinates and momenta accords with the 
familiar plan of Hamiltonian mechanics; but to have consistency conditions or 
constraints imposed on such data is less familiar. A particle moving in two-dimen¬ 
sional space is catalogued by coordinates x,y, and coordinates p x ,p y ; but a particle 
forced to remain on the circle x 2 + y 2 = a 2 satisfies the constraint xp x -{- yp y = 0. 
Thus the existence of a “constraint” is a signal that the system possesses fewer degrees 
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of freedom than one would otherwise suppose. Fully to analyze the four “initial- 
value or constraint ’ conditions (21.116) and (21.117) is thus to determine (1) how 
many dynamic degrees of freedom the geometry possesses and (2) what these degrees 
of freedom are; that is to say, precisely what “handles” one can freely adjust to 
govern completely the geometry and its evolution with time. The counting one can 
do today, with the conclusion that the geometry possesses the same count of true 
degrees of freedom as the electromagnetic field. The identification of the “handles,” 
or freely adjustable features of the dynamics, is less advanced for geometry than 
it is for electromagnetism (Box 212), but most instructive so far as it goes. 

By rights the identification of the degrees of freedom of the field, whether that 
of Einstein or that of Faraday and Maxwell, requires nothing more than knowing 
what must be fixed on initial and final spacelike hypersurfaces to make the appro¬ 
priate variation principle well-defined. One then has the option whether (1) to give 
that quantity on both hypersurfaces or (2) to give that quantity and its dynamic 
conjugate on one hypersurface or (3) to give the quantity on both hypersurfaces, 
as in (1), but go to the limit of an infinitely thin sandwich, so that one ends up 
specifying the quantity and its time rate of change on one hypersurface. This third 
“thin sandwich” procedure is simplest for a quick analysis of the initial-value 
problem in both electrodynamics and geometrodynamics. Take electrodynamics first, 
as an illustration. 

Give the divergence-free magnetic field and its time-rate of change: on an arbitrary 
smooth spacelike hypersurface in curved spacetime in the general case; on the 
hypersurface t = 0 in Minkowski spacetime in the present illustrative treatment, 

®\0,x,y,z) given, (21.118) 

® *(0, y, z) = also given. (21.119) 

These quantities together contain four and only four independent data per space 

point. How is one now to obtain the momenta n* ~-so that one can start 

integrating the dynamic equations (21.103) and (21.107) forward in time? (1) Find 
a set of three functions A { (0,x,y,z) such that their curl gives the three specified 
That this can be done at all is guaranteed by the vanishing of the divergence 
However, the choice of the A t is not unique. The new set of potentials A inew = 
A i + dX/dx' with arbitrary smooth X, provide just as good a solution as the original 
A y No matter. Pick one solution and stick to it. (2) Similarly, find a set of three 
Ai(0,x,y,z) such that their curl gives the specified £@*(0, x,y, z), and resolve all 
arbitrariness of choice by fiat. (3) Recall that the electric field (negative of the field 
momentum) is given by 


$i= -A i - df/dx* (21.120) 

(formula valid without amendment only in flat space). The initial-value or constraint 
equation { = 0 translates to the form 

= - v ‘’ A if j - (21.121) 


In electromagnetism, give 
magnetic field and its rate 
change as initial data 



Box 21.2 COUNTING THE DEGREES OF FREEDOM OF THE ELECTROMAGNETIC FIELD 


A. First Approach: Number of 
"Field Coordinates" per 
Spacepoint 

Superficial tally of the degrees of freedom of the 
source-free electromagnetic field gives three field 
coordinates A t (x,y, z) per spacepoint on the initial 
simultaneity I, plus three field momenta ^ = 
w ‘/4w [with ** = -£i(x,y,z)] per spacepoint 
Closer inspection reveals that the number of 
coordinate degrees of freedom per spacepoint is 
not three but two. Thus the change in vector po¬ 
tential A f *- A i + d\/cx { makes no change in 
the actual physics, the magnetic field components, 

Bi = j[ijk](dA k /dx’ - dAj/dx*). 

Moreover, though those components are three in 
number, they satisfy one condition per spacepoint, 
® ~~ thus reducing the effective net number 

of coordinate degrees of freedom per spacepoint 
to two. 

The momentum degrees of freedom per space- 
point are likewise reduced from three to two by 
the one condition per spacepoint . = 0. 

B. Alternative Approach: 

Count Fourier Coefficients 

In textbooks on field theory [see, for example, 
Wentzel (1949)], attention focuses on flat space- 
time. The electromagnetic field is decomposed by 
Fourier analysis into individual running waves. 
Instead of counting degrees of freedom per point 
in coordinate space, one does the equivalent: 
counts up degrees of freedom per point in wave- 
number space. Thus for each (k x , k y9 k z ), there are 
two independent states of polarization. Each state 
of polarization requires for its description an am¬ 
plitude (“coordinate”) and time-rate of change of 
amplitude (“momentum”) at the initial time, 

Thus the number of degrees of freedom per point 
in wave-number space is two for coordinates and 
two for momenta, in accord with what one gets 
by carrying out the count in coordinate space. 


In curved spacetime, Fourier analysis is a less 
convenient way of identifying the degrees of free¬ 
dom of the electromagnetic field [for such a Fou¬ 
rier analysis, see Misner and Wheeler (1957), es¬ 
pecially their Table X and following text] than 
direct analysis in space, as above. 

C. Another Alternative: 

Analyze “Deformation of 
Structure" 

Still a third way to get a handle on the degrees 
of freedom of a divergence-free field, whether 8 
or (B , rests on the idea of deformation of structure 
[diagram from Wheeler (1964)]. Represent the 



magnetic field by Faraday's picture of lines of 
force (a) continuing through space without ever 
ending, automatic guarantee that is every¬ 
where zero. Insert “knitting needles” (b) into the 
spaghetti-like structure of the lines of force and 
move these needles as one will. Sliding the “knit¬ 
ting needles” along a line of force causes no move¬ 
ment of the line of force, (c) With the help of two 
knitting needles perpendicular to each other and 
to the line of force, one can give any given line 
of force any small displacement one pleases per¬ 
pendicular to its length: again two degrees of free¬ 
dom per spacepoint. Granted any non-zero field 
to begin with, no matter how small, one can build 
it up by a sequence of such small deformations 
to agree with any arbitrary field pattern of zero 
divergence, no matter what its complexity and 
strength may be. 
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Solve for <£. Then (4) equation (21.120) gives the initial-value electric field, or 
electrodynamic field momentum n' ~~ — <0, required (along with the field coordinate 
for starting the integration of the dynamic equations (21.103) and (21.107). 
[Misner and Wheeler (1957) deal with the additional features that come in when 
the space is multiply connected. Each wormhole or handle of the geometry is able 
to trap electric lines of force. The flux trapped in any one wormhole defines the 
classical electric charge q w associated with that wormhole. One has to specify all 
these charges once and for all in addition to the data (21.118) and (21.119) in order 
to determine fully the dynamic evolution of the electromagnetic field. There is no 
geometrodynamic analog to electric charge, according to Unruh (1971).] (5) In this 
integration, the scalar potential 4> at each subsequent time step is not to be calculated; 
it is to be chosen. Only when one has made this free choice definite do the dynamic 
equations come out with definite results for the and the m* or 0 at these successive 
steps. 

In the thin-sandwich formulation of the initial-value problem of electrodynamics, 
to summarize, one gives and $ { (equivalent to (B on two nearby hypersurfaces). 
One chooses the A { and A i with much arbitrariness to represent these initial-value 
data. The arbitrariness having been seized on to give the initial A { and A t , there 
is no arbitrariness left in the initial <f>. However, at all subsequent times the situation 
is just the other way around. All the arbitrariness is sopped up in the choice of the 
<t>, leaving no arbitrariness whatever in the three A { (as given by the integration of 
the dynamic equation). 

The situation is quite similar in geometrodynamics. One gives the beginnings of 
a 1-parameter family of spacelike hypersurfaces; namely, 


Scalar potential: fixed at 
start; freely disposable later 


In ADM treatment, give 
3-geometry and its time-rate 
of change 


(3) £(0) given, 


( 21 . 122 ) 


(3) -j)(0) = given, (21.123) 

dt 

Then (1) one picks a definite set of coordinates x i = (x,y, z) and in terms of those 
coordinates finds the unique metric coefficients g^(x, y, z) that describe that 3-geom¬ 
etry. The existence of a solution is guaranteed by the circumstance that {3) £ is a 
Riemannian geometry. However, one could have started with different coordinates 
and ended up with different metric coefficients for the description of the same 
3-geometry. No matter. Pick one set of coordinates, take the resulting metric co¬ 
efficients, and stick to them as giving half the required initial-value data. (2) Simi¬ 
larly, to describe the 3-geometry {3) h + i3) h dt at the value of the parameter t 4- dt, 
make use of coordinates x* + x* dt and arrive at the metric coefficients g i; - + g^ dt . 
The arbitrariness in the x l having thus been resolved by fiat, and the (3) £ being 
given as definite initial physical data, the g {j are thereby completely fixed. (3) Recall 
that the components of the extrinsic curvature or the momenta are given 
in terms of the g i; . and g i} - and the lapse and shift functions N and AT. by (21.67) 
or by (21.67) plus (21.91) or by (21.114). The four initial-value or “constraint” 
equations (21.116) and (21.117) thus become four conditions for finding the four 
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quantities N, N t . One can shorten the writing of these conditions by introducing 
the abbreviations 

.= jl^iu + % - dgij/m (21.124) 

and 


r! = (anomaly) = 

(both for functions of x, y, z on the initial simultaneity). Then one has 


(3) R + y 2 /N 2 = 16^7 nn = 16 ttT™ 
for the one initial-value equation; and for the other three, 



- S^Tr y 
N 


-iirTf. 


(21.126) 


(21.127) 


Lapse and shift initially 
determinate; thereafter freely 
disposable 


Counting initial-value data 


In summary, one chooses the g i; . and g { . with much arbitrariness (because of the 
arbitrariness in the coordinates, not by reason of any arbitrariness in the physics) 
to represent the given initial-value data, (3> £j and (3) -i>. The arbitrariness at the initial 
time all having been soaked up in this way, one expects no arbitrariness to be left 
m the initial N and as obtained by solving (21.126) and (21.127). However, on 
all later spacelike slices, the award of the arbitrariness is reversed. The lapse and 
shift functions are freely disposable, but, with them once chosen, there is no arbi¬ 
trariness whatever in the six g tj (and the six K if or tt 1 ’) as given by the integration 
of the dynamic equations (21.114) and (21.115). The analogy with electrodynamics 
is clear. There the one “gauge-controlled” function <p was fixed at the start by the 
elliptic equation (21.121), but was thereafter free. Here the four lapse and shift 
functions are fixed at the start by the four equations (21.126) and (21.127), but are 
thereafter free. 

Exercise 21.16 applies the initial-value equation (21.126) to analyze the whole 
evolution in time of any Friedmann universe in which one knows the equation 
p =p(p) connecting pressure with density. Exercise 21.17 looks for a variation 
principle on the spacelike hypersurface 2 equivalent in content to the elliptic 
initial-value equation (21.121) for the scalar potential <£. Exercises 21.18 and 21.19 
look for similar variation principles to determine the lapse and shift functions. 

How many degrees of freedom, or how many “handles,” are there in the specifica¬ 
tion of the 4-geometry that one will obtain? The metric coefficients of the initial 
3-geometry provided six numbers per space point. However, they were arbitrary to 
the extent of a coordinate transformation, specified by three functions of position, 


x = x(x',y, z% 
y = y(x',y', z% 

z = z(x',y',z'). 
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The net number of quantities per space point with any physical information was 
therefore 6 — 3 = 3. One can visualize these three functions as the three diagonal 
components of the metric in a coordinate system in which g if has been transformed 
to diagonal form. Ordinarily it is not useful to go further and actually spell out the 
analysis in any such narrowly circumscribed coordinate system. 

Now think of the (3> & in question as imbedded in the (4> & that comes out of the 
integrations. Moreover, think of that (4) & as endowed with the lumps, bumps, wiggles, 
and waves that distinguish it from other generic 4-geometries and that make Min¬ 
kowski geometry and special cosmologies so unrepresentative. The {3) h is a slice in 
that (4) &. It partakes of the lumps, bumps, wiggles, and waves present in all those 
regions of the (4> -& that it intersects. To the extent that the * 4> £) is generic, it does 
not allow the l3> -& to be moved to another location without becoming a different 
l3> -&. If one tries to push the l3> -& “forward in time” a little in a certain locality, leaving 
it unchanged in location elsewhere, one necessarily changes the (3) £. By this circum¬ 
stance, one sees that the (3> & “carries information about time” [Sharp (1960); Baierl- 
ein, Sharp, and Wheeler (1962)]. Moreover, this “forward motion in time” demands 
for its description one number per space point. It is possible to think of this number 
in concrete terms by imagining an arbitrary coordinate system T, x, y, z laid down 
in the (4 > -£j. Then the hypersurface can be conceived as defined by the value T = 
t(x, y, I) at which it cuts the typical line x, y, I. A forward movement carries it to 
y, z) + St(x,y, z), and changes shape and metric coefficients on (3) h accordingly. 

It is usually better not to tie one’s thinking down to such a concrete model, but 
rather to recognize as a general point of principle (1) that the location of the {3) Z> 
in spacetime demands for its specification one datum per spacepoint, and (2) that 
this datum is already willy-nilly present in the three data per spacepoint that mark 
any (3> £). 

In conclusion, there are only two data per spacepoint in a (3) £ that really tell Four pieces of 
anything about the (4) £) in which it is imbedded, or to be imbedded (as distinguished geometrodynamic information 
from where the (3> & slices through that (4> &). Similarly for the other ,3) & that defines simultaneit 50 '™ ^ ' n ' tial 
the other “face of the sandwich,” whether thick or thin. Thus one concludes that Slmutaneity 
the specification of (3> £) and (3) h actually gives four net pieces of dynamic information 
per spacepoint about the (4> & (all the rest of the information being “many-fingered 
time,” telling where the 3-geometries are located in that (4) &). According to this line 
of reasoning, geometrodynamics has the same number of dynamic degrees of free¬ 
dom as electrodynamics. One arrives at the same conclusion in quite another way 
through the weak-field analysis (§35.3) of gravitational waves on a flat spacetime 
background: the same ranges of possible wave numbers as for Maxwell waves; and 
for each wave number two states of polarization; and for each polarization one 
amplitude and one phase (the equivalent of one coordinate and one momentum). 

In electrodynamics in a prescribed spacetime manifold, one has a clean separation 
between the one time-datum per spacepoint (when one deals with electromagnetism 
in the context of many-fingered time) and the two dynamic variables per spacepoint; 
but not so in the superspace formulation of geometrodynamics. There the two kinds 
of quantities are inextricably mixed together in the one concept of 3-geometry. 
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Turn from initial- and final-value data to the action integral that is determined 
by (1) these data and (2) the principle that the action be an extremum. 


/ = 


/ 


extremum 


= 5. 


The action depends on the variables on the final hypersurface, according to the 
formula 


Problem in assuring 
completeness and 
consistency of initial data 


5 = S(2,B) (21.128) 

in electrodynamics, but according to the formula 

5= S( (3) £) (21.129) 

in geometrodynamics. In each case, there are three numbers per spacepoint in the 
argument of the functional (one in two in a divergence-free magnetic field; three 
in < 3) £). 

This mixing of the one many-fingered time and the two dynamic variables in a 
3-geometry makes it harder in general relativity than in Maxwell theory to know 
when one has in hand appropriate initial value data. Give I and give <g and # 
on 2 : that was enough for electrodynamics. For geometrodynamics, to give the six 
gij{x,y y z) and the six g i; (x, y, z) is not necessarily enough. For example, let the time 
parameter t be a fake, so that dt, instead of leading forward from a given hypersur¬ 
face I to a new hypersurface + dS 9 merely recoordinatizes the present hypersur¬ 
face: 


x { — x { - dt , - -' 

Sij ga + (£ii; + €w ) dL (21.130) 

A first inspection may make one think that one has adequate data in the six g Xj 
and the six 


The "thin sandwich 
conjecture" 


gij T £j]i> (21.131) 

but in the end one sees that one has not both faces of the thin sandwich, as required, 
but only one. Thus one must reject, as improperly posed data in the generic problem 
of dynamics, any set of six g fj that let themselves be expressed in the form (21.131) 
[Belasco and Ohanian (1969)]. 

Similar difficulties occur when the two faces of the thin sandwich, instead of 
coinciding everywhere, coincide in a limited region, be it three-dimensional, two- 
dimensional, or even one-dimensional (“crossover of one face from being earlier 
than the other to being later”). Thus it is enough to have (21.131) obtaining even 
on only a curved line in I to reject the six g {j as inappropriate initial-value data. 

That one can impose conditions on the g.- and g i; - which will guarantee existence 
and uniqueness of the solution N(x, y , z), N { (x , y 9 z) of the initial-value equations 
(21.126) and (21.127) is known as the “thin-sandwich conjecture,” a topic on which 
there has been much work by many investigators, but so far no decisive theorem. 
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To presuppose existence and uniqueness is to make the first step in giving mathe¬ 
matical content to Mach’s principle that the distribution of mass-energy throughout 
space determines inertia (§21.12). 


§21.10. THE TIME-SYMMETRIC AND 

TIME-ANTISYMMETRIC INITIAL-VALUE PROBLEMS 

Turn from the general initial-value problem to two special initial-value problems 
that lend themselves to detailed treatment, one known as the time-symmetric ini¬ 
tial-value problem, the other as the time-antisymmetric problem. 

A 4-geometry is said to be time-symmetric when there exists a spacelike hypersur¬ 
face I at all points of which the extrinsic curvature vanishes. In this case the three 
initial value equations (21.127) are automatically satisfied, and the fourth reduces 
to a simple requirement on the three-dimensional scalar curvature invariant, 

R = 16v 7 p. (21.132) 

Still further simplifications result when one limits attention to empty space. Simplest 
of all is the case of spherical symmetry in which (21.132) yields at once the full 
Schwarzschild geometry at the moment of time symmetry (two asymptotically flat 
spaces connected by a throat), as developed in exercise 21.20. 

Consider a 3-geometry with metric 

ds i = Sum* dxi dxk • (21.133) 

Call it a “base metric.” Consider another 3-geometiy with metric 

ds 2 2 = yp 4 (x i )ds 1 2 . (21.134) 

Angles are identical in the two geometries. On this account they are said to be 
conformally equivalent. The scalar curvature invariants of the two 3-geometries are 
related by the formula [Eisenhart (1926)] 

R 2 = -fy- 5 + yp-4R v (21.135) 


where 


▼i 2 '/' = +** = gr 1/2 (.^Mlgi 1/2 g ik (^/dx k )) (21.136) 

Demand that the scalar curvature invariant R 2 vanish, and arrive [Brill (1959)] at 
the “wave equation” 

- " Vpty - (R^iyp = 0 (21.137) 

for the conformal correction factor \p. Brill takes the base metric to have the form 
suggested by Bondi, 


ds 2 = dz 2 + dp 2 ) + p 2 d4> 2 , 


(21.138) 
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Wave amplitude determines 
mass-energy: m — m(A) 


"Time-antisymmetric” 
initial-value data 


and takes the conformal correction factor \p also to possess axial symmetry. In the 
application: 

< 7 i(p, z) measures the “distribution of gravitational wave amplitude,” assumed for 
simplicity to vanish outside r = (p 2 + z 2 ) 1/z = a\ 

A measures the ‘amplitude of the distribution of gravitational wave ampli¬ 
tude”; 

'WP> z ) conformal correction factor, which varies with position at large 

distances as 1 4 - (m/2r). The quantity m(cm) is uniquely determined by 
the condition that the geometry be asymtotically flat. It measures the 
mass-energy of the distribution of gravitational radiation. 

The mass m of the gravitational radiation is proportional to A 2 for small values of 
the amplitude A. It is inversely proportional to the reduced wavelength A = (effective 
wavelength/ 277 ) that measures the scale of rapid variations in the gravitational wave 
amplitude ^(p, z) in the “active zone.” Thus the metric is dominated by wiggles, 
proportional in amplitude to A, in the active zone, and at larger distances dominated 
by something close to a Schwarzschild (1 + 2 m/r) factor in the metric. When the 
amplitude A is increased, a critical value is attained, A = A crit , at which m goes to 
infinity and the geometry curves up into closure (“universe closed by its own content 
of gravitational-wave energy”). Further analysis and examples will be found in 
Wheeler (1964a), pp. 399-451, also in Wheeler (1964c). 

Brill has carried out a similar analysis [Brill (1961)] for the vacuum case of what 
he calls time-antisymmetric initial-value conditions, sketched below as amended by 
York (1973). (1) The initial slice is maximal, Tr K — 0. (2) This slice is conformally 
flat, 

gii = VS i r (21.139) 

(3) Work in the “base space” with metric 5 i; - and afterwards transform to the geometry 

(21.139). Three of the initial-value equations become 

KtfaseJ = °- (21.140) 

To solve these equations, (1) take any localized trace-free symmetric tensor B km ; 

(2) solve the flat-space Laplace equation V 2 A = (3/2) d 2 B km /dx k dx m for A\ 

(3) define the six potentials A km = B km + \A S km ; and (4) calculate 

^tLe = [ikl)[jmn] d 2 A k Jdx l dx n , (21.141) 

that automatically satisfy (21.140) and give Tr K h&se = 0. Then K ij = <F 10 KjL e also 
automatically satisfies these conditions, but now in the curved geometry (21.139). 
The final initial-value equation becomes a quasilinear elliptic equation, in the flat 
base space, for the conformal factor xp, 

WLe'f' + r'ZiK basei,) 2 = 0. 

i ,) 

The asymptotic form of xj, reveals that the mass of the wave is positive. 


(21.142) 
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In addition to the time-symmetric and time-antisymmetric cases, there are at least 
two further cases where the initial-value problem possess special simplicity. One is 
the case of a geometry endowed with a symmetry, as, for example, for the Friedmann 
universe of Chapter 27 or the mixmaster universe of Chapter 30 or cylindrical 
gravitational waves in the treatment of Kuchaf (1971a). One starts with a spacelike 
slice on which the g if and have a special symmetry, and makes all future spacelike 
slices in a way that preserves this symmetry. The geometry on any one of these 
simultaneities, though almost entirely governed by these symmetry considerations, 
still typically demands some countable number of parameters for its complete deter¬ 
mination, such as the radius of the Friedmann universe, or the three principal radii 
of curvature of the mixmaster universe. These parameters and the momenta conju¬ 
gate to them define a miniphase space. In this miniphase space, the dynamics runs 
its course as for any other problem of classical dynamics [see, for example, Box 30.1 
and Misner (1969) for the mixmaster universe; Kuchaf (1971a) and (1972) for waves 
endowed with cylindrical symmetry; and Gowdy (1973) for waves with spherical 
symmetry]. Even the evidence for the existence of many-fingered time, most charac¬ 
teristic feature of general relativity, is suppressed as the price for never having to 
give attention to any spacelike slice that departs from the prescribed symmetry. 


Exercise 21.16. POOR MAN'S WAY TO DO COSMOLOGY 

Consider a spacetime with the metric 

ds 2 = -dt 2 + a 2 (t)[d x 2 + sin 2 x(d9 2 + sin 2 0 d<p 2 )}, 

corresponding to a 3-geometry with the form of a sphere of radius a(t) changing with time. 
Show that the tensor of extrinsic curvature as expressed in a local Euclidean frame of 
reference is 

K = — ar\da/dt)1 , 

where 1 is the unit tensor. Show that the initial value equation (21.77) reduces to 

(6 /a 2 )(da/dt) 2 + (6/a 2 ) = \6 tt p(a) 

[for the value of the second term on the left, see exercise 14.3 and Boxes 14.2 and 14.5], 
and explain why it is appropriate to write the term on the right as 6 a Q /a 3 for a “dust-filled 
model universe.” More generally, given any equation of state, p = p(p), explain how one 
can find p = p(a) from 


d(pa 3 ) — —p d{a z )\ 

and how one can thus forecast the history of expansion and recontraction, a = a{t). 

Exercise 21.17. THIN-SANDWICH VARIATIONAL PRINCIPLE FOR 
THE SCALAR POTENTIAL IN ELECTRODYNAMICS 

(a) Choose the unknown U m in the expression 


1 

_ p?nn 


d<*> c<j) 

dx m ax” 


+ U n 


d<p 

a*" 


Finite dimensional dynamics 
for geometries endowed with 
high symmetry 
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in such a way that this expression, multiplied by the volume element g 1/2 d 3 x , and integrated 
over the simultaneity I, is extremized by a <p, and only by a <p, that satisfies the initial-value 
equation (21.108) of electrodynamics. 

(b) Show that the resulting variational principle, instead of having to be invented “out 
of the blue,” is none other than what follows directly from the action principle build on 
the Lagrangian density (21.100) of electrodynamics (independent variation of $ and the three 
A t everywhere between the two faces of a sandwich to extremize 7, subject only to the prior 
specification of the A t on the two faces of the sandwich, in the limit where the thickness 
of the sandwich goes to zero). 

Exercise 21.18. THIN-SANDWICH VARIATIONAL PRINCIPLE FOR THE LAPSE 
AND SHIFT FUNCTIONS IN GEOMETRODYNAMICS 

(a) Extremize the action integral 

U* - (Tr/C) 2 + Tr/C 2 - 2 T' nn ]N 
-2 T?N k )?ntPx 

with respect to the lapse and shift functions, and show that one arrives in this way at the 
four initial-value equations of geometrodynamics. It is understood that one has given the 
six and the six dg^/dt on the simultaneity where the analysis is being done. The extrinsic 
curvature is considered to be expressed as in (21.67) in terms of these quantities and the 
lapse and shift. The energy density and energy flow are referred to a unit normal vector 
n and three arbitrary coordinate basis vectors e i within the simultaneity, as earlier in this 
chapter, and the asterisk is an abbreviation for an omitted factor of 877. 

(b) Derive this variational principle from the ADM variational principle by going to the 
limit of an infinitesimally thin sandwich [see derivation in Wheeler (1964)]. 

Exercise 21.19. CONDENSED THIN-SANDWICH VARIATIONAL PRINCIPLE 

(a) Extremize the action I 3 of the preceding exercise with respect to the lapse function TV. 

(b) What is the relation between the result and the principle that “3-geometry is a carrier 
of information about time”? 

(c) By elimination of TV, arrive at a “condensed thin-sandwich variational principle” in 
which the only quantities to be varied are the three shift functions TV { . 

Exercise 21.20. POOR MAN'S WAY TO SCHWARZSCHILD GEOMETRY 

On curved empty space evolving deterministically in time, impose the conditions (1) that 
it possess a moment of time-symmetry, a spacelike hypersurface, the extrinsic curvature of 
which, with respect to the enveloping spacetime, is everywhere zero, and (2) that this spacelike 
hypersurface be endowed with spherical symmetry. Write the metric of the 3-geometry in 
the form 


ds 2 = ^ 4 (r )(d r 2 + r 2 dO 2 -hr 2 sin 2 0 d<p 2 ). 


From the initial-value equation (21.127), show that the conformal factor^ up to a multiplica¬ 
tive factor must have the form \p = (1 + m/27). Show that the proper circumference 2irr^ 2 (7) 
assumes a minimum value at a certain value of F, thus defining the throat of the 3-geometry. 
Show that the 3-geometry is mirror-symmetric with respect to reflection in this throat in 
the sense that the metric is unchanged in form under the substitution r f — m 2 /47. Find the 
transformation from the conformal coordinate 7 to the Schwarzschild coordinate r. 
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§21.11. YORK'S "HANDLES" TO SPECIFY A 4-GEOMETRY 

On a simultaneity—or on the simultaneity—of extremal proper volume, give the 
conformal part of the 3-geometry and give the two inequivalent components of the 
dynamically conjugate momentum in order (1) to have freely specifiable, but also 
complete, initial-value data and thus (2) to determine completely the whole generic 
four-dimensional spacetime manifold. This in brief is York’s extension (1971, 1972b) 
to the generic case of what Brill did for special cases (see the preceding section). 
York and Brill acknowledge earlier considerations of Lichnerowicz (1944) and Bruhat 
(1962 and earlier papers cited there on conformal geometry and the initial-value 
problem). But why conformal geometry, and why pick such a special spacelike 
hypersurface on which to give the four dynamic data per spacepoint? 

Few solutions of Maxwell’s equations are simpler than an infinite plane mon¬ 
ochromatic wave in Minkowski’s flat spacetime, and few look more complex when 
examined on a spacelike slice cut through that spacetime in an arbitrary way, with 
local wiggles and waves, larger-scale lumps and bumps, and still larger-scale general 
curvatures. No one who wants to explore electrodynamics in its evolution with 
many-fingered time can avoid these complexities; and no one will accept these 
complexities of many-fingered time who wants to see the degrees of freedom of the 
electromagnetic field in and by themselves exhibited in their neatest form. He will 
pick the simplest kind of timelike slice he can find. On that simultaneity, there are 
two and only two field coordinates, and two and only two field momenta per 
spacepoint. Similarly in geometrodynamics. 

When one wants to untangle the degrees of freedom of the geometry, as distinct 
from analyzing the dynamics of the geometry, one therefore retreats from the three 
items of information per spacepoint that are contained in a 3-geometry [or in any 
other way of analyzing the geometrodynamics, as especially seen in the “extrinsic 
time” formulation of Kuchar (1971b and 1972)] and following York (1) picks the 
simultaneity to have maximal proper volume and (2) on this simultaneity specifies 
the two “coordinate degrees of freedom per spacepoint” that are contained in the 
conformal part of the 3-geometry. 

An element of proper volume g l/2 d 3 x on the spacelike hypersurface I undergoes, 
in the next unit interval of proper time as measured normal to the hypersurface, 
a fractional increase of proper volume [see Figure 21.3 and equations 21.59 and 
21 .66] given by 

“Trlf = -jg~ 1/2 Tr/7. (21.143) 

For the volume to be extremal this quantity must vanish at every point of 2. This 
condition is satisfied in a Friedmann universe (Chapter 27) and in a Taub universe 
(Chapter 30) at that value of the natural time-coordinate t at which the universe 
switches over from expansion to recontraction. It is remarkable that the same 
condition on the choice of simultaneity, 2, lets itself be formulated in the same 
natural way, 


The degrees of freedom of 
the geometry in brief 


Pick hypersurface of extremal 
proper volume 


Tr K ~ 0 or Tr n — 0, 


( 21 . 144 ) 
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Case of open 3-geometry 


Meaning of conformal 
3-geometry 


for a closed universe altogether deprived of any symmetry whatsoever. Alternatively, 
one can deal with a spacetime that is topologically the product of an open 3-space 
by the real line (time). Then it is natural to think of specifying the location in it 
of a bounding spacelike 2-geometry 5 with the topology of a 2-sphere. Then one 
has many ways to fill in the interior of 5 with a spacelike 3-geometry 2\ but of 

all these 2% only the one that is extremal, or only the ones that are extremal satisfy 
(21.144). 

Who is going to specify this 2-geometry with the topology of a 2-sphere? The 
choice of that 2-geometry is not a matter of indifference. In a given 4-geometry, 
distinct choices for the bounding 2-geometry will ordinarily give distinct results for 
the extremizing 3-geometry, and therefore different choices for the “initial-value 
simultaneity,” 2. No consideration immediately thrusts itself forward that would 
give preference to one choice of 2-geometry over another. However, no such infinity 
of options presents itself when one limits attention to a closed 3-geometry. Therefore 
it will give concreteness to the following analysis to consider it applied to a closed 
universe, even though the analysis surely lets itself be made well-defined in an open 
region by appropriate specification of boundary values on the closed 2-geometry that 
bounds that open region. In brief, by limiting attention to a closed 3-geometry, one 
lets the obvious condition of closure take the place of boundary conditions that are 
not obvious. 

York’s analysis remains simple when his extrinsic time 


T = yg 1/2 Tr n = J Tr K 


has any constant value on the hypersurface, not only the value r = 0 appropriate 
for the hypersurface of extremal proper volume. 

On the simultaneity 2 specified by the condition of constant extrinsic time, 
r = constant, begin by giving the conformal 3-geometry, 


< =«>< = 


/the equivalence class of all those positive definite\ 
/ Riemannian three-dimensional metrics that are \ 
equivalent to each other under (1) diffeomorphism 
(smooth sliding of the points over the mainfold to 
new locations) or (2) changes of scale that vary 
smoothly from point to point, leaving fixed all 
t local angles (ratios of local distances), but J 
\changing local distances themselves or (3) both./ 


(21.145) 


The conformal 3-geometry is a geometric object that lends itself to definition and 
interpretation quite apart from the specific choice of coordinate system and even 
without need to use any coordinates at all. The conformal 3-geometry {on the 
hypersurface 2 where r = constant) may be regarded much as one regards the 
magnetic field in electromagnetism. The case of conformally flat 3-geometry, 


ds 2 = 4' i (x,y,z)ds g Me 


(21.146) 
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(with gijbase = <5 i; ), is analogous to those initial-value situations in electromagnetism 
where the magnetic field is everywhere zero (the time-antisymmetric initial-value 
problem of Brill); but now we consider the case of general dsl Me . 

The six metric coefficients g i; - of the conformal 3-geometry, subject to being 
changed by change of the three coordinates jc\ and undetermined at any one point 
up to a common position-dependent multiplicative factor, carry 6 — 3 — 1=2 
pieces of information per spacepoint. In this respect, they are like the components 
of the divergenceless magnetic field $. The corresponding field momentum oc S' 
(Box 21.1, page 496) has its divergence specified by the charge density, and so also 
carries 


two pieces of information (in addition to the prescribed information 

about the density of charge) per spacepoint. (21.147) 

The comparison is a little faulty between the components of $ and the metric 
coefficients. They are more like potentials than like components of the physically 
relevant field. 

The appropriate measure of the “field” in geometrodynamics is the curvature 
tensor; but how can one possibly define a curvature tensor for a geometry that is 
as rudimentary as a conformal 3-geometry? York (1971) has raised and answered 
this question. The Weyl conformal-curvature tensor [equation (13.50) and exercise 
13.13] is independent [in the proper (|) representation], in spaces of higher dimen¬ 
sionality, of the position-dependent factor xj/ 4 with which one multiplies the metric 
coefficients, but vanishes identically in three-dimensional space (exercise 21.21). One 
arrives at a non-zero conformally invariant measure of the curvature only when one 

goes to one higher derivative (exercise 21.22). In this way, one comes to York’s York's curvature tensor 
curvature /? a& , here called Y ab , a tensor density with these properties: 

Y ab _ y ba (symmetric); 

Y a a = 0 (traceless); 

Y ab ]b = 0 (transverse); 

Y ab invariant with respect to position-dependent 
changes in the conformal scale factor; 

Y ab = 0 when and only when the 3-geometry is conformally fiat. (21.148) 

Y ab provides what York calls the pure spin-two representation of the 3-geometry 
intrinsic to 2\ It is the analog of the field $ of electrodynamics on the spacelike 
initial-value simultaneity. It directly carries physical information about the conformal 
3-geometry. 

In addition to the conformal geometry (3) <, specified by the “potentials” g i; -/g 1/3 , 
jmd measured T>y the Afield components” one must also specify on 2 the 
corresponding conjugate momenta: 
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The associated momenta 


Unique solution for 
conformal factor 


7 r ab — -77 ab (symmetric); ££ = 0 (traceless); 

^ ab \ b = 0 (transverse) in case there is no flow of energy in 
space; otherwise 

77 ab jb = 8?; (density of flow of energy) 0 ; 

two pieces of information (in addition to the prescribed information 

about the flow of energy) per spacepoint. (21.149) 

It might appear to be essential to specify with respect to which of the 3-geometries, 
distinguished from one another by different values of the conformal factor one 
calculates the covariant derivatives of tensor densities of weight 5/3 (see §21.2) in 
(21.148) and (21.149). However, York has shown that the conditions (21.149) do not 
in any way depend on the value of the conformal factor \p 4 . 

These equations (21.149) for what York calls the “momentum density of weight 
5/3/’ 

%ab -gl/S^ab _j. g ab Tr „j t (21.150) 

are linear, and therefore lend themselves to analysis by standard methods. It is a 
great help in this enterprise that York (1973a,b) has provided a “conformally invari¬ 
ant orthogonal decomposition of symmetric tensors on Riemannian manifolds” that 
allows one to generate solutions of these requirements (“transverse traceless,” “con¬ 
formal Killing,” and “trace” parts, respectively, measure deformation of conformal 
part of geometry, mere recoordinatization, and change of scale). It is a further 
assistance, as York notes, that one has the same 1T ab for an entire conformal equiva¬ 
lence class of metrics; that is, for a given 

g ab = g- 1/3 ga» (2U51) 

no matter how different the g ab and \p themselves may be. 

The conformal 3-geometry and the “momentum density of weight 5/3” once 
picked, the remaining initial-value equation (21.116) then becomes the “scale” 
equation, 

8 V 2 ^ - <3 >/ty + M<T 7 + Qr 3 - |t 2 ^ 5 = 0, (21.152) 

O 

for the determination of the conformal factor \p. Here V 2 stands for the Laplacian 

= g-^(d/dx a )g l/2 g a \dxp/dx b ). (21.153) 

It, like (3) R, M, and Q , refers to the base space. It is interesting that 

V 2 - 

8 

is a conformally invariant wave operator, whereas V 2 itself is not. The quantity M 
in York’s analysis is an abbreviation for 

M = g- 5/3 g ac g bd Z ai TT cd , (21.154a) 

and 

Q = 16 5rp base (= 16^*p = 16^ 8 p infinal3 . geometry ). (21.154b) 
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One seeks a solution tp that is continuous over the closed manifold and everywhere 
real and positive. When does such a solution ip of the elliptic equation (21.152) exist? 
When is it unique? Always (when M > 0 and r ^ 0), is the result of O’Murchadha 
and York (1973); see also earlier investigations of Choquet-Bruhat (1972). Some of 
the physical considerations that come into this kind of problem have been discussed 
by Wheeler (1964a, pp. 370-381). 

§21.12. MACH'S PRINCIPLE AND THE ORIGIN OF INERTIA 

In my opinion the general theory of relativity can only solve this problem [of 
inertia ] satisfactorily if it regards the world as spatially self-enclosed. 

ALBERT EINSTEIN (1934), p. 52. 


On June 25, 1913, two years before he had discovered the geometrodynamic law 
that bears his name, Einstein (1913b) wrote to Ernst Mach (Figure 21.5) to express 
his appreciation for the inspiration that he had derived for his endeavors from Mach’s 
ideas. In his great book, The Science of Mechanics, Mach [(1912), Chapter 2, section 
6 ] had reasoned that it could not make sense to speak of the acceleration of a mass 
relative to absolute space. Anyone trying to clear physics of mystical ideas would 
do better, he reasoned, to speak of acceleration relative to the distant stars. But how 
can a star at a distance of 10 9 light-years contribute to inertia in the here and the 
now? To make a long story short, one can say at once that Einstein’s theory 
(1) identifies gravitation as the mechanism by which matter there influences inertia 
here; (2) says that this coupling takes place on a spacelike hypersurface [in what 
one, without a closer examination, might mistakenly think to be a violation of the 
principle of causality; see Fermi (1932) for a discussion and clarification of the similar 
apparent paradox in electrodynamics; see also Einstein (1934), p. 84: “Moreover 
I believed that I could show on general considerations a law of gravitation invariant 
in relation to any transformation of coordinates whatever was inconsistent with the 
principle of causation. These were errors of thought which cost me two years of 
excessively hard work, until I finally recognized them as such at the end of 1915”]; 
(3) supplies in the initial-value equations of geometrodynamics a mathematical tool 
to describe this coupling; (4) demands closure of the geometry in space [one conjec¬ 
tures; see Wheeler (1959, 1964c) and Honl (1962)], as a boundary condition on the 
initial-value equations if they are to yield a well-determined [and, we know now, 
a unique] 4-geometry; and (5) identifies the collection of local Lorentz frames near 
any point in this resulting spacetime as what one means quantitatively by speaking 
of inertia at that point. This is how one ends up with inertia here determined by 
density and flow of mass-energy there. 

There are many scores of papers in the literature on Mach’s principle, including 
many—even one by Lenin (English translation, 1927)—one could call anti-Mach- 
ian; and many of them make interesting points [see especially the delightful dialog 
by Weyl (1924a) on “inertia and the cosmos,” and the article (1957) and book (1961) 
of Sciama]. However, most of them were written before one had anything like the 
understanding of the initial-value problem that one possesses today. Therefore no 

(continued on page 546 ) 


No violation of causality, 
despite appearances 


An enormous literature 
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Figure 21.5. 

Einstein’s appreciation of Mach, written to Ernst Mach June 25, 1913, while Einstein was working hard 
at arriving at the final November 1915 formulation of standard general relativity. Regarding confirmation 
at a forthcoming eclipse: "If so, then your happy investigations on the foundations of mechanics, Planck’s 
unjustified criticism notwithstanding, will receive brilliant confirmation. For it necessarily turns out that 
inertia originates in a kind of interaction between bodies, quite in the sense of your considerations on 
Newton’s pail experiment. The first consequence is on p. 6 of my paper. The following additional points 
emerge: (1) If one accelerates a heavy shell of matter S, then a mass enclosed by that shell experiences 
an accelerative force. (2) If one rotates the shell relative to the fixed stars about an axis going through 
its center, a Coriolis force arises in the interior of the shell; that is, the plane of a Foucault pendulum 
is dragged around (with a practically unmeasurably small angular velocity).” Following the death of 
Mach, Einstein (1916a) wrote a tribute to the man and his work. Reprinted with the kind permission 
of the estate of Albert Einstein, Helen Dukas and Otto Nathan, executors. 
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attempt will be made to summarize or analyze the literature, which would demand 
a book in itself. Moreover, Mach’s principle as presented here is more sharply 
formulated than Einstein ever put it in the literature [except for his considerations 
arguing that the universe must be closed; see Einstein’s book (1950), pp. 107-108]; 
and Mach would surely have disowned it, for he could never bring himself to accept 
general (or even special) relativity. Nevertheless, it is a fact that Mach’s principle— 
that matter there governs inertia here—and Riemann’s idea—that the geometry of 
space responds to physics and participates in physics—were the two great currents 
of thought which Einstein, by means of his powerful equivalence principle, brought 
together into the present-day geometric description of gravitation and motion. 
Mach's principle updated and “Specify everywhere the distribution and flow of mass-energy and thereby deter- 
speHed out mine the inertial properties of every test particle everywhere and at all times”. 

Spelled out, this prescription demands (1) a way of speaking about “everywhere”: 
a spacelike hypersurface 2. Let one insist—in conformity with Einstein—(2) that 
it be a closed 3-geometry, and for convenience, not out of necessity, (3) that t be 
independent of position on 2. (4) Specify this 3-geometry to the extent of giving 
the conformal metric; without the specification of at least this much 3-geometry, 
there would be no evident way to say “where” the mass-energy is to be located. 

(5) Give density p base as a function of position in this conformal 3-geometry. 

(6) Recognize that giving the mass-energy only of fields other than gravity is an 
inadequate way to specify the distribution of mass-energy throughout space. For- 
malistically, to be sure, the gravitational fields does not and cannot make any 
contribution to the source term that stands on the righthand side of Einstein’s field 
equation. However, the analysis of gravitational waves (Chapters 18 and 35) shows 
that perturbations in the geometry of scale small compared to the scale of observation 
have to be regarded as carrying an effective content of mass-energy. Moreover, one 
has in a geon [Wheeler (1955); Brill and Hartle (1964); for more on gravitational-wave 
energy, see §35.14] an object built out of gravitational waves (or electromagnetic 
waves, or neutrinos, or any combination of the three) that holds itself together for 
a time that is long in comparison to the characteristic period of vibration of the 
waves. It looks from a distance like any other mass, even though nowhere in its 
interior can one put a finger and say “here is mass.” Therefore it, like any other 
mass, must have “its influence on inertia.” But to specify this mass, one must give 
enough information to characterize completely the gravitational waves on the simul¬ 
taneity 2. For this, it is not enough merely to have given the two “wave-coordinates” 
per spacepoint that one possesses in (3) <. One must give in addition (7) the two 
“wave-momenta” per spacepoint that appear in York’s “momentum density of weight 
5/3,” t T ab ; and at the same time, as an inextricable part of this operation, one must 
(8) specify the density of flow of field energy. (9) Solve for the conformal factor 

(10) Then one has complete initial-value data that satisfy the initial-value equa¬ 
tions of general relativity. (11) These data now known, the remaining, dynamic, 
components of the field equation determine the 4-geometry into the past and the 
future. (12) In this way, the inertial properties of every test particle are determined 
everywhere and at all times, giving concrete realization to Mach’s principle. 

Much must still be done to spell out the physics behind these equations and to 
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see this physics in action. Some significant progress had already been made in this 
direction before the present stage in one’s understanding of the initial-value equa¬ 
tions. Especially interesting are results of Thirring (1918) and (1921) and of Thirring 
and Lense (1918), discussed by Einstein (1950) in the third edition of his book, 
The Meaning of Relativity. 

Consider a bit of solid ground near the geographic pole, and a support erected 
there, and from it hanging a pendulum. Though the sky is cloudy, the observer 
watches the track of the Foucault pendulum as it slowly turns through 360°. Then 
the sky clears and, miracle of miracles, the pendulum is found to be swinging all 
the time on an arc fixed relative to the far-away stars. If “mass there governs inertia 
here,” as envisaged by Mach, how can this be? 

Enlarge the question. By the democratic principle that equal masses are created 
equal, the mass of the earth must come into the bookkeeping of the Foucault 
pendulum. Its plane of rotation must be dragged around with a slight angular 
velocity, co^, relative to the so-called “fixed stars.” How much is And how 
much would be if the pendulum were surrounded by a rapidly spinning 

spherical shell of mass M and radius R sheU , turning at angular velocity w shell ? 

Einstein’s theory says that inertia is a manifestation of the geometry of spacetime. 
It also says that geometry is affected by the presence of matter to an extent propor¬ 
tional to the factor G/c 2 = 0.742 x 10~ 28 cm/g. Simple dimensional considerations 
leave no room except to say that the rate of drag is proportional to a expression 
of the form 


, _ 7, ^ ^ shell, conv ,. _ ^shell . 

J drag ~ ^ ^2 d ^shell ^ n ( 

A shell 


; shelL 


shell 


(21.155) 


Here A: is a numerical factor to be found only by detailed calculation. Lense and 
Thirring [(1918) and (1921)], starting with a flat background spacetime manifold, 
calculated in the weak-field approximation of Chapter 18 the effect of the moving 
current of mass on the metric. Expressed in polar coordinates, the metric acquires 
a non-zero coefficient g ^ ( . Inserted into the equation of geodesic motion, this off- 
diagonal metric coefficient gives rise to a precession. This precession (defined here 
about an axis parallel to the axis of rotation, not about the local vertical) is given 
by an expression of the form (21.155), where the precession factor k has the value 

k = 4/3. (21.156) 

There is a close parallelism between the magnetic component of the Maxwell field 
and the precession component of the Einstein field. In neither field does a source 
at rest produce the new kind of effect when acting on a test particle that is also 
at rest. One designs a circular current of charge to produce a magnetic field; and 
a test charge, in order to respond to this magnetic field, must also be in motion. 
Similarly here; no pendulum vibration means no pendulum precession. Moreover, 
the direction of the precession depends on where the pendulum is, relative to the 
rotating shell of mass. The precession factor k has the following values; 


The Foucault pendulum 


The dragging of the inertial 
frame 
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The "sum for inertia” 


k = 4/3 for pendulum anywhere inside rotating 
shell of mass; 

k — 4/3 for pendulum at North or South pole; (21.157) 

k — —2/3 for pendulum just outside the rotating 
shell at its equator. 

This position-dependence of the drag, makes still more apparent the analogy 
with magnetism, where the field of a rotating charged sphere points North at the 
center of the sphere, and North at both poles, but South at the equator. 

Whether the Foucault pendulum is located in imagination at the center of the 
earth or in actuality at the North pole, the order of magnitude of the expected drag 
is 

m earth 0.44 cm 1 radian 

“drag ~ “earth ~ 6 X 10 8 CIT1 13700 SeC (21.158) 

— 5 X 10“ 14 rad/sec, 

too small to allow detection, let alone actual measurement, by any device so far 
built—but perhaps measurable by gyroscopes now under construction (§40.7). By 
contrast, near a rapidly spinning neutron star or near a black hole endowed with 
substantial angular momentum, the calculated drag effect is not merely detectable; 
it is even important (see Chapter 33 on the physics of a rotating black hole). 

The distant stars must influence the natural plane of vibration of the Foucault 
pendulum as the nearby rotating shell of matter does, provided that the stars are 
not so far away (r — radius of universe) that the curvature of space begins to 
introduce substantial corrections into the calculation of Thirring and Lense. In other 
words, no reason is apparent why all masses should not be treated on the same 
footing, so that (21.158) more appropriately, if also somewhat symbolically, reads 


"plane of 
vibration 
of Foucault 
pendulum 


R 


shell 




far-away 

“stars” 


(21.159) 


Moreover, when there is no nearby shell of matter, or when it has negligible effects, 
the plane of vibration of the pendulum, if experience is any guide, cannot turn with 
respect to the frame defined by the far-away “stars.” In this event w Foucault must 
be identical with co stars ; or the “sum for inertia,” 


y m “sta. r” 

far-away ^“star” 


YYl 

universe 

r 

' umverse 


(21.160) 


“stars” 

must be of the order of unity. Just such a relation of approximate identity between 
the mass content of the universe and its radius at the phase of maximum expansion 
is a characteristic feature of the Friedman model and other simple models of a closed 
universe (Chapters 27 and 30). In this respect, Einstein’s theory of Mach’s principle 
exhibits a satisfying degree of self-consistency. 
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At phases of the dynamics of the universe other than the stage of maximum 
expansion, r universe can become arbitrarily small compared to m universe . Then the 
ratio (21.160) can depart by powers of ten from unity. Regardless of this circum¬ 
stance, one has no option but to understand that the effective value of the “sum 
for inertia” is still unity after all corrections have been made for the dynamics of 
contraction or expansion, for retardation, etc. Only so can w Foucault retain its inescap¬ 
able identity with w far . awajr stars . Fortunately, one does not have to pursue the theology 
of the “sum for inertia” to the uttermost of these sophistications to have a proper 
account of inertia. Mach s idea that mass there determines inertia here has its 
complete mathematical account in Einstein’s geometrodynamic law, as already 
spelled—etrtr—For—the first strong-field analysis of the dragging of the inertial 
reference system in the context of relativistic cosmology, see Brill and Cohen (1966) 
and Cohen and Brill (1967); see also §33.4 for dragging by a rotating black hole. 

Still another clarification is required of what Mach’s principle means and how 

it is used. The inertial properties of a test particle are perfectly well-determined when Minkowski geometry as limit 

that particle is moving in ideal Minkowski space. “Point out, please,” the anti-Ma- of a closed 3-geometry 

chian critic says, the masses that are responsible for this inertia.” In answer, recall 

that Einstein s theory includes not only the geometrodynamic law, but also, in 

Einstein’s view, the boundary condition that the universe be closed. Thus the section 

of spacetime that is flat is to be viewed, not as infinite, but as part of a closed universe. 

(For a two-dimensional analog, fill a rubber balloon with water and set it on a glass 
tabletop and look at it from underneath). The part of the universe that is curved 
acquires its curvature by reason of its actual content of mass-energy or—if animated 
only by gravitational waves—by reason of its effective content of mass-energy. This 
mass-energy, real or effective, is to be viewed as responsible for the inertial properties 
of the test particle that at first sight looked all alone in the universe. 

It in no way changes the qualitative character of the result to turn attention to 
a model universe where the region of Minkowski flatness, and all the other linear 
dimensions of the universe, have been augmented tenfold (“ten times larger balloon; 
ten times larger face”). The curvature and density of the curved part of the model 
universe are down by a factor of 100, the volume is up by a factor of 1,000, the 
mass is up by a factor of 10; but the ratio of mass to radius, or the “sum for inertia” 

(the poor man’s substitute for a complete initial-value calculation) is unchanged. 

Einstein acknowledged a debt of parentage for his theory to Mach’s principle 
(Figure 21.5). It is therefore only justice that Mach’s principle should in return today 
owe its elucidation to Einstein’s theory. 


-Exercise 21.21. WHY THE WEYL CONFORMAL CURVATURE 
TENSOR VANISHES 

How many independent components does the Riemann curvature tensor have in three- 
dimensional space? How many does the Ricci curvature tensor have? Show that the two 
tensors are related by the formula 
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= WRac ~ + g.'R*> ~ gat*? 

+ y Wig* ~ Hgac) 

with no need of any Weyl conformal-curvature tensor to specify (as in higher dimensions) 
the further details of the Riemann tensor. Show' that the Weyl tensor, from an ^-dimensional 
modification of equation (13.50) as in exercise 13.13, vanishes for n — 2. 

Exercise 21.22. YORK'S CURVATURE 

[York (1971)], (a) Define the tensor [Eisenhart (1926)] 

^abc ^able ^aclh *F 

(b) Show that a 3-geometry is conformally flat w'hen and only when R abc = 0. 

(c) Show that the following identities hold and reduce to five the number of independent 
components of R ahc : 

R a ac = g°^6ac = 0; 

Rabc + Racb = 0 ; 

R(ibc + Rcab + Rbca = 


(d) Show that Yorks’ curvature 


y«» = £i/V/i (/?,>_! s,»*) 

= - \g ui [ ae f]g imR ™t 

is conformally invariant and has the properties listed in equations (21.148). 

Exercise 21,23. PULLING THE POYNTING FLUX VECTOR "OUT OF THE AIR" 

From the condition that the Hamilton-Jacobi functional S(g { ^ A m ) (extremal of the action 
integral) for the combined Einstein and Maxwell fields, ostensibly dependent on the six metric 
coefficients g i; .(x,j, z) and the three potentials A m (x,y, z), shall actually depend only on the 
3-geometry of the spacelike hypersurface and the distribution of magnetic field strength on 
this hypersurface, show that the geometrodynamic field momentum 7r ij = SS/Sg^ satisfies 
a condition of the form 


* il U = 

and evaluate the coefficient c in this equation [Wheeler (1968b)]. Hint: Note that the transfor¬ 
mation 


X 4 —»- X* - g if —>- gij + + I il4 

in no way changes the 3-geometry itself, and therefore the corresponding induced change 
in S , 

must vanish identically for arbitrary choice of the ¥(x,y, 2 ), which measure the equivalent 
of the sliding of a ruled transparent rubber sheet over an automobile fender. 
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Exercise 21.24. THE EXTREMAL ACTION ASSOCIATED WITH THE HILBERT 

ACTION PRINCIPLE DEPENDS ON CONFORMAL 3-GEOMETRY 
AND EXTRINSIC TIME [K. Kuchar (1972) and J. York (1972)] 

Show that the data demanded by the Hilbert action principle 8f {4) R(- H) g) 1/2 d*x = 0 on 
each of the two bounding spacelike hypersurfaces consist of (1) the conformal 3-geometry 
(3) < of the hypersurface plus (2) the extrinsic time variable defined by 

T = y g -1/2 Tr /» = y Tr /f, 

conveniently represented by the pictogram W, measured by one number per spacepoint, 
and independent of the conformal factor in the metric of the 3-geometry. This done, explain 
in a few words why in this formulation of geometrodynamics the Hamilton-Jacobi function 
(ft times the phase of the wave function in the semiclassical or JWKB approximation) is 
appropriately expressed in the form 

5 = S( (3) <,W). 


§21.13. JUNCTION CONDITIONS 

The intrinsic and extrinsic curvatures of a hypersurface, which played such funda¬ 
mental roles in the initial-value formalism, are also powerful tools in the analysis 
of “junction conditions.” 

Recall the junction conditions of electrodynamics: across any surface (e.g., a 
capacitor plate), the tangential part of the electric field, and the normal part 
of the magnetic field, £ x , must be continuous; thus, 

[£]i] = (discontinuity in E n ) 

= (£„ on “+ w side of surface) - (£j, on ” side of 
surface) 

= £|| - £7 = 0 , (21.161a) 

[B ± ] EE £J - £1 = 0; (21.161b) 

while the “jump” in the parts E ± and £ j( must be related to the charge density (charge 
per unit area) a , the current density (current per unit area) j, and the unit normal 
to the surface n by the formulas 

[£J = £+ - El = 4 Tran, (21.161c) 

[£„] = £,} - £7 = 4Try x n. (21.161d) 

Recall also that one derives these junction conditions by integrating Maxwell’s 
equations over a “pill box” that is centered on the surface. 

Similar junction conditions, derivable in a similar manner, apply to the gravita¬ 
tional field (spacetime curvature), and to the stress-energy that generates it.* Focus 

*The original formulation of gravitational junction conditions stemmed from Lanczos (1922, 1924). 
The formulation given here, in terms of intrinsic and extrinsic curvature, was developed by Darmois (1927), 
Misner and Sharp (1964), and Israel (1966). For further references to the extensive literature, see Israel. 


Junction conditions for 
electrodynamics 



552 


21. VARIATIONAL PRINCIPLE AND INITIAL VALUE DATA 



Figure 21.6. 

Gaussian normal coordinates in the neighborhood of a 3-surface 2. The metric 
in Gaussian normal coordinates has the form 

ds 2 = (n • #i ) _1 dn 2 4- g {j dx l dx’ 

with n = d/dn, (n • n) = — 1 if the surface is spacelike, and (n * n) = 1 if it is 
timelike. (See exercise 27.2.) The extrinsic curvature of the surfaces n = constant 
is = — \ dgij/d/i, and the Einstein field equations written in “3 + 1” form are 
(21.162). 


attention on a specific three-dimensional slice through spacetime—the 3-surface 2 
on Figure 21.6. Let the surface be either spacelike [unit normal n timelike; 
(/? • /?) = — 1] or timelike [n spacelike; (n • n) = -hi]- The null case will be discussed 
later. As an aid in deriving junction conditions, introduce Gaussian normal coordi¬ 
nates in the neighborhood of 2 [see the paragraph preceeding equation (21.82)]. In 
terms of the intrinsic and extrinsic curvatures of 2 and of neighboring 3-surfaces 

Einstein equation in "3 + 1" n = constant, the Einstein tensor and Einstein field equation have components 
form 

G\ = -y (3, tf + y(/» ■ n ) -1 {(Tr /f ) 2 - Tr(/T 2 )} = 8 ^ n , (21.162a) 

G\= —(n • - (Tr #f) l4 } = (21.162b) 

G'j = (3) G i j + (n • Tr , 

- (Tr IQ**, + 1 5‘ } (Tr Kf + 1 S 4 ,. Tr (/C 2 )} = 8 tt^. (21.162c) 

[See equations (21.77), (21.81), (21.76), and (21.82).] 

Suppose that the stress-energy tensor T 1 ^ contains a “delta-function singularity” 
at 2 —i.e., suppose that 2 is the “world tube” of a two-dimensional surface with 
finite 4-momentum per unit area (analog of surface charge and surface current in 
Surface stress-energy tensor electrodynamics). Then define the surface stress-energy tensor on 2 to be the integral 

of with respect to proper distance («), measured perpendicularly through 2: 



(21.163) 
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To discover the effect of this surface layer on the spacetime geometry, perform a 
“pill-box integration” of the Einstein field equation (21.162) 


f r +£ 1 

Lim I G a g dn = 87rS“ fi . 
e—»o L*_. R J p 


(21.164) 


Examine the integral of G a If the 3-metric g i§ were to contain a delta function 
or a discontinuity at 2, then 2 would not have any well-defined 3-geometry—a 
physically inadmissible situation, even in the presence of surface layers. Absence 
of delta functions, §(«), in means absence of delta functions in i3) R ; absence of 
discontinuities in g.. means absence of delta functions in = — \g ijn . Thus, 
equations (21.162) when integrated say 

f G n n dn = 0 = 8 ttS\, (21.165a) 

f G n i dn- 0 = 8 7;S n i , (21.165b) 

f G'j • dn = (n • n)(y\ - §*, Tr y) = 8775 *,., (21,165c) 

where y^ is the “jump” in the components of the extrinsic curvature 

y=[K] = 0 K on "n = +e side” of 2) - (If on “n = -« side” of JP) (21.166) 
~K + - K~. 


In the absence of a delta-function surface layer, the above junction conditions 
say, simply, that y = [K] = 0. In words: if one examines how 2 is embedded in 
the spacetime above its “upper”face, and how it is embedded in the spacetime below 
its “lower” face, one must discover identical embeddings—i.e., identical extrinsic 
curvatures K. Of course, the intrinsic curvature of 2 must also be the same, whether 
viewed from above or below. More briefly: 

(absence of surface layers) (“continuity” of g %j and K {j ). (21.167) 


If a surface layer is present, then 2 must be the world tube of a two-dimensional 
layer of matter, and the normal to 2 must be spacelike, (n • /») = + 1 . The junction 
conditions (21.165a,b) then have the simple physical meaning 

/the momentum flow is entirely in 2\ 

[ i.e., no momentum associated with the 
surface layer flows out of 2 ; i.e., 2 
\is the world tube of the surface layer / 


S(/i,...) = 0 


(21.168a) 


which tells one nothing new. The junction condition (21.165c) says that the surface 
stress-energy generates a discontinuity in the extrinsic curvature (different embedding 
in spacetime “above” 2 than “below” 2 ), given by 

- 8 ^Try = 8^, (21.168b) 

Of course, the intrinsic geometry of 2 must be the same as seen from above and 
below, 

gij continuous across 2. (21.169) 


Derivation of junction 
conditions 


Junction conditions in 
absence of surface layers 


Junction conditions for a 
surface layer 
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Equation of motion for a 
surface layer 


Gravitational-wave shock 
fronts 


In analyzing surface layers, one uses not only the junction conditions (21.168a) to 
(21.169), but also the four-dimensional Einstein field equation applied on each side 
of the surface I separately, and also an equation of motion for the surface stress- 
energy. The equation of motion is derived by examining the jump in the field 
equation G n i = 8^7^. (equation 21.162b); thus [G n J = 8tt[ T ".] says 

(Y f "-8 < »Try) lm = -8fr[ry; 

and when reexpressed in terms of S™ by means of the junction condition (21.168b), 
it says 


S im |m + [p n ] = 0. (21.170) 

[For intuition into this equation of motion, see Exercises 21.25 and 21.26. For 
applications of the “surface-layer formalism” see exercise 21.27; also Israel (1966), 
Kuchar (1968), Papapetrou and Hamoui (1968).] 

When one turns attention to junction conditions across a null surface one finds 
results rather different from those in the spacelike and timelike cases. A “pill-box” 
integration of the field equations reveals that even in vacuum the extrinsic curvature 
may be discontinuous. A discontinuity in across a null surface, without any 
stress-energy to produce it, is the geometric manifestation of a gravitational-wave 
shock front (analog of a shock-front in hydrodynamics). For quantitative details see, 
e.g., Pirani (1957), Papapetrou and Treder (1959, 1962), Treder (1962), and especially 
Choquet-Bruhat (1968b). 

That a discontinuity in the curvature tensor can propagate with the speed of light 
is a reminder that all gravitational effects, like all electromagnetic effects, obey a 
causal law. The initial-value data on a spacelike initial-value hypersurface uniquely 
determine the resulting spacetime geometry [see the work of Cartan, Stellmacher, 
Lichnerowicz, and Bruhat (also under the names Foures-Bruhat and Choquet-Bru¬ 
hat) and others cited and summarized in the article of Bruhat (1962)] but determine 
it in a way consistent with causality. Thus a change in these data throughout a limited 
region of the initial value 3-geometry makes itself felt on a slightly later hypersurface 
solely in a region that is also limited, and only a little larger than the original region. 

When one turns from classical dynamics to quantum dynamics, one sees new 
reason to focus attention on a spacelike initial-value hypersurface: the observables 
at different points on such a hypersurface commute with one another; i.e., are in 
principle simultaneously observable. 

Not every four-dimensional manifold admits a global singularity-free spacelike 
hypersurface. Those manifolds that do admit such a hypersurface have more to do 
with physics, it is possible to believe, than those that do not. 

Even in a manifold that does admit a spacelike hypersurface, attention has been 
given sometimes, in the context of classical theory, to initial-value data on a hyper¬ 
surface that is not spacelike but “characteristic,” in the sense that it accomodates 
null geodesics [see, for example, Sachs (1964) and references cited there]. It is typical 
in such situations that one can predict the future but not the past, or predict the 
past but not the future. 
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Children ofjight and children of darkness is the vision of physics that emerges 
from this chapter, as from other branches of physics. The children of light are the 
differential equations that predict the future from the present. The children of 
darkness are the factors that fix these initial conditions. 


Exercise 21.25. EQUATION OF MOTION FOR A SURFACE LAYER 

(a) Let u be the “mean 4-velocity” of the matter in a surface layer—so defined that an 
observer moving with 4-velocity u sees zero energy flux. Let o be the total mass-energy per 
unit proper surface area, as measured by such a “comoving observer.” Show that the surface 
stress-energy tensor can be expressed in the form 

S = au ® u + f, where (t • u) = 0, (21.171) 

and where t is a symmetric stress tensor. 

(b) Show that the component along u of the equation of motion (21.170) is 

do/dr = -aw* (i + u } t’ k lk + u } [T’ n l (21.172) 

where d/dr = u. Give a physical interpretation for each term. 

(c) Let a } be that part of the 4-acceleration of the comoving observer which lies in the 
surface layer I . By projecting the equation of motion (21.170) perpendicular to u , show 
that 


= -Pjai‘ ab 16 + [7'“"]}. (21.173) 

where P ja is the projection operator 

P ia = gja + «,-««• (21.174) 

Give a physical interpretation for each term of equation (21.182). 

Exercise 21.26. THIN SHELLS OF DUST 

For a thin shell of dust surrounded by vacuum ([T jn ] = 0, t = 0), derive the following 
equations 

(21.175a) 
(21.175b) 
(21.175c) 

(21.175d) 

Here a+ and a~ are the 4-accelerations as measured by accelerometers that are fastened 
onto the outer and inner sides of the shell, and g is the 3-metric of the shell. Show that 
the first of these equations is the law of “conservation of rest mass.” 

Exercise 21.27. SPHERICAL SHELL OF DUST 

Apply the formalism of exercise 21.25 to a collapsing spherical shell of dust [Israel (1967b)]. 
For the metric inside and outside the shell, take the flat-spacetime and vacuum Schwarzschild 
expressions (Chapter 23), 


do/dr — -ou b lb , 
a+ + a~ = 0, 
a + — a~ = (4 iro)n 

Y = ® u + 
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ds 2 = —dl 2 + dr 2 + r 2 (dd 2 + sin 2 d d<f> 2 ) inside, (21.176a) 

ds 2 = - (l - dt 2 + - ^ 2 — + r 2 (dd 2 + sin 2 d d$ 2 ) outside. (21.176b) 

\ r ) 1 - 2 M/r 

Let the “radius” of the shell, as a function of proper time measured on the shell, be 


R X (proper circumference of shell) = R(r). 

277 


(21.176c) 


Show that the shell's mass density varies with time as 

o(t) = ju/47t/? 2 (t), ju = constant = “total rest mass”; (21.176d) 

and derive and solve the equation of motion 
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THERMODYNAMICS, HYDRODYNAMICS, 
ELECTRODYNAMICS, GEOMETRIC 
OPTICS, AND KINETIC THEORY 

___ ) 


§22.1. THE WHY OF THIS CHAPTER 

Astrophysical applications of gravitation theory are the focus of the rest of this book, 
except for Chapters 41 -44. Each application—stars, star clusters, cosmology, collapse, 
black holes, gravitational waves, solar-system experiments—can be pursued by itself 
at an elementary level, without reference to the material in this chapter. But deep 
understanding of the applications requires a prior grasp of thermodynamics, hydro¬ 
dynamics, electrodynamics, geometric optics, and kinetic theory, all in the context 
of curved spacetime. Hence, most Track-2 readers will want to probe these subjects 
at this point. 


§22.2. THERMODYNAMICS IN CURVED SPACETIME* 

Consider, for concreteness and simplicity, the equilibrium thermodynamics of a 
perfect fluid with fixed chemical composition (“simple perfect fluid”)—for example, 
the gaseous interior of a collapsing supermassive star. The thermodynamic state of 
a fluid element, as it passes through an event 9 0 , can be characterized by various 
thermodynamic potentials, such as n , p, p, T, s, p. The numerical value of each 
potential at is measured in the proper reference frame (§13.6) of an observer 
who moves with the fluid element—i.e., in the fluid element’s “rest frame.” Despite 

*For more detailed treatments of this subject see, e.g., Stueckelberg and Wanders (1953), Kluitenberg 
and de Groot (1954), Meixner and Reik (1959), and references cited therein: see also the references 
on hydrodynamics cited at the beginning of §22.3, and the references on kinetic theory cited at the 
beginning of §22.6. 


-. 

This chapter is entirely Track 2. 

No earlier Track-2 material is 
needed as preparation for it, 
but Chapter 5 (stress-energy 
tensor) will be helpful. 

§22.5 (geometric optics) is 
needed as preparation for 
Chapter 34 (singularities and 
global methods). The rest of 
the chapter is not needed as 
preparation for any later 
chapter; but it will be 
extremely helpful in most 
applications of gravitation 
theory (Chapters 23-40). 

_ _ / 


Thermodynamic potentials 
are defined in rest frame of 
fluid 
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Definitions of thermodynamic 
potentials 


Definition of "simple fluid" 


Law of baryon conservation 


this use of rest frame to measure the potentials, the potentials are frame-independent 
functions (scalar fields). At the chosen event a given potential (e.g., n) has a 
unique value so n is a perfectly good frame-independent function. 

The values of /?, p, p, T, s y p measure the following quantities in the rest frame 
of the fluid element: 

n , baryon number density; i.e., number of baryons per unit three-dimensional 
volume of rest frame, with antibaryons (if any) counted negatively. 

p, density of total mass-energy; i.e., total mass-energy (including rest mass, ther¬ 
mal energy, compressional energy, etc.) contained in a unit three-dimensional 
volume of the rest frame. 

/?, isotropic pressure in rest frame. 

T, temperature in rest frame. 

5 , entropy per baryon in rest frame. (The entropy per unit volume is ns.) 

p, chemical potential of baryons in rest frame [see equation (22.8) below]. 

The chemical composition of the fluid (number density of hydrogen molecules, 
number density of hydrogen atoms, number density of free protons and electrons, 
number density of photons, number density of 238 U nuclei, number density of A 
hyperons ...) is assumed to be fixed uniquely by two thermodynamic variables—e.g., 
by the total number density of baryons n and the entropy per baryon In this sense 
the fluid is a “simple fluid.” Simple fluids occur whenever the chemical abundances 
are “frozen” (reaction rates too slow to be important on the time scales of interest; 
for example, in a supermassive star except during explosive burning and except at 
temperatures high enough for e~ — e + pair production). Simple fluids also occur 
in the opposite extreme of complete chemical equilibrium (reaction rates fast enough 
to maintain equilibrium despite changing density and entropy; for example, in 
neutron stars, where high pressures speed up all reactions). When one examines 
nuclear burning in a nonconvecting star, or explosive nuclear burning, or pair 
production and neutrino energy losses at high temperatures, one must usually treat 
the fluid as “multicomponent.” Then one introduces a number density n d and a 
chemical potential pj for each chemical species with abundance not fixed by n 
and s. For further details see, e.g., Zel’dovich and Novikov (1971). 

The most fundamental law of thermodynamics—even more fundamental than the 
“first” and “second” laws—is baryon conservation . Consider a fluid element whose 
moving walls are attached to the fluid so that no baryons flow in or out. As the 
fluid element moves through spacetime, deforming along the way, its volume V 
changes. But the number of baryons in it must remain fixed, so 

4~{nV) = 0. (22.1) 

UT 

The changes in volume are produced by the flow of neighboring bits of fluid away 
from or toward each other—explicitly (exercise 22.1) 


dV/dr = (V-ii)F, 


( 22 . 2 ) 
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where u = d/dr is the 4-velocity of the fluid. Consequently, baryon conservation 
[equation (22.1)] can be reexpressed as 

0 = -^ + = ▼„« + «(▼• u) = u •▼« + «(▼• i/) = ▼ • (nu); 


i.e., 


VS = 0 , 

S — nu — baryon number-flux vector 


(22.3) 

(22.4) 


(see §5.4 and exercise 5.3.) Moreover, this abstract geometric version of the law must 
be just as valid in curved spacetime as in flat (equivalence principle). 

Note the analogy with the law of charge conservation, V • J — 0, in electrody¬ 
namics (exercise 3.16) and with the local law of energy-momentum conservation, 
▼ • T — 0 (§§5.9 and 16.2). In a very deep sense, the forms of these three laws are 
dictated by the theorem of Gauss (§5.9, and Boxes 5.3, 5.4). 

The second law of thermodynamics states that, in flat spacetime or in curved, 
entropy can be generated but not destroyed. Apply this law to a fluid element of 
volume V containing a fixed number of baryons N. The entropy it contains is 

S = Ns = nsV. 


Entropy may flow in and out across the faces of the fluid element (“heat flow” 
between neighboring fluid elements); but for simplicity assume it does not; or if 
it does, assume that it flows too slowly to have any significance for the problem 
at hand. Then the entropy in the fluid element can only increase: 

d(nsV)/dr > 0 when negligible entropy is exchanged between 
neighboring fluid elements; 

i.e. [combine with equation (22.1)] 


ds/dr > 0 (no entropy exchange). (22.5) 

So long as the fluid element remains in thermodynamic equilibrium, its entropy will 
actually be conserved [“ = ” in equation (22.5)]; but at a shock wave, where equilib¬ 
rium is momentarily broken, the entropy will increase (conversion of “relative kinetic 
energy” of neighboring fluid elements into heat). [For discussions of heat flow in 
special and general relativity, see Exercise 22.7. For discussion of shock waves, see 
Taub (1948), de Hoffman and Teller (1950), Israel (1960), May and White (1967), 
Zel’dovich and Rayzer (1967), Lichnerowicz (1967, 1971), and Thorne (1973a).] 
The first law of thermodynamics , in the proper reference frame of a fluid element, 
is identical to the first law in flat spacetime (“principle of equivalence”); and in 
flat spacetime the first law is merely the law of energy conservation: 


d 


( 


energy in a volume element containing 
a fixed number, A , of baryons 



<i(volume) + T ^(entropy); 


Second law of 
thermodynamics 


Shock waves and heat flow 


First law of thermodynamics 
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Pressure and temperature 
calculated from p(n, 5 ) 


i.e.. 


i.e.. 


d(pA/n) = —pd(A/n) + T d(As); 


dp = P + P dn -f nTds. 
n 

Query: what kind of a “</” appears here? For a simple fluid, the values of two 
potentials, e.g., n and 5, fix all the others uniquely; so any change in p must be 
determined uniquely by the changes in n and s. It matters not whether the changes 
are measured along the world line of a given fluid element, or in some other direction. 
Thus, the “rf” in the first law can be interpreted as an exterior derivative 

dp = P + ? dn -f nT ds ; (22.6) 

n 

and the changes along a given direction in the fluid (along a given tangent vec¬ 
tor v) can be written 

▼„P = (dp, v > = (dn, v > + nT(ds, v > 

= ^^-V v n + nTV v s. 

Equation (22.6) lends itself to interpretation in two opposite senses: as a way to 
deduce the density of mass-energy of the medium from information about pressure 
(as a function of n and s) and temperature (as a function of n and s); and conversely, 
as a way to deduce the two functions p(n, s) and T(n , 5 ) from the one function p(n , ^). 
It is natural to look at the second approach first; who does not like a strategy that 
makes an intellectual profit? Regarding p as a known (or calculable) function of 
n and 5 , one deduces from (22.6) 



and thence pressure and temperature individually, 

p(n,s) = - P, (22.7a) 

(22Jb) 

(“two equations of state from one”). The analysis simplifies still further when the 
fluid, already assumed to be everywhere of the same composition, is also everywhere 
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endowed with the same entropy per baryon, s, and is in a state of adiabatic flow 
(no shocks or heat conduction). Then the density p = p{n, s) reduces to a function 
of one variable out of which one derives everything ( p,p,p ) needed for the hydro¬ 
dynamics and the gravitation physics of the system (next chapter). Other choices 
of the “primary thermodynamic potential” are appropriate under other circum¬ 
stances (see Box 22.1). 

If differentiation leads from pin, s) to p(n, s ) and T(n, s), it does not follow that 
one can take any two functions p{n, s) and T(n, s) and proceed “backwards” (by 
integration) to the “primary function”, p(n, s). To be compatible with the first law 
of thermodynamics (22.6), the two functions must satisfy the consistency requirement 
[“Maxwell relation”; equality of second partial derivatives of p] Maxwell relation 

(dp/ds) n = n 2 (dT/dn) s . (22.7c) 


--- 

Box 22.1 PRINCIPAL ALTERNATIVES FOR “PRIMARY THERMODYNAMIC POTENTIAL" 
TO DESCRIBE A FLUID 


Primary thermodynamic potential 
and quantities on which it is most 
appropriately envisaged to depend 


“Density”; total amount of mass- 
energy (rest + thermal + • • •) per 
unit volume 

P = p(n, s ) 


“Secondary” thermodynamic quanti¬ 
ties obtained by differentiation of 
primary with or without use of 

d ti ) + pd ( i )- Tds=0 


Conditions under which convenient, 
appropriate, and relevant 


p{n, s) — n 



- P 


T(n t s ) 
p(n, s) 



P + P _ /9p\ 
n v9/t/ s 


Conditions of adiabatic flow (no 
shocks or heat conduction), so‘that s 
stays constant along streamline 


“Physical free energy” 

a{n, D = - - Ts 
n 

*"• 71 *"'(£), 

71 = -(#). 

Know or can calculate a (or the 
“sum over states” of statistical me¬ 
chanics) for conditions of specified 
volume per baryon and temperature 

“Chemical free energy” 

fip, T) = -i±Z - Ts 
n 

1 /nip, T ) = (df/dp) T 

sip, T) = -(df/BDp 

, „ /- ro//877„ 

p(p> n = -— rA—~ ’’-p 
(Bf/dp) T 

Relevant for determining equilibrium 
when pressure and temperature are 
specified 


“Chemical potential” (“energy to in¬ 
ject” expressed on a “per baryon” 
basis) 


Kp, s) 


p + p 


l/n(p, s) = (3/ u/dp), 
T{p, s) = (3j u/ds) p 


p(P . *) = 


M _ 
(Bp/dp), P 


When injection energy [= Fermi en¬ 
ergy for an ideal Fermi gas, relativis¬ 
tic or not; see exercise 22.3] is the 
center of attention 


n 
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Chemical potential equals 
"injection energy" at 
fixed entropy per baryon and 
total volume 


Laws of hydrodynamics for 
simple fluid without heat flow 
or viscosity; 


The chemical potential jut is also a unique function of n and s. It is defined as 
follows. (1) Take a sample of the simple fluid in a fixed thermodynamic state (fixed 
n and s ). (2) Take, separately, a much smaller sample of the same fluid, containing 
8A baryons in the same thermodynamic state as the large sample (same n and j). 
(3) Inject the smaller sample into the larger one, holding the volume of the large 
sample fixed during the injection process. (4) The total mass-energy injected, 

^injected = P X (volume of injected fluid) = p(8A/n\ 
plus the work required to perform the injection 


$iy _ / wor k done against pressure of large sampleX 

injection y to open up space in it for the injected fluid/ 

= /?(volume of injected fluid) = p(8A/n), 


is equal to j ul8A: 


M ^ ^injected A~ & W injection 


P+P 


8A. 


Stated more briefly: " 

( total mass-energy required, per baryon, to “create” and 
inject a small additional amount of fluid into a given 
sample, without changing s or volume of the sample 

_ p + p _ (OlL\ 

~ 71 JL \ d ”'s 

L[by first law of thermodynamics (22.6)] 

All the above laws and equations of thermodynamics are the same in curved 
spacetime as in flat spacetime; and the same in (relativistic) flat spacetime as in 
classical nonrelativistic thermodynamics—except for the inclusion of rest mass, 
together with all other forms of mass-energy, in p and jut. The reason is simple: the 
laws are all formulated as scalar equations linking thermodynamic variables that 
one measures in the rest frame of the fluid. 



§22.3. HYDRODYNAMICS IN CURVED SPACETIME* 

A simple perfect fluid flows through spacetime. It might be the Earth’s atmosphere 
circulating in the Earth’s gravitational field. It might be the gaseous interior of the 
Sun at rest in its own gravitational field. It might be interstellar gas accreting on¬ 
to a black hole. But whatever and wherever the fluid may be, its motion will be 
governed by the curved-spacetime laws of thermodynamics (§22.2) plus the local 

*For more detailed treatments of this subject see, e.g., Ehlers (1961), Taub (1971), Ellis (1971), 
Lichnerowicz (1967), Cattaneo (1971), and references cited therein; see also the references on kinetic 
theory cited at the beginning of §22.6. 
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law of energy-momentum conservation, Y • T = 0. The chief objective of this section 
is to reduce the equation V • T = 0 to usable form. The reduction will be performed 
in the text using abstract notation; the reader is encouraged to repeat the reduction 
using index notation. 

The stress-energy tensor for a perfect fluid, in curved spacetime as in flat (equival¬ 
ence principle!), is 

T = (p + p)u ® u + pg. (22.9) 

(See §5.5.) Its divergence is readily calculated using the chain rule; using the “com¬ 
patibility relation between g and Y” Yg = 0; using the identity (Yp) - g -Yp 
(which one readily verifies in index notation); and using 

0 = Y • T = [V(p + p) • u]u + [(p + p) Y • u)u + [(p + p)u] • Yu + ( Yp ) • g 
^-[divergence on first slot] 

= l v «,P + ▼„/» + (p + p)Y • u]u + (p + p) Y u u + Yp. (22.10) 

The component of this equation along the 4-velocity is especially simple (recall 
that u • Y u u — | Y u u 2 -- 0 because u 2 = — 1): 


0 = u • (▼ • D = -[▼„*> + Y u p + (p + p) ▼ • If] + Y uP 
= -▼uP - (p +/»)▼•«. 


Combine this with the equation of baryon conservation (22.3); the result is 


dp _ (p + p) dn 
dr n dr 


(22.11a) 


Notice that this is identical to the first law of thermodynamics (22.6) applied along 
a flow line, plus the assumption that the entropy per baryon is conserved along a 
flow line 

ds/dr = 0. (22.11b) 


There is no reason for surprise at this result. To insist on thermodynamic equilibrium 
and to demand that the entropy remain constant is to require zero exchange of heat 
between one element of the fluid and another. But the stress-energy tensor (22.9) 
recognizes that heat exchange is absent. Any heat exchange would show up as an 
energy flux term in T (Ex. 22.7); but no such term is present. Consequently, when 
one studies local energy conservation by evaluating u • (▼ • T) = 0, the stress-energy 
tensor reports that no heat flow is occurring—i.e. that ds/dr ~ 0. 

Three components of V • T = 0 remain: the components orthogonal to the fluid’s 
4-velocity. One can pluck them out of V • T = 0, leaving behind the component 
along i/, by use of the “projection tensor” 


(1) Origins of laws 


(2) Local energy 
conservation: adiabaticity 
flow 


P = g -b u ® u. 


( 22 . 12 ) 




Box 22.2 THERMODYNAMICS AND HYDRODYNAMICS FOR A SIMPLE 
PERFECT FLUID IN CURVED SPACETIME 


'\ 


A. Ten Quantities Characterize the Fluid 

Thermodynamic potentials all measured in rest 
frame 

baryon number density 
p, density of total mass-energy 
/?, pressure 
T y temperature 
s , entropy per baryon 
p, chemical potential per baryon 
Four components of the fluid 4-velocity 

B. Ten Equations Govern 
the Fluid's Motion 

Two equations of state 

p = p(n, s), T = T(n, s) (1), (2) 

subject to the compatibility constraint (“Max¬ 
well relation,” which follows from first law of 
thermodynamics) 

(dp/ds) n = n\dT/dn\, 

First law of thermodynamics 

dp = P — dn + nTds , (3) 

n 

which can be integrated to give p(n , s). 

___ 


Equation for chemical potential 

M = (p + p)/n, (4) 

which can be combined with p(n, s) and p(n, 5 ) 
to give p(n, 5 ). 

Law of baryon conservation 

dn/dr = n = — n V • u. (5) 

Conservation of energy along flow lines, which 
(assuming no energy exchange between adja¬ 
cent fluid elements) means “adiabatic flow ” 

ds/dr = 0 except in shock waves, where 

ds/dr > 0. (6) 

[Shock waves are not treated in this book; see 
Taub (1948), de Hoffman and Teller (1950), 
Israel (I960), May and White (1967), ZePdo- 
vich and Rayzer (1967); Lichnerowicz 
(1967, 1971); and Thorne (1973a).] 

Euler equations 

(P + p) y„i# = ~(g + U ® u) • Vp, 

(7), (8), (9) 

which determine the flow lines to which u is 
tangent. 

Normalization of 4-velocity 

I# • <f = —1. (10) 


(3) Euler equation 


(See exercise 22.4.) Contracting P with V ■ T = 0 [equation (22.10)] gives 

(p + p) ▼ u =-*•(▼/>)= -[Vp + (V)«]. (22.13) 


This is the (t Euler equation” of relativistic hydrodynamics. It has precisely the same 
form as the corresponding flat-spacetime Euler equation: 


inertial mass 
per unit volume 
[exercise 5.4] 



4-acceleration 
of fluid 



pressure gradient 
in the 3-surface 
orthogonal to 4-velocity 


(22.13') 


The pressure gradient, not “gravity,” is responsible for all deviation of flow lines 
from geodesics. 

Box 22.2 reorganizes and summarizes the above laws of thermodynamics and 
hydrodynamics. 
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Exercise 22.1. DIVERGENCE OF FLOW LINES PRODUCES VOLUME CHANGES 

Derive the equation dV/dr = (▼ • u)V [equation (22.2)] for the rate of change of volume 
of a fluid element. [Hint: Pick an event & Qi and calculate in a local Lorentz frame at 9 Q 
which momentarily moves with the fluid (“rest frame at & 0 ”).] [Solution: At events near 
the fluid has a very small ordinary velocity v> = dx j /dt. Consequently a cube of fluid 
at &Q with edges Ax — Ay = Az = L changes its edges, after time 8t , by the amounts 

5(dx) = [(dx/dt) <5*] at “frontface” “ VAx/dt) &] a t“back face” 

= (dv*/dx)L 8t , 

8(Ay) = (dv*/dy)L 8t, 

8(Az) = (dv*/dz)L 8t. 

The corresponding change in volume is 

5(dx Ay Az) = (dvi/dx^L 3 5/; 
so the rate of change of volume is 


dV/dt = V(du’/dx>). 


But in the local Lorentz rest frame at and near £P 0 (where x a = 0), the metric coefficients 
are Zilv = + 0(|x a | 2 ), and the ordinary velocity is v> = 0(|x a |); so 


_ UL _ 


di_ 

dr 


dt 


, dx^ dx v ) 1 


= 1 + 0(|jc«| 2 ), 


dx j 

dr 


= v* + 0(|x a | 3 ). 


Thus, the derivatives dV/dt and V(du j /dx j ) at & 0 are 


dV/dt = u a dV/dx a = u a V a = dV/dr 

= V(dvi/dx 0 = V(du a /dx a ) = Vu a :a = V(V • u). Q.E.D.] 

[Note that by working in flat spacetime, one could have inferred more easily that dV/dt = 
dV/dr and dv i /dx i = ▼*«/; one would then have concluded dV/dr = (▼ • u)V; and one 
could have invoked the equivalence principle to move this law into curved spacetime.] 

Exercise 22.2. EQUATION OF CONTINUITY 

Show that in the nonrelativistic limit in flat spacetime the equation of baryon conservation 
(22.3) becomes the “equation of continuity” 


dn 

dt 




Exercise 22.3. CHEMICAL POTENTIAL FOR IDEAL FERMI GAS 

Show that the chemical potential of an ideal Fermi gas, nonrelativistic or relativistic, is (at 
zero temperature) equal to the Fermi energy (energy of highest occupied momentum state) 
of that gas. 

Exercise 22.4. PROJECTION TENSORS 

Show that contraction of a tangent vector B with the “projection tensor” P = g + u & u 
projects B into the 3-surface orthogonal to the 4-velocity vector u. [Hint: perform the 


EXERCISES 
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calculation in an orthonormal frame with = u, and write B = B a e* a : then show that 
P • B = B'e-.] If n is a unit spacelike vector, show that P = g — n ® n is the corresponding 
projection operator. Note: There is no unique concept of “the projection orthogonal to a 
null vector.” Why? [Hint: draw pictures in flat spacetime suppressing one spatial dimension,] 

Exercise 22.5. PRESSURE GRADIENT IN STATIONARY GRAVITATIONAL FIELD 

A perfect fluid is at rest (flow lines have = constant) in a stationary gravitational field 
(metric coefficients are independent of a* 0 ). Show that the pressure gradient required to 
“support the fluid against gravity” (i.e., to make its flow lines be x> = constant instead of 
geodesics) is 

<“•»> 


Evaluate this pressure gradient in the Newtonian limit, using the coordinate system and metric 
coefficients of equation (18.15c). 

Exercise 22.6. EXPANSION, ROTATION, AND SHEAR 

Let a field of fluid 4-velocities u(&) be given. 

(a) Show that ▼*/ can be decomposed in the following manner: 

“«;/» = “a /3 + o af3 + J 0P a p - a a iip, (22.15a) 

where a is the 4-acceleration of the fluid 


a a = u a . p u^, 

6 is the “expansion ” of the fluid world lines 


6 = ▼•«/ = «%, 

A/a is the projection tensor 

Pap = Safi + U a U /3 » 


(22.15b) » 


(22.15c) 

(22.15d) 


° a /3 shear tensor of the fluid 

°a(S (22.15e) 

and co a p is the rotation 2-form of the fluid 

(22.15f) 

(b) Each of the component parts of this decomposition has a simple physical interpretation 
in the local rest frames of the fluid. The interpretation of the 4-acceleration a in terms of 
accelerometer readings should be familiar. Exercise 22.1 showed that the expansion# = ▼ * u 
describes the rate of increase of the volume of a fluid element, 

6 = (1 /V)(dV/dr). (22.15g) 

Exercise 22.4 explored the meaning and use of the projection tensor P. Verify that in a local 
Lorentz frame ( g^ — T ] aj3 , r a #y = 0) momentarily moving with the fluid ( u a = S a 0 ), a 
and co^ reduce to the classical (nonrelativistic) shear and rotation of the fluid. [See, e.g., 
§§2.4 and 2.5 of Ellis (1971) for both classical and relativistic descriptions of shear and 
rotation.] 
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Exercise 22.7. HYDRODYNAMICS WITH VISCOSITY AND HEAT FLOW.* 

(a) In §15 of Landau and Lifshitz (1959), one finds an analysis of viscous stresses for a 
classical (nonrelativistic) fluid. By carrying that analysis over directly to the local Lorentz 
rest frame of a relativistic fluid, and by then generalizing to frame-independent language, 
show that the contribution of viscosity to the stress-energy tensor is 

r (visc) = — 2tjo’ - £6P, (22.16a) 


where r\ > 0 is the “coefficient of dynamic viscosity ? > 0 is the “coefficient of bulk viscosity 
and a, 9 , P are the shear, expansion, and projection tensor of the fluid. 

(b) An idealized description of heat flow in a fluid introduces the heat-flux 4-vector q 
with components in the local rest-frame of the fluid, 


( energy per unit time crossing unit 
surface perpendicular to ej 


(22.16b) 


By generalizing from the fluid rest frame to frame-independent language, show that the 
contribution of heat flux to the stress-energy tensor is 


T (heat) = i/®g + qr®£/. 


(22.16c) 


Thereby conclude that, in this idealized picture, the stress-energy tensor for a viscous fluid 
with heat conduction is 

T a P = pu a u 0 + (p - $$)P af3 - 2t io a P + q a u$ + u a qP. (22.16d) 


f (c) Define the entropy 4-vector s by 

s == nsu + q/T. 


(22.16e) 


By calculations in the local rest-frame of the fluid, show that 




( rate of increase of entropy \ /rate at which heat and fluid 

in a unit volume / \carry entropy into a unit volume 

( rate at which entropy is being\ 
generated in a unit volume / 


) 

(22.16f) 


Thereby arrive at the following form of the second law of thermodynamics: 

▼ •s >0. 


(22.16g) 


(d) Calculate the law of local energy conservation, u- V • T = 0, for a viscous fluid with 
heat flow. Combine with the first law of thermodynamics and with the law of baryon 
conservation to obtain 


TV-s = 0* + 2t- q a {TJT+ a a ). (22.16h) 

Interpret each term of this equation as a contribution to entropy generation ( example: 
2r)o a/3 o a P describes entropy generation by viscous heating). [Note: The term q a a a is rela¬ 
tivistic in origin. It is associated with the inertia of the flowing heat.] 

(e) When one takes account of the inertia of the flowing heat, one obtains the following 
generalization of the classical law of heat conduction: 

+ Tap (22.16i) 


* Exercise supplied by John M. Stewart. 
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Electric and magnetic fields 


Maxwell equations and 
Lorentz force law 


(Eckart 1940). Here k is the coefficient of thermal conductivity. Use this equation to show 
that, for a fluid at rest in a stationary gravitational field (Exercise 22.5), 


q 0 = 0, q, = - ( T \f--goo) j- (22.16j) 

V-goo " - - 

[Thus, thermal equilibrium corresponds not to constant temperature, but to the redshifted 
temperature distribution Ty/ —g 00 = constant; Tolman (1934a), p. 313.] Also, use the ideal¬ 
ized law of heat conduction (22.16i) to reexpress the rate of entropy generation as 


T V • 5 = $6 2 + 2no afi o* + (K/T)P*P(T a + Ta a )(T fi + Taf) 
> 0. 


(22.16k) 


[For further details about heat flow and for discussions of the limitations of the above 
idealized description, see e.g., §4.18 of Ehlers (1971); also Marie (1969), Anderson (1970), 
Stewart (1971), and papers cited therein.] 


§22.4. ELECTRODYNAMICS IN CURVED SPACETIME 

In a local Lorentz frame in the presence of gravity, an observer can measure the 
electric and magnetic fields E and B using the usual Lorentz force law for charged 
particles. As in special relativity, he can regard E and B as components of an 
electromagnetic field tensor, 

fOj _ _ £), fjk — 

he can regard the charge and current densities as components of a 4-vector J a , and 
he can write Maxwell's equations and the Lorentz force equation in the special 
relativistic form, 



In any other frame these equations will have the same form, but with commas 
replaced by semicolons 

(22.17a) 

+ Fp y;a + F yC r,0 ~ (22.17b) 

ma a ~ F a @qup. (22.17c) 


These are the basic equations of electrodynamics in the presence of gravity. From 
them follows everything else. For example, as in special relativity, so also here 
(exercise 22.9), they imply the equation of charge conservation 


Charge conservation 


= 0; 


(22.18a) 
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and for an electromagnetic field interacting with charged matter (exercise 22.10) they 
imply vanishing divergence for the sum of the stress-energy tensors 

^jr(EM)aft ^(MATTER)** __ Q (22 18b) 


Local conservation of 
energy-momentum 


As in special relativity, so also here, one can introduce a vector potential A Vector potential 
Replacing commas by semicolons in the usual special-relativistic expression for F^ v 
in terms of A one obtains 

F nv = A vin - (22.19a) 


If all is well, this equation should guarantee (as in special relativity) that the Maxwell 
equations (22.17b) are satisfied. Indeed, it does, as one sees in exercise 22.8. To derive 
the wave equation that governs the vector potential, insert expression (22.19a) into 
the remaining Maxwell equations (22.17a), obtaining 


-A*'^ + A^p = 4t tJ«\ (22.19b) 

then commute covariant derivatives in the first term using the identity (16.6c), to 
obtain 


- A + A*.J a + R^A* = 4t tJ*. (22.19b') 


Finally, adopting the standard approach of special relativity, impose the Lorentz 
gauge condition 

A^.p = 0, (22.19c) Lorentz gauge condition 


thereby bringing the wave equation (22.19b') into the form 


(4 m A T = - Aa ' p n + Ra p AP = 4 ^“ 


(22.19d) 


Wave equation for vector 
potential 


The “de Rham vector wave operator” A which appears here is, apart from sign, 
a generalized d’Alambertian for vectors in curved spacetime. Mathematically it is 
more powerful than —A a '^.^ and than any other operator that reduces to (minus) 
the d’Alambertian in special relativity. [For a discussion, see de Rham (1955).] 
Although the electrodynamic equations (22.17a)-(22.19b) are all obtained from 
special relativity by the comma-goes-to-semicolon rule, the wave equation (22.19d) 
for the vector potential is not (“curvature coupling”; see Box 16.1). Nevertheless, 
when spacetime is flat (so R a ^ = 0), (22,19d) does reduce to the usual wave equation 
of special relativity. 


Exercise 22.8. THE VECTOR POTENTIAL FOR ELECTRODYNAMICS EXERCISES 

Show that in any coordinate frame the connection coefficients cancel out of both equations 
(22.19a) and (22.17b), so they can be written 

F — A —A 

* (iv fi, v ' 

F afi,y F f$y ,a T F ya,/3 


(22.20a) 

(22.20b) 
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(In the language of differential forms these equations are F = dA, dF = 0.) Then use this 
form of the equations to show that equation (22.19a) implies equation (22.17b), as asserted 
in the text. 

Exercise 22.9. CHARGE CONSERVATION IN THE PRESENCE OF GRAVITY 

Show that Maxwell's equations (22.17a,b) imply the equation of charge conservation (22.18a) 
when gravity is present, just as they do in special relativity theory. [Hints: Use the antisym¬ 
metry of F a &\ and bew'are of the noncommutation of the covariant derivatives, which must 
be handled using equations (16.6). Alternatively, show that in coordinate frames, equation 
(22.17a) can be written as 




(22.17a') 


and (22.18a) as 


<2u8a '’ 

and carry out the demonstration in a coordinate frame.) 

Exercise 22.10. INTERACTING ELECTROMAGNETIC FIELD 
AND CHARGED MATTER 

As in special relativity, so also in the presence of gravity (“equivalence principle”), the 
stress-energy tensor for an electromagnetic field is 

r<EM U = -k(w - J WgJ)' (22.21) 

Use Maxwell’s equations (22.17a,b) in the presence of gravity to show that 

( 22 . 22 ) 

But F a PJp is just the Lorentz 4-force per unit volume with which the electromagnetic field 
acts on the charged matter [see the Lorentz force equation (22.17c); also equation (5.43)]; 
i.e., it is 7 ( matter)«/J ;/ j . Consequently, the above equation can be rewritten in the form 
(22.18b) cited in the text. 


§22.5. GEOMETRIC OPTICS IN CURVED SPACETIME* 

Radio waves from the quasar 3C279 pass near the sun and get deflected by its 
gravitational field. Light rays emitted by newborn galaxies long ago and far away 
propagate through the cosmologically curved spacetime of the universe, and get 
focused (and redshifted) producing curvature-enlarged (but dim) images of the 
galaxies on the Earth’s sky. 


•Based in part on notes prepared by William L. Burke at Caltech in 1968. For more detailed treatments 
of geometric optics in curved spacetime, see, e.g., Sachs (1961), Jordan, Ehlers, and Sachs (1961), and 
Robinson (1961); also references discussed and listed in §41.11. 
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These and most other instances of the propagation of light and radio waves are 
subject to the laws of geometric optics. This section derives those laws, in curved 
spacetime, from Maxwell’s equations. 

The fundamental laws of geometric optics are: (1) light rays are null geodesics; 
(2) the polarization vector is perpendicular to the rays and is parallel-propagated 
along the rays; and (3) the amplitude is governed by an adiabatic invariant which, 
in quantum language, states that the number of photons is conserved. 

The conditions under which these laws hold are defined by conditions on three 
lengths: (1) the typical reduced wavelength of the waves, 


A 



( 


“classical distance of closest approach for 
a photon with one unit of angular momentum” 


), (22.23a) 


as measured in a typical local Lorentz frame (e.g., a frame at rest relative to nearby 
galaxies); (2) the typical length £ over which the amplitude, polarization, and 
wavelength of the waves vary, e.g., the radius of curvature of a wave front, or the 
length of a wave packet produced by a sudden outburst in a quasar; (3) the typical 
radius of curvature 6L of the spacetime through which the waves propagate, 


S = 


typical component of Riemann as measured 
in typical local Lorentz frame 


- 1/2 


(22.23 b) 


Geometric optics is valid whenever the reduced wavelength is very short compared 
to each of the other scales present, 


A < £ and A < $, (22.23c) 

so that the waves can be regarded locally as plane waves propagating through 
spacetime of negligible curvature. 

Mathematically one exploits the geometric-optics assumption, A < £ and A < $, 
as follows. Focus attention on waves that are highly monochromatic over regions 
<£. (More complex spectra can be analyzed by superposition, i.e., by Fourier 
analysis.) Split the vector potential of electromagnetic theory into a rapidly changing, 
real phase, 


9 (distance propagated)/A, 


and a slowly changing, complex amplitude (i.e. one with real and imaginary parts), 
A = Real part of{amplitude X e i$ } = ^{amplitude x e i$ }. 


Imagine holding fixed the scale of the amplitude variation, £, and the scale of the 
spacetime curvature, t while making the reduced wavelength, A, shorter and shorter. 
The phase will get larger and larger (i 9 cc 1/A) at any fixed event in spacetime, but 
the amplitude as a function of location in spacetime can remain virtually unchanged, 


Amplitude = 


"dominant part, " "small corrections (deviations from 
.independent of AJ Lgeometric optics) due to finite wavelength 


Overview of geometric optics 


Conditions for validity of 
geometric optics 


The "two-length-scale" 
expansion underlying 
geometric optics 
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The vector potential in 
geometric optics 


This circumstance allows one to expand the amplitude in powers of A:* 


Amplitude = < 

a + b + c 
. V | 

T independent] 

4 ccA] L 

Lof A ] 



[Actually, the expansion proceeds in powers of the dimensionless number 

^/(minimum of £ and $) == X/L . (22.24) 

Applied mathematicians call this a “two-length-scale expansion”; see, e.g., Cole 
(1968). The basic short-wavelength approximation here has a long history; see, e.g., 
Liouville (1837), Rayleigh (1912). Following a suggestion of Debye, it was applied 
to Maxwell’s equations by Sommerfeld and Runge (1911). It is familiar as the WKB 
approximation in quantum mechanics, and has many other applications as indicated 
by the bibliography in Keller, Lewis, and Seckler (1956). The contribution of higher 
order terms is considered by Kline (1954) and Lewis (1958). See especially the book 
of Froman and Froman (1965).] 

It is useful to introduce a parameter e that keeps track of how rapidly various 
terms approach zero (or infinity) as X/L approaches zero: 


A M = &{(a M + eb M + • • -)e i$/e }. (22.25) 

Any term with a factor e n in front of it varies as ( X/L) n in the limit of very small 
wavelengths [8 cc (A/L)” 1 ; cc (X/L) 2 ; etc.]. By convention, e is a dummy expan¬ 
sion parameter with eventual value unity; so it can be dropped from the calculations 
when it ceases to be useful. And by convention, all “post-geometric-optics correc¬ 
tions” are put into the amplitude terms />, c,.. . ; none are put into 0. 

Note that, while the phase 8 is a real function of position in spacetime, the 
amplitude and hence the vectors a, b, c, ... are complex. For example, to describe 
monochromatic waves with righthand circular polarization, propagating in the z 
direction, one could set 8 = co(z — t) and a = \/y/2a(e x q~ ie y ) with a real; so 

A = ft |c(e, q- /e y )e 4wUr_i) | = —^a(cos [co(z — t))e x — sin [co(z — t)]e y ) 


The assumed form (22.25) for the vector potential is the mathematical foundation 
of geometric optics. All the key equations of geometric optics result from inserting 
this vector potential into the source-free wave equation AA = 0 [equation (22.19d)] 
and into the Lorentz gauge condition V - A — 0 [equation (22.19c)]. The resulting 
Basic concepts of geometric equations (derived below) take their simplest form only when expressed in terms 
optics: of the following: 


*The equations for A are linear. Therefore the analysis would proceed equally well assuming, instead 
of an amplitude independent of A, a dominant term a cc A B , with b cc A n+1 , c cc A n+2 , etc. The results 
are independent of n. Choosing n = 1 would give field strengths and energy densities 7^„ cc F 2 cc 
A 2 /X 2 oc constant as X —0. 
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“wave vector ” k = V0; (22.26a) 

“scalar amplitude,” a = (a • a) 1/2 = (a^a^' 2 -, (2226b) 

“polarization vector,” f = a/a = “unit complex vector along a”. (22.26c) 

(Here a is the complex conjugate of a.) Light rays are defined to be the curves $P(A) 
normal to surfaces of constant phase 6. Since k = V0 is the normal to these surfaces, 
the differential equation for a fight ray is 

= rW, ,(*)■ (22.26d) 


Box 22.3, appropriate for study at this point, shows the polarization vector, wave 
vector, surfaces of constant phase, and fight rays for a propagating wave; the scalar 
amplitude, not shown there, merely tells the length of the vector amplitude a. Insight 
into the complex polarization vector, if not familiar from electrodynamics, can be 
developed later in Exercise 22.12. 

So much for the foundations. Now for the calculations. First insert the geometric- 
optics vector potential (22.25) into the Lorentz gauge condition: 


0 = A*., = £ { j + eb» + • • •) + (a* + eb* + . (22.27) 


The leading term (order 1/e) says 

k • a = 0 ( amplitude is perpendicular to wave vector ); 


(22.28) 


or, equivalently 

k • f = 0 ( polarization is perpendicular to wave vector). (22.28') 


The post-geometric-optics breakdown in this orthogonality condition is governed by 
the higher-order terms [0(1), 0(e), 0(e 2 ),...] in the gauge condition (22.27); for 
example, the 0(1) terms say 


k-b = i ▼ a. 


Next insert the vector potential (22.25) into the source-free wave equation (22.19d): 
0 = (A dR A)« = 


= £ 


-^kPkpia* + eb a + e 2 c a + •..) - 2 ± kP(a« + eb a +•••)• 


L£ 




- ~ k<i -,^ aa + eba + •■•)-(«“+ • • •) ; % + Ra ff( a0 + • • •) C is/e ] • (22.29) 


Collect terms of order 1/e 2 and 1/e (terms of order higher than 1/e govern post- 
geometric-optics corrections): 


(1) wave vector 

(2) scalar amplitude 

(3) polarization vector 

(4) light rays 


Derivation of laws of 
geometric optics 


{continued on page 576) 



574 


22. THERMODYNAMICS, HYDRODYNAMICS.AND KINETIC THEORY 


f 

Box 22.3 GEOMETRY OF AIM ELECTROMAGNETIC WAVE TRAIN 



The drawing shows surfaces of constant phase, 8 — constant, emerging through the 
“surface of simultaneity”, t — 0, of a local Lorentz frame. The surfaces shown are 
alternately “crests” (8 ~ 1764tt, 8 — 176677, . . .) and “troughs” (8 = 1765*77, 8 = 
176777,...) of the wave train. These surfaces make up a 1-form, k = d8. The 
“corresponding vector” k = V8 is the “wave vector” The wave vector is null, 
k • k = 0, according to Maxwell’s equations [equation (22.30)]. Therefore it lies in 
a surface of constant phase: 


( number of surfaces\ 
pierced by k 


j = (d8,k) = <£,*> = k-k = 0. 


But not only does it he in a surface of constant phase; it is also perpendicular to 
that surface! Any vector v in that surface must satisfy k * v — (k, v) — {d8 , i^) = 0 
because it pierces no surfaces. 

Geometric optics assumes that the reduced wavelength A, as measured in a typical 
local Lorentz frame, is small compared to the scale £ of inhomogeneities in the wave 
train and small compared to the radius of curvature of spacetime, Thus, over 
regions much larger than A but smaller than £ or the waves are plane-fronted 
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and monochromatic, and there exist Lorentz reference frames (Riemann normal 
coordinates). In one of these “extended” local Lorentz frames, the phase must be 

8 = k a x a -f constant; 

no other expression will yield V0 = /c. The corresponding vector potential [equation 
(22.25)] will be 

A* — exp[/(A:- x — /c 0 /)]} + (“post-geometric-optics corrections”); 

hence, 


lc° = 2tf/(period of wave) = Inr = co = (angular frequency), 

\k\ = 2fly'(wavelength of wave) = 1 /A — co, 
k points along direction of propagation of wave. 

At each event in spacetime there is a wave vector; and these wave vectors, tacked 
end-on-end, form a family of curves—the “light rays” or simply “rays”—whose 
tangent vector is k. The rays, like their tangent vector, lie both in and perpendicular 
to the surfaces of constant phase. 

The affine parameter A of a ray (not to be confused with wavelength = 2nA) 
satisfies k — d/dX ; therefore it is given by 

A = t/k° + constant = t/ co -f constant, 

where t is proper time along the ray as measured, not by the ray itself (its proper 
time is zero!), but by the local Lorentz observer who sees angular frequency co. Thus, 
while co is a frame-dependent quantity and t is also a frame-dependent quantity, 
their quotient t/oo when measured along the ray {not off the ray) is the frame-inde¬ 
pendent affine parameter. For a particle it is possible and natural to identify the 
affine parameter A with proper time t. For a light ray this identification is unnatural 
and impossible. The lapse of proper time along the ray is identically zero. The 
springing up of A to take the place of the vanished t gives one a tool to do what 
one might not have suspected to be possible. Given a light ray shot out at event 
(t and passing through event $, one can give a third event 6 along the same null 
world line that is twice as “far” from (I as SB is “far,” in a new sense of “far” that 
has nothing whatever directly to do with proper time (zero!), but is defined by equal 
increments of the affine parameter (A 6 , — A„ ? = A.^ — A^). The “affine parameter” 
has a meaning for any null geodesic analyzed even in isolation. In this respect, it 
is to be distinguished from the so-called “luminosity distance” which is sometimes 
introduced in dealing with the propagation of radiation through curved spacetime, 
and which is defined by the spreading apart of two or more light rays coming from 
a common source. 

Maxwell’s equations as explored in the text [equation (22.28')] guarantee that 
the complex polarization vector f is perpendicular to the wave vector k and that, 
therefore, it lies in a surface of constant phase (see drawing). Intuition into the 
polarization vector is developed in exercise 22.12. 
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The main laws of geometric 
optics: 


(1) Light rays are null 
geodesics 



kPkpa* = 0 

=*- k • k = 0 (wave vector is null)'. 


(22.30) 



ktkp b* - 2i(k p a a .p + jke. p a*) = 0 

k= 0 ] 




■y(V • k)a (propagation equation for vector amplitude). 


(22.31) 


These equations (22.30, 22.31) together with equation (22.28) are the basis from 
which all subsequent results will follow. As a first consequence, one can obtain the 
geodesic law from equation (22.30). Form the gradient of k • k = 0, 

0 = (k p kp). a = 2 kf>kp. a , 

and use the fact that k p = 0 p is the gradient of a scalar to interchange indices, 

e M3a = 6 ;a0 or 

0 = k% ;tt = k*k' itt . 

The result is 


Y k k = 0 (propagation equation for wave vector). (22.32) 

Notice that this is the geodesic equation! Combined with equation (22.30), it is the 
statement, derived from Maxwell’s equations in curved spacetime, that light rays 
are null geodesics , the first main result of geometric optics. 

Turn now from the propagation vector k — V0 to the wave amplitude a = af, 
and obtain separate equations for the magnitude a and polarization f. Use equation 
(22.31) to compute 

2 a d k a = 2 a ^7 k a — V k a 2 = \(a • a) = a • \a + a*V^a 
— k)(a • a + a • a) = — a 2 V • k\ 


so 




k)a (propagation equation for scalar amplitude). (22.33) 


Next write a — af in equation (22.31) to obtain 

0 = ▼ k (af) + !(▼ • k)af = aV k f + f + y (▼ • k)aj = a\f 
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or 


Y k f = 0 ( propagation equation for polarization vector). (22.34) 

This together with equation (22280, constitutes the second main result of geometric 
optics, that the polarization vector is perpendicular to the rays and is parallel-propa¬ 
gated along the rays. It is now possible to see that these results, derived from 
equations (22.30) and (22.31) are consistent with the gauge condition (22.28). The 
vectors k and f, specified at one point, are fixed along the entire ray by their 
propagation equations. But because both propagation equations are parallel-tran¬ 
sport laws, the conditions k • k = 0, f • f = 1, and k • f = 0, once imposed on the 
vectors at one point, will be satisfied along the entire ray. 

The equation (22.33) for the scalar amplitude can be reformulated as a conserva¬ 
tion law. Since d k = (k • ▼), one rewrites the equation as (k * V)a 2 + a 2 V • k = 0, 
or 


V • (a 2 k) = 0. (22.35) 

Consequently the vector a 2 k is a “conserved current,” and the integral fa 2 k M d 3 l 
has a fixed, unchanging value for each 3-volume cutting a given tube formed of 
light rays. (The tube must be so formed of rays that an integral of a 2 k over the 
walls of the tube will give zero.) What is conserved? To remain purely classical, one 
could say it is the “number of light rays” and call a 2 k° the “density of light rays” 
on an x° = constant hypersurface. But the proper correspondence and more concrete 
physical interpretation make one prefer to call equation (22.35) the law of conserva¬ 
tion of photon number. It is the third main result of geometric optics. Photon number, 
of course, is not always conserved; it is an adiabatic invariant, a quantity that is 
not changed by influences (e.g., spacetime curvature, — \/&l 2 ) which change slowly 
($ > A) compared to the photon frequency. 

Box 22.4 summarizes the above equations of geometric optics, along with others 
derived in the exercises. 


Exercise 22.11. ELECTROMAGNETIC FIELD AND STRESS ENERGY 

Derive the equations given in part D of Box 22.4 for F ' E , B , and T. 


Exercise 22.12. POLARIZATION 

At an event L f 0 through which geometric-optics waves are passing, introduce a local Lorentz 
frame with r-axis along the direction of propagation. Then k = w(e 0 + e z ). Since the 
polarization vector is orthogonal to k , it is f = /°(e 0 + e v ) + f l e x + f 2 e y ; and since 
f-f=l,ithas|/ 1 | 2 + |/*| 2 =l. 

(a) Show that the component/° of the polarization vector has no influence on the electric 
and magnetic fields measured in the given frame; i.e., show that one can add a multiple 
of k to f without affecting any physical measurements. 


(2) polarization vector is 
perpendicular to ray and is 
parallel propagated along ray 


(3) conservation of "photon 
number" 


EXERCISES 


{continued on page 581) 
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Box 22.4 GEOMETRIC OPTICS IN CURVED SPACETIME 

(Summary of Results Derived in Text and Exercises) 

A. Geometric Optics Assumption 

Electromagnetic waves propagating in a source-free region of spacetime are locally 
plane-fronted and monochromatic (reduced wavelength A scale 6 over which 
amplitude, wavelength, or polarization vary; and A l 3L = mean radius of curva¬ 
ture of spacetime). 

B. Rays, Phase, and Wave Vector (see Box 22.3) 

Everything (amplitude, polarization, energy, etc.) is transported along rays; and 
the quantities on one ray do not influence the quantities on any other ray. 

The rays are null geodesics of curved spacetime, with tangent vectors (“wave 
vectors' 

▼** = 0 . 

The rays both lie in and are perpendicular to surfaces of constant phase, 6 = 
const.; and their tangent vectors are the gradient of 6\ 

k = V0. I 

In a local Lorentz frame, k° is the “angular frequency” and k°/2ir is the ordinary 
frequency of the waves, and 

n — k/k° 

is a unit 3-vector pointing along their direction of propagation. 

C. Amplitude and Polarization Vector 

The waves are characterized by a real amplitude a and a complex polarization 
vector f of unit length, f- f = 1. (Of the fundamental quantities <9, k , a, f, all 
are real except f. See exercise 22.12 for deeper understanding of f .) 

The polarization vector is everywhere orthogonal to the rays, k* f ~ 0; and is 
parallel-transported along them, V k f — 0. 

The propagation law for the amplitude is 


d k a = -y(v -k)a. 
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r ~-\ 

This propagation law is equivalent to a law of conservation ofphotons (classically: 
of rays); a 2 k is the “conserved current” satisfying V • ( a 2 k ) = 0; and 
(& t 77h)~ 1 fa 2 k° VlgT d 3 x is the number of photons (rays) in the 3-volume of I 
integration on any x° = constant hypersurface, and is constant as this volume 
is carried along the rays. 

The propagation law holds separately on each hypersurface of constant phase. 
There it can be interpreted as conservation of a a 2 d , where (2 is a two-dimen¬ 
sional cross-sectional area of a pulse of photons or rays. See exercise 22.13. 



D. Vector Potential, Electromagnetic Field, 
and Stress-Energy-Momentum 

At any event the vector potential in Lorentz gauge is 

A = &{ae i9 f}> 

where & denotes the real part. 

The electromagnetic field tensor is orthogonal to the rays, F • k = 0, and is given 
by 

F = &{iae i9 k A f}. 

The corresponding electric and magnetic fields in any local Lorentz frame are 

E — &{/aA:V*(projection of/ perpendicular to k )} 9 
B = n x E, where n = k/k°. 

The stress-energy tensor, averaged over a wavelength, is 

T = {\/%v)a 2 k <& 

corresponding to an energy density in a local Lorentz frame of 

T 00 = (1/8*7 )(ak 0 ) 2 


and an energy flux of 

joi — jw n f _ 

_^ rays (in n = k/k° direction) with the speed of 

light. This is identical with the stress-energy tensor that would be produced by 
a beam of photons with 4-momenta p — hk. 

Conservation of energy-momentum V ■ T = 0 follows from the ray conservation 
law V • (a 2 k) = 0 and the geodesic law Y k k = (k • V)/c = 0: 

877 ▼ • T = ▼ • (a 2 k «*) = [▼• (a 2 k)]k + a 2 (k-V)k = 0. 

V- J 
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Box 22.4 (continued) 


The adiabatic (geometric optics) invariant “ray number” a 2 k° or “photon number” 
( 87 rft )' 1 ^ 0 in a unit volume is proportional to the energy, ( 87 r)* 1 ^ 2 ^ 0 ) 2 , 
divided by the frequency, k °—corresponding exactly to the harmonic oscillator 
adiabatic invariant E/co [Einstein (1912), Ehrenfest (1916), Landau and Lifshitz 
(I960)]. 


E. Photon Reinterpretation of Geometric Optics 


The laws of geometric optics can be reinterpreted as follows. This reinterpretation 
becomes a foundation of the standard quantum theory of the electromagnetic field 
(see, e.g., Chapters 1 and 13 of Baym (1969)]; and the classical limit of that 
quantum theory is standard Maxwell electrodynamics. 

Photons are particles of zero rest mass that move along null geodesics of spacetime 
(the null rays). 

The 4-momentum of a photon is related to the tangent vector of the null ray (wave 
vector) by p = tik. A renormalization of the affine parameter, 

(new parameter) — (1/ft) X (old parameter), 

makes p the tangent vector to the ray. 

Each photon possesses a polarization vector, f, which is orthogonal to its 4-mo- 
mentum (p • f = 0 ), and which it parallel-transports along its geodesic world 
line (Y p f = 0 ). 

A swarm of photons, all with nearly the same 4-momentum p and polarization 
vector f (as compared by parallel transport), make up a classical electromagnetic 
wave. The scalar amplitude a of the wave is determined by equating the 
stress-energy tensor of the wave 

to the stress-energy tensor of a swarm of photons with number-flux vector S, 

T = p ® S 

[see equation (5.18)]. The result: 

S = — (—\ 2 p = —^—a 2 k 
8 t 7 \ /j / 877ft 


or, in any local Lorentz frame, 


1/2 


/ number density of photons \ 

a = ( energy of one pholo H ' 
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(b) Show that 
listed: 

f = 
f = 

f = 

f = 
f - 

(c) Show that 

of ellipse) is the same as viewed in any local Lorentz frame at any event along a given ray. 
[Hint: Use pictures and abstract calculations rather than Lorentz transformations and com¬ 
ponent calculations.] 

Exercise 22.13. THE AREA OF A BUNDLE OF RAYS 

Write equation (22.31) in a coordinate system in which one of the coordinates is chosen 
to be x° = 0, the phase (a retarded time coordinate). 

(a) Show that g 00 = 0 and that no derivatives d/dd appear in equation (22.33); so propa¬ 
gation of a can be described within a single 6 = constant hypersurface. 

(b) Perform the following construction (see Figure 22.1). Pick a ray <?0 along which a 
is to be propagated. Pick a bundle of rays, with two-dimensional cross section, that (i) all 
lie in the same constant-phase surface as <? 0 , and (ii) surround (? 0 . (The surface is three-di- 


the following polarization vectors correspond to the types of polarization 


? x , linear polarization in x direction; 
linear polarization in y direction; 


1 

V2' 

1 

V2' 

ae x + i( 1 — a 2 ) 1/2 e y , righthand elliptical polarization. 


r(e x + ie y ), righthand circular polarization; 
r(e x — ie y ), lefthand circular polarization; 


the type of polarization (linear; circular; elliptical with given eccentricity 



Figure 22.1. 

Geometric optics for a bundle of rays with two-dimensional cross section, all lying in a surface of constant 
phase, 0 = const. Sketch (a) shows the bundle, surrounding a central ray 6-% in a spacetime diagram 
with one spatial dimension suppressed. Sketch (b) shows the bundle as viewed on a slice of simultaneity 
in a local Lorentz frame at the event & 0 . Slicing the bundle turns each ray into a “photon”; so the 
bundle becomes a two-dimensional surface filled with photons. The area d of this photon-filled surface 
obeys the following laws (see exercises 22.13 and 22.14): (1) (1 is independent of the choice of Lorentz 
frame; it depends only on location *J? 0 along the ray t ? 0 . (2) The amplitude a of the waves satisfies 

da 2 = constant all along the ray (P 0 

(“conservation of photon flux ”). (3) d obeys the “propagation equation *' (22.36). 
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mensional, so any bundle filling it has a two-dimensional cross section.) At any event 
in any local Lorentz frame there, on a “slice of simultaneity” x° = constant, measure the 
cross-sectional area d of the bundle. (Note: the area being measured is perpendicular to 
k in the three-dimensional Euclidean sense; it can be thought of as the region occupied 
momentarily by a group of photons propagating along, side by side, in the k direction.) Show 
that the area d is the same, at a given event ^ 0 , regardless of what Lorentz frame is used 
to measure it; but the area changes from point to point along the ray <? 0 as a result of the 
rays' divergence away from each other or convergence toward each other: 

d k d = (▼ * k)d. (22.36) 

Then show that da 2 is a constant everywhere along the ray (? 0 (“conservation of photon 
flux”). [Hints: (i) Any vector ( connecting adjacent rays in the bundle is perpendicular to 
k, because ( lies in a surface of constant 6 and k • f = (k, ( ) = (d6 , ( ) = (change in 6 
along () = 0. (ii) Consider, for simplicity, a bundle with rectangular cross section as seen 
in a specific local Lorentz frame at a specific event £P 0 [edge vectors v and iv with v ■ w = 0 
(edges perpendicular) and re 0 = ive 0 = 0 (edges in surface of constant time) and 
v * k = iv • k = 0 (since edge vectors connect adjacent rays of the bundle)]. Show pictorially 
that in any other Lorentz frame at ^ 0 , the edge vectors are v‘ = v + ak and iv' = iv + fik 
for some a and /?. Conclude that in all Lorentz frames at & 0 the cross section has identical 
shape and identical area, and is spatially perpendicular to the direction of propagation 
(k • v = k • h » = 0). (iii) By a calculation in a local Lorentz frame show that d k d = (▼ • k)d. 
(iv) Conclude from d k a = — |(V * k)a that d k (da 2 ) = 0.] 

Exercise 22.14. FOCUSING THEOREM 

The cross-sectional area d of a bundle of rays all lying in the same surface of constant phase 
changes along the central ray of the bundle at the rate (22.36) (see Figure 22.1). 

(a) Derive the following equation (“focusing equation ”) for the second derivative of d x/ 2 : 

PjfL = _ + I R a(5 k*kfrv 2, (22.37) 

where A is affine parameter along the central ray (Ic = d/d\), and the “magnitude of the 
shear of the rays”, |a|, is defined by the equation 

|a|* = i ( 22 - 38 ) 

[Hint: This is a vigorous exercise in index manipulations. The key equations needed in the 
manipulations are d a k a = (k a . a )d [equation (22.36)]; /c a ;/3 k& = 0 [geodesic equation (22.32) 
for rays]; k a .p — k fi;a [which follows from k a = 0 J; and the rule (16.6c) for interchanging 
covariant derivatives of a vector.] 

(b) Show that, in a local Lorentz frame where k = u(e t 4- e g ) at the origin, 

M 2 = \ K.y? + ( 22 - 39 ) 

Thus, |ct| 2 is nonnegative, which justifies the use of the absolute value sign. 

(c) Discussion: The quantity |a| is called the shear of the bundle of rays because it measures 
the extent to which neighboring rays are sliding past each other [see, e.g., Sachs (1964)]. 
Hence, the focusing equation (22.37) says that shear focuses a bundle of rays (makes 
d 2 d 1/2 /d\ 2 < 0); and spacetime curvature also focuses it if R a pk a k& > 0, but defocuses it 
if R afi k a k fi < 0. (When a bundle of toothpicks, originally circular in cross section, is squeezed 
into an elliptic cross section, it is sheared.) 
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(d) Assume that the energy density 7^, as measured by any observer anywhere in space- 
time, is nonnegative. By combining the focusing equation (22.37) with the Einstein field 
equation, conclude that 


/for any bundle of rays, all in the same\ 
^ djs} ~ — ® ( sur ^ ace constant phase, anywhere in ] 


^spacetime 


/ 


(22.40) 


{focusing theorem). This theorem plays a crucial role in black-hole physics (§34.5) and in 
the theory of singularities (§34.6). 


§22.6. KINETIC THEORY IN CURVED SPACETIME* 

The stars in a galaxy wander through spacetime, each on its own geodesic world 
line, each helping to produce the spacetime curvature felt by all the others. Photons, 
left over from the hot phases of the big bang, bathe the Earth, bringing with 
themselves data on the homogeneity and isotropy of the universe. Theoretical 
analyses of these and many other problems are unmanageable, if they attempt to 
keep track of the motion of every single star or photon. But a statistical description 
gives accurate results and is powerful. Moreover, for most problems in astrophysics 
and cosmology, the simplest of statistical descriptions—one ignoring collisions—is 
adequate. Usually collisions are unimportant for the large-scale behavior of a system 
(e.g.; a galaxy), or they are so important that a fluid description is possible (e.g., 
in a stellar interior). __ - 

Consider, then, a swarm of particles (stars, or photons, or black holes, or . . .) 
that move through spacetime on geodesic world lines, without colliding. Assume, 
for simplicity, that the particles all have the same rest mass. Then all information 
of a statistical nature about the particles can be incorporated into a single function, 
the “distribution function ” or “number density in phase space ”, 

Define % in terms of measurements made by a specific local Lorentz observer 
at a specific event in curved spacetime. Give the observer a box with 3-volume 
T x (and with imaginary walls). Ask the observer to count how many particles, N, 
are inside the box and have local-Lorentz momentum components p’ in the range 

P> — -i- Ap' < p> < P’ + y ApK 

(He can ignore the particle energies p°; since all particles have the same rest mass 
m, energy 

p° — (m 2 4- p 2 ) 1/2 


♦For more detailed and sophisticated treatments of this topic, see, e.g., Tauber and Weinberg (1961), 
and Lindquist (1966), Marie (1969), Ehlers (1971), Stewart (1971), Israel (1972), and references cited 
therein. Ehlers (1971) is a particularly good introductory review article. 


Volume in phase space for a 
group of identical particles 
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Lorentz invariance of volume 
in phase space 


Liouville's theorem 
(conservation of volume in 
phase space) 


Number density in phase 
space (distribution function) 


is fixed uniquely by momentum.) The volume in momentum space occupied by the 
N particles is Y p = Ap z Ap v Ap z \ and the volume in phase space is 

r = r x r p . (22.41) 

Other observers at 9 0 , moving relative to the first, will disagree on how much spatial 
volume Y x and how much momentum volume Y p these same N particles occupy: 

Y x and Y p depend on the choice of Lorentz frame. (22.42) 

However, all observers will agree on the value of the product Y = ^ x Y p (“volume 
in phase space”): 

The phase-space volume Y occupied by a given set of N identical 
particles at a given event in spacetime is independent (22.43) 

of the local Lorentz frame in which it is measured. 

(See Box 22.5 for proof.) Moreover, as the same N particles move through spacetime 
along their geodesic world lines (and through momentum space), the volume Y they 
span in phase space remains constant: 

The Y occupied by a given swarm of N particles is 
independent of location along the world line of the (22.44) 

swarm (“Liouville’s theorem in curved spacetime ”). 

(See Box 22.6 for proof.) 

More convenient for applications than the volume Y in phase space occupied 
by a given set of N particles is the “ number density in phase space ” (“distribution 
function ”) in the neighborhood of one of these particles: 

91 = N/Y . (22.45) 

On what does this number density depend? It depends on the location in space- 
time, 9, at which the measurements are made. It also depends on the 4-momentum 
p of the particle in whose neighborhood the measurements are made. But because 
the particles all have the same rest mass, p cannot take on any and every value 
in the tangent space at 9. Rather, p is confined to the “forward mass hyperboloid” 
at 9: 


p 2 = m 2 ; p lies inside future fight cone. 


Thus, 


91 = 91 


/ location, 9> 
\in spacetime 


( 4 -momentum p, which must fie 1 
on the forward mass hyperboloid 
of the tangent space at 9 


(22.46) 


Pick some one particle in the swarm, with geodesic world fine 9(X) [X — (affine 
parameter) = (proper time, if particle has finite rest mass)], and with 4-momentum 
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Box 22.5 VOLUME IN PHASE SPACE 


A. For Swarm of Identical Particles 
with Nonzero Rest Mass 

Pick an event through which passes a particle 
named “John” with a 4-momentum named “P”. 
In John’s local Lorentz rest frame at (“barred 
frame”, S), select a small 3-volume, C V X = Ax Ay 
£z , containing him. Also select a small “3-volume 
in momentum space,” T-= Ap x Ap* Ap z centered 
on John’s momentum, which is P x — P v — 
P z — 0. Focus attention on all particles whose world 
lines pass through *V- and which have momenta 
pi in the range T p surrounding P7 = 0. 


t t 



Examine this bundle in another local Lorentz 
frame (“unbarred frame”, S) at f? 0 , which moves 
with speed /? relative to the rest frame. Orient axes 
so the relative motion of the frames is in the a* 
and 3c directions. Then the space volume occu¬ 
pied in the new frame has Ay — Ay , Az — AY (no 
effect of motion on transverse directions), and 
Ax = (1 — /? 2 ) 172 Ax (Lorentz contraction in lon¬ 
gitudinal direction). Hence C V X = (1 — /? 2 ) 1/2 T- 
(“transformation law for space volumes”) or, 
equivalently [since P° — m/( 1 — /? 2 ) 1/2 ]: 

poor _ ctr_ __ /constant, independent^ 
x ~ m x ~ \of Lorentz frame / 



A momentum-space diagram, analogous to the 
spacetime diagram, depicts the momentum spread 
for particles in the bundle, and shows that Ap x — 
Ap x /( 1 — /? 2 ) 172 . The Lorentz transformation 
from S to S leaves transverse components of mo¬ 
menta unaffected; so Ap v = Apv, Ap z = Ap z . 
Hence c V p = T p /(\ — /? 2 ) 1/2 (“transformation law 
for momentum volumes”); or, equivalently 


Yz = 

po 


T- 

V 

m 


( constant, independent^ 
of Lorentz frame A 


Although the spatial 3-volumes < ¥ x and V- 
differ from one frame to another, and the momen¬ 
tum 3-volumes T p and T- differ, the volume in 
six-dimensional phase space is Lorentz-invariant: 

r = r/r ; = r f r r 

It is a frame-independent, geometric object! 


B. For Swarm of Identical Particles 
with Zero Rest Mass 

Examine a sequence of systems, each with particles 
of smaller rest mass and of higher velocity relative 
to a laboratory. For every bundle of particles in 
each system, P 0£ V X , T p /P°, and are Lorentz- 

invariant. Hence, in the limit as m —0, as 
P —1, and as P° = m/( 1 - $ 2 ) l/ 2 —finite 
value (particles of zero rest mass moving with 
speed of light), and T* p /P° and C V 9 C V‘ 9 are 

still Lorentz-invariant, geometric quantities. 
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r 


Box 22.6 CONSERVATION OF VOLUME IN PHASE SPACE 

Examine a very small bundle of identical particles 
that move through curved spacetime on neighbor¬ 
ing geodesics. Measure the bundle’s volume in 
phase space, T (T = in any local Lorentz 

frame), as a function of affine parameter X along 
the central geodesic of the bundle. The following 
calculation shows that 

d‘V/dX — 0 /“Liouville theorem in\ 

\ curved spacetime” / 


Proof for particles of finite rest mass: Examine 
particle motion during time interval Sr, using local 
Lorentz rest frame of central particle. All velocities 
are small in this frame, so 

pi — mdxf/dt. 

Hence (see pictures) the spreads in momentum 
and position conserve Ax Ap x , Ay Ap *, and Az Ap z \ 
i.e., 

d*V S(dx Ay A7 Ap x Apv Ap z ) _ 
dr 8t 

But r = aX -f b for some arbitrary constants a and 
b ; so dT/dX = 0. 


Prooffor particles of zero rest mass. Examine parti¬ 
cle motion in local Lorentz frame where central 
particle has P = P°(e 0 + e x ). In this frame, all 
particles have p v < p°, p z < p° 9 p x = p° + 
0([p v Y/P°) ~ P°- Since p a — dx a /dX for appro¬ 
priate normalization of affine parameters (see Box 


22.4), one can write dxfdt = p*/p°\ i.e., 

= 1 + 0([ p y/P^ + [p*/P 0 ] 2 ) 


dy_ 

dt 


1, 

£l 

po 5 


dz 

~di 


EL 

P 



4 






djc 


/ = 0 



► x 


Each particle moves with speed dx/dt proportional 
to height in diagram 

dx/dt = p*/m, 

and conserves its momentum, dp x /dt — 0. Hence 
the region occupied by particles deforms, but main¬ 
tains its area. Same is true for (y — p v ) and (z — p *). 


f 

4 


8t 


A P X J 





Ap*\ 


■\'l r • 


Ax 


Ax 


t = 0 


t — 8t 


Each particle (“photon”) moves with dx/dt — 1 and 
dp x /dt = 0 in the local Lorentz frame. Area and shape 
of occupied region are preserved. 
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Hence (see pictures) Ax Ap x , Ay Ap v , and Az Ap z 
are all conserved; and 

dT 5(Jx Ay Az Ap x Ap v Ap z ) _ 

~ST = 5i ” 

But r and the affine parameter X of central particle 
are related by t — P°X [cf. equation (16.4)]; thus 

dr/dX = 0. 




Ay Ay 


t = 0 t = St 

Particle (“photon”) speeds are proportional to height 
in diagram 

dy/dt = p»/P°, 

and dp v /dt = 0. Hence, occupied region deforms but 
maintains its area. Same is true of z — p z . 





p(X). Examine the density in phase space in this particle’s neighborhood at each 
point along its world line: 

% = Sfc[$>(A), p(X)]. 

Calculate %{X) as follows: (1) Pick an initial event £P(0) on the world line, and a 
phase-space volume T containing the particle. (2) Cover with red paint all the 
particles contained in T at ^(0). (3) Watch the red particles move through spacetime 
alongside the initial particle. (4) As they move, the phase-space region they occupy 
changes shape extensively; but its volume V remains fixed (Liouville’s theorem). 

Moreover, no particles can enter or leave that phase-space region (once in, always 
in; once out, always out; boundaries of phase-space region are attached to and move 
with the particles). (5) Hence, at any X along the initial particle’s world line, the 
particle is in a phase-space region of unchanged volume T, unchanged number of 
particles N, and unchanged ratio % = N/T : 

d%[^P{X), p{ A)] _ n nl A1 ^ Collisionless Boltzmann 

^ ~ (ZZA ) e q uat j on (kinetic equation) 

This equation for the conservation of 91 along a particle’s trajectory in phase space 
is called the “collisionless Boltzmann equation,” or the “kinetic equation 

Photons provide an important application of the Boltzmann equation. But when 
discussing photons one usually does not think in terms of the number density in 
phase space. Rather, one speaks of the “specific intensity” I v of radiation at a given 
frequency v , flowing in a given direction, n, as measured in a specified local Lorentz 
frame: 

, =_ ^(energy) - 

* d( time) d( area) ^(frequency) d(solid angle) 


Distribution function for 
photons expressed in terms 
of specific intensity, ! v 

Invariance and conservation 
of !Jv z 


EXERCISES 
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(See Figure 22.2). A simple calculation in the local Lorentz frame reveals that 

« = (22.49) 

where h is Planck’s constant (see Figure 22.2). Thus, if two different observers at 
the same or different events in spacetime look at the same photon (and neighboring 
photons) as it passes them, they will see different frequencies v (“doppler shift,” 
“cosmological red shift,” “gravitational redshift”), and different specific intensities 
but they wall obtain identical values for the ratio I v /v z . Thus I v /v z , like is 
invariant from observer to observer and from event to event along a given photon’s 
world line. 


Exercise 22.15. INVERSE SQUARE LAW FOR FLUX 

The specific flux of radiation entering a telescope from a given source is defined by 

F, = f /„ da, (22.50) 

where integration is over the total solid angle (assumed < 4sr) subtended by the source on 
the observer’s sky. Use the Boltzmann equation (conservation of I v /v z ) to show that 
F v cc (distance from source) -2 for observers who are all at rest relative to each other in flat 
spacetime. 

Exercise 22.16. BRIGHTNESS OF THE SUN 

Does the surface of the sun look any brighter to an astronaut standing on Mercury than 
to a student standing on Earth? 

Exercise 22.17. BLACK BODY RADIATION 

An “optically thick” source of black-body radiation (e.g., the surface of a star, or the hot 
matter filling the universe shortly after the big bang) emits photons isotropically with a specific 
intensity, as seen by an observer at rest near the source, given (Planck radiation law) by 


r _ 2 hu 3 

1 V — e hv/kT _ 1 ■ 


(22.51) 


Here T is the temperature of the source. Show that any observer, in any local Lorentz frame, 
anywhere in the universe, who examines this radiation as it flows past him, will also see 
a black-body spectrum. Show, further, that if he calculates a temperature by measuring the 
specific intensity /„ at any one frequency, and if he calculates a temperature from the shape 
of the spectrum, those temperatures will agree,.(Radiation remains black body rather than 
being “diluted” into “grey-body.”) Finally, show that the temperature he measures is red- 
shifted by precisely the same factor as the frequency of any given photon is redshifted, 

7 ° bserved = ( "observed \ for a given photon . ( 22 . 52 ) 

^emitted ' ^emitted ' 

[Note that the redshifts can be “Doppler” in origin, “cosmological” in origin, “gravitational” 
in origin, or some inseparable mixture. All that matters is the fact that the parallel-transport 
law for a photon’s 4-momentum, V p p — 0, guarantees that the redshift ^served/^emitted * s 
independent of frequency emitted.] 
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f 



3-momentum volume, with direction 
of momentum vectors reversed for 
ease of visualization (telescope as 
an emitter, not a receiver!) 

Figure 22.2. 

Number density in phase space for photons, interpreted in terms of the specific intensity An 
astronomer has a telescope with filter that admits only photons arriving from within a small solid angle 
AS? about the z-direction, and having energies between p° and p° 4* Ap° . The collecting area, d, of his 
telescope lies in the x 9 jy-plane (perpendicular to the incoming photon beam). Let 8N be the number 
of photons that cross the area d in a time interval 8t. [All energies, areas, times, and lengths are measured 
in the orthonormal frame (“proper reference frame; §13.6) which the astronomer Fermi-Walker 
transports with himself along his (possibly accelerated) world line—or, equivalently, in a local Lorentz 
frame momentarily at rest with respect to the astronomer.] The 8N photons, just before the time interval 
5/ begins, he in the cylinder of area d and height 8z - 8t shown above. Their spatial 3-volume is thus 
r x = d 8t. Their momentum 3-volume is T p = (p 0 ) 2 Ap° AS? (see drawing). Hence, their number density 
in phase space is 

8N 8N _ 8N _ 

° X ~ ~ a 8t(p°)HAp°) AS? h 3 d 8t v 2 Av AS? 

where v is the photon frequency measured by the telescope (p° = hv). 

The specific intensity of the photons, I v (a standard concept in astronomy), is the energy per unit 
area per unit time per unit frequency per unit solid angle crossing a surface perpendicular to the beam: 
i.e., 

hv8N 

lv ~ d8t Au AS?' 


Direct comparison reveals & = h 4 {I,/**). 

Thus, conservation of ^ along a photon’s world line implies conservation of I y /v 3 . This conservation 
law finds important applications in cosmology (e.g.. Box 29.2 and Ex. 29.5) and in the gravitational lens 
effect (Refsdal 1964); see also exercises 22.15-22.17. 


Exercise 22.18. STRESS-ENERGY TENSOR 

(a) Show that the stress-energy tensor for a swarm of identical particles at an event can 
be written as an integral over the mass hyperboloid of the momentum space at ^ 0 : 

T = J(#p ® p)(d'Y p /p°), (22.53) 

d‘V B dp* dp y dp* . ... . 

-p = _l— iL — L .— in a local Lorentz frame. 

p° p° 


(22.54) 




590 


22. THERMODYNAMICS, HYDRODYNAMICS.AND KINETIC THEORY 


(Notice from Box 22.5 that dV p /p° is a Lorentz-invariant volume element for any segment 
of the mass hyperboloid.) 

(b) Verify that the Boltzmann equation, dX/dX = 0, implies ▼ • T ~ 0 for any swarm 
of identical particles. [Hint: Calculate ▼ • T in a local Lorentz frame, using the above 
expression for T, and using the geodesic equation in the form Dp^/dX = 0.] 


Exercise 22.19. KINETIC THEORY FOR NONIDENTICAL PARTICLES 

For a swarm of particles with a wide distribution of rest masses, define 


X ~ 


AN 


r x r p Am 9 


(22.55) 


where T x and T p are spatial and momentum 3-volumes, and AN is the number of particles 
in the region T/Tp with rest masses between m — Am/2 and m + Am/2. Show the following. 

(a) k Y/F p Am is independent of Lorentz frame and independent of location on the world 
tube of a bundle of particles. 

(b) X can be regarded as a function of location in spacetime and 4-momentum p inside 
the future light cone of the tangent space at 


X - X($> y p). 


(22.56) 


(c) X satisfies the collisionless Boltzmann equation (kinetic equation) 


d\ 


along geodesic trajectory of any particle. 


(22.57) 


(d) X can be rewritten in a local Lorentz frame as 

AN 


X = 


[(p°/m) Ax Ay Az][Ap° Ap x Ap v Ap z ] 


(22.58) 


(e) The stress-energy tensor at an event & can be written as an integral over the interior 
of the future light cone of momentum space 


7>' = J dp 0 dp 1 dp 2 dp 3 

in a local Lorentz frame (Track-1 notation for integral; see Box 5.3); 

T = J (Xp 0 p)m~ lif \ in frame-independent notation 

= J (Xp 0 p)m~ l dp 0 A dp 1 A dp 2 A dp 3 
in a local Lorentz frame (Track-2 notation; see Box 5.4). 


(22.59) 


(22.590 
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RELATIVISTIC STARS 


Wherein the reader , armed 
with the magic potions and powers 
of Geometrodynamics , conquers the stars. 



CHAPTER 


SPHERICAL STARS 
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§23.1. PROLOG 

Beautiful though gravitation theory may be, it is a sterile subject until it touches 

the real physical world. Only the hard reality of experiments and of astronomical Preview of the rest of this 

observations can bring gravitation theory to life. And only by building theoretical book 

models of stars (Part V), of the universe (Part VI), of stellar collapse and black holes 

(Part VII), of gravitational waves and their sources (Part VIII), and of gravitational 

experiments (Part IX), can one understand clearly the contacts between gravitation 

theory and reality. 

The model-building in this book will follow the tradition of theoretical physics. 

Each Part (stars, universe, collapse, . . .) will begin with the most oversimplified 
model conceivable, and will subsequently add only those additional touches of 
realism necessary to make contact with the least complex of actual physical systems. 

The result will be a tested intellectual framework, ready to support and organize 
the additional complexities demanded by greater realism. Greater realism will not 
be attempted in this book. But the reader seeking it could start in no better place 
than the two-volume treatise on Relativistic Astrophysics by Zel’dovich and Novikov 
(1971, 1974). 

Begin, now, with models for relativistic stars. As a major simplification, insist 
(initially) that all stars studied be static. Thereby exclude not only exploding and 
pulsating stars, but even quiescent ones with stationary rotational motions. From 

the static assumption, plus a demand that the star be made of “perfect fluid” (no Static stars must be spherical 

shear stresses allowed!), plus Einstein’s field equations, it probably follows that the 

star is spherically symmetric. However, nobody has yet given a proof. [For proofs 

under more restricted assumptions, see Avez (1964) and Kunzle (1971).] In the 

absence of a proof, assume the result: insist that all stars studied be spherical as 

well as static. 
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§23.2. COORDINATES AND METRIC FOR A STATIC, 

SPHERICAL SYSTEM 

Metric for any static. To deduce the gravitational field for a static spherical star — or for any other static, 

spherical system: spherical system — begin with the metric of special relativity (no gravity) in the 

spherically symmetric form 

ds~ = - dt 2 + dr 2 + r 2 d£2 2 , (23.1) 

where 

dQ 2 = dd 2 + sin 2 9 d$ 2 . (23.2) 

(1) generalized from flat Try to modify this metric to allow for curvature due to the gravitational influence 

spacetime 0 f the star, while preserving spherical symmetry. The simplest and most obvious 

guess is to allow those metric components that are already non-zero in equation 
(23.1) to assume different values: 

ds 2 _ -e 2 * dt 2 + dr 2 + R 2 dti 2 , (23.3) 

where A , and R are functions of r only. (The static assumption demands 

dg^/dt = 0.) To verify that this guess is good, use it in constructing stellar models, 
and check that the resulting models have the same generality (same set of quantities 
freely specifiable) as in Newtonian theory and as expected from general physical 
considerations. An apparently more general metric 

ds 2 = — a 2 dt 2 — lab dr dt + c 2 dr 2 _+ R 2 dQ 2 (23.4) 

actually is not more general in any physical sense. One can perform a coordinate 
transformation to a new time coordinate V defined by 

e 0 dt' — a dt + b dr. (23.5) 

By inserting this in equation (23.4), and by defining e 2A = b 2 + c 2 , one obtains the 
postulated line element (23.3), apart from a prime on the t.* 

The necessity to allow for arbitrary coordinates in general relativity may appear 
burdensome when one is formulating the theory; but it gives an added flexibility, 
something one should always try to turn to one’s advantage when formulating and 
solving problems. The g n = 0 simplification (called a coordinate condition) in equa¬ 
tion (23.3) results from an advantageous choice of the t coordinate. The r coordinate, 

(2) specialized to however, is also at one’s disposal (as long as one chooses it in a way that respects 

" Schwarzschild form" spherical symmetry; thus not / = r + cos#). One can turn this freedom to advan¬ 
tage by introducing a new coordinate r'(r) defined by 

r' = R(r). (23.6) 

*Of course, equation (23.5) only succeeds in defining a new time coordinate /' if it is integrable as 
a differential equation for C . By choosing the integrating factor e 0 to be just e* = one sees that 
* f' = / + f{b(r)/a(r)] dr is the integral of (23.5); thus the required t’ coordinate always exists, no matter 

what the functions a(r), b(r). c(r ), and R(r ) in equation (23.4) may be. 
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With this choice of the radial coordinate, and with the primes dropped, equation 
(23.3) reduces to 

ds 2 = — e 24> dt 2 + e 2A dr 2 + r 2 d£2 2 , (23.7) 

a line element with just two unknown functions, <P(r) and A(r). This coordinate 
system and metric have been used in most theoretical models for relativistic stars 
since the pioneering work of Schwarzschild (1916b), Tolman (1939), and Oppen- 
heimer and Volkoff (1939). These particular coordinates are sometimes called “cur¬ 
vature coordinates” and sometimes “Schwarzschild coordinates.” The central idea 
of these coordinates, in a nutshell, is (Schwarzschild r- coordinate) = (proper circum¬ 
ference)/^. 

For a more rigorous proof that in any static spherical system Schwarzschild (3) derived more rigorously 
coordinates can be introduced, bringing the metric into the simple form (23.7), see 
Box 23.3 at the end of this chapter. 


Exercise 23.1. ISOTROPIC COORDINATES AND NEWTONIAN LIMIT EXERCISE 

An alternative set of coordinates sometimes used for static, spherical systems is the “isotropic 
coordinate system” (r, 7, 0, <£>). The metric in isotropic coordinates has the form 

ds 2 = -e 2 * dt 2 + e 2 *[dr 2 +r 2 dQ% (23.8) 

with <P and [i being functions of 7. 

(a) Exhibit the coordinate transformation connecting the Schwarzschild coordinates (23.7) 
to the isotropic coordinates (23.8). 

(b) From equation (16.2a) [or equivalently (18.15c)], show that, in the Newtonian limit, 
the metric coefficient <P of the isotropic line element becomes the Newtonian potential; and 
H becomes equal to By combining with part (a), discover that A = rd<P/dr in the 
Newtonian limit. 


§23.3. PHYSICAL INTERPRETATION OF 
SCHWARZSCHILD COORDINATES 


In general relativity, because the use of arbitrary coordinates is permitted, the 
physical significance of statements about tensor or vector components and other 
quantities is not always obvious. There are, however, some situations where the 
interpretation is almost as straightforward as in special relativity. The most obvious 
example is the center point of a local inertial coordinate system, where the principle 
of equivalence allows one to treat all local quantities (quantities not involving 
spacetime curvature) exactly as in special relativity. Schwarzschild coordinates for 
a spherical system turn out to be a second example. 

One’s first reaction when meeting a new metric should be to examine it, not in 
order to learn about the gravitational field, for which the curvature tensor is more 


The form of any metric can 
reveal the nature of the 
coordinates being used 
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Geometric significance of the 
Schwarzschild coordinates: 

(1) 6, <f> are angles on sphere 


(2) r measures surface area 
of sphere 


(3) t has 3 special geometric 
properties 


(4) description of a 

"machine" to measure t 


directly informative, but to learn about the coordinates. (Are they, for instance, 
locally inertial at some point?) 

The names given to the coordinates have no intrinsic significance. A coordinate 
transformation t f = 0, r' = & O' = r, <>_ = t is perfectly permissible, and has no 
influence on the physics or the mathematics of a relativistic problem. The only thing 
it affects is easy communication between the investigator who adopts it and his 
colleagues. Thus the names tr6$ for the Schwarzschild coordinates (23.7) provide 
a mnemonic device pointing out the geometric content of the coordinates.* In 
particular, the names 6,6 are justified by the fact that on each two-dimensional 
surface of constant r and f, the distance between two nearby events is given by 
ds 2 = r 2 df2 2 , as befits standard 9, <J> coordinates on a sphere of radius r. The area 
of this two-dimensional sphere is clearly 

A = J* (rd6)(r sin 6 d§) — 4nr 2 ; (23.9) 

hence, the metric (23.7) tells how to measure the r coordinate that it employs. One 
can merely measure (in proper length units) the area A of the sphere, composed 
of all points rotationally equivalent to the point C JP for which the value r(f) is desired; 
and one can then calculate 

r{f) = ( P r °P er 0f ;P here /4.y' ! . (23.9') 

\through point £? / / 

The Schwarzschild coordinates have been picked for convenience, and not for the 
ease with which one could build a coordinate-measuring machine. This makes it 
more difficult to design a machine to measure t than machines to measure r, 6 , <f>. 

The geometric properties of t on which a measuring device can be based are: 
(1) the time-independent distances (dg a ^/dt — 0) between world lines of constant 
r, 6 , (2) the orthogonality ( g tr = g te = g t(f) ~ 0) of these world lines to the t = 
constant hypersurfaces; and (3) a labeling of these hypersurfaces by Minkowski 
(special relativistic) coordinate time at spatial infinity, where spacetime becomes 
flat. This labeling produces a constraint 

0(oo) = 0 (23.10) 

in the metric (23.7). [Mathematically, this constraint is imposed by a simple rescaling 
transformation t ' = e 0(oc V, and by then dropping the prime.] 

One “machine” design which constructs (mentally) such a t coordinate, and in 
the process measures it, is the following. Observers using radar sets arrange to move 
along the coordinate lines r, 6, <p = const. They do this by adjusting their velocities 
until each finds that the radar echos from his neighbors, or from “benchmark” 
reference points in the asymptotically flat space, require the same round-trip time 
at each repetition. Equivalently, each returning echo must show zero doppler shift; 

*For an example of misleading names, consider those in the equation 

ds 2 = — e 2 * {8 ) d4>' 2 + e 2Am dd' 2 + 6' 2 (dt' 2 + sin 2 t dr' 2 ), 
which is equivalent to equation (23.7), but employs the coordinates t' = 8, r' = <£, 6 ' = r, <p' = t. 
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it must return with the same frequency at which it was sent out. Next a master clock 
is set up near spatial infinity (far from the star). It is constructed to measure proper 
time—which, for it, is Minkowski time “at infinity”—and to emit a standard one- 
Hertz signal. Each observer adjusts the rate of his “coordinate clock” to beat in time 
with the signals he receives from the master clock. To set the zero of his “coordinate 
clock,” now that its rate is correct, he synchronizes with the master clock, taking 
account of the coordinate time At required for radar signals to travel from the master 
to him. [To compute the transit time, he assumes that for radar signals (^ refle ction — 
'emission) = ('return - 'reflection) = At > so that the echo is obtained by time-inversion 
about the reflection event. This time-reversal invariance distinguishes the time t in 
the metric (23.7) from the more general t coordinates allowed by equation (23.4).] 
Each observer moving along a coordinate fine (r, 0, <> = const.) now has a clock that 
measures coordinate time t in his neighborhood. 

The above discussion identifies the Schwarzschild coordinates of equation (23.7) 
by their intrinsic geometric properties. Not only are r and t radial and time variables, 
respectively (in that d/dr and d/dt are spacelike and timelike, respectively, and are 
orthogonal also to the spheres defined by rotational symmetry), but they have 
particular properties [Airr 2 = surface area; dg^/dt = 0; d/dr • d/dt = g n = 0; 
d/dt • d/dt = g tt = — 1 at r = oo] that distinguish them from other possible coordi¬ 
nate choices [r' = /(r), f = t + F(r)]. No claim is made that these are the only 
coordinates that might reasonably be called r and r; for an alternative choice 
(“isotropic coordinates”), see exercise 23.1. However, they provide a choice that is 
reasonable, unambiguous, useful, and often used. 


§23.4. DESCRIPTION OF THE MATTER INSIDE A STAR 

To high precision, the matter inside any star is a perfect fluid. (Shear stresses are 
negligible, and energy transport is negligible on a “hydrodynamic time scale.”) Thus, 
it is reasonable in model building to describe the matter by perfect-fluid parameters: 

p = p(r) = density of mass-energy in rest-frame of fluid; 
p = p(r) = isotropic pressure in rest-frame of fluid; 
n = n(r) = number density of baryons in rest-frame of fluid; (23.11) 
u M = u^(r) — 4-velocity of fluid; 

T^ v = (p + p)u^u v + pg^ v — stress-energy tensor of fluid. (23.12) 

(For Track-1 discussion, see Box 5.1; for greater Track-2 detail, see §§22.2 and 22.3.) 
In order that the star be static, each element of fluid must remain always at rest 
in the static coordinate system; i.e., each element must move along a world line 
of constant r, 0, cf>; i.e., each element must have 4-velocity components 


Other coordinates are 
possible, but Schwarzschild 
are particularly simple 


Material inside star to be 
idealized as perfect fluid 


Parameters describing perfect 
fluid: 

(1) P, P. n 


(2) i/ 


u r = dr/dr = 0, u B = dd/dj = 0, u* = d$/dr = 0. (23.13a) 
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(3) T 


Proper reference frame of 
fluid 


Components of u and T in 
proper reference frame 


Equation of state: 
(1) in general 


(2) idealized to 

"one-parameter form" 
p = p(n), p = p(n) m 


The normalization of 4-velocity, 

— 1 = u • u = g^uW = gtt^u* = w*, 

then determines id, 

id — dt/dr — u =. e~ (p d/dt\ (23.13b) 


and this, together with the general form (23.12) of the stress-energy tensor and the 
form (23.7) of the metric, determines T* v \ 


T oo = p e ~ 2 <p^ ttt _ pe -2 * T ee = pr -2 y 

T«P = 0 if a ^ /?. 


— pr 2 s j n 2 Q 

(23.14) 


Although these components of the stress-energy tensor in Schwarzschild coordi¬ 
nates are useful for calculations, the normalization factors e~ 2<t> , e ~ 2A , r~ 2 , r~ 2 sin' 2 8 
make them inconvenient for physical interpretations. More convenient are compo¬ 
nents on orthonormal tetrads carried by the fluid elements (“proper reference 
frames”; see §13.6): 


_ _d_ _ 

1 a 

e . __LA 



1 3 . 


dr 

e* a r 

r dr' 

6 r d0’ 

e 0 = 

r sin 8 d<p ’ 

1 Jaj 

w % — 

e* dt, 

w r = e A dr, 

lit 9 — r d6, 

w* 

= r sin 8 dip; 

(23.15b) 


U 

= ef y id = 1, u r — u * 

= u* = 

0; 

(23.15c) 

J tt 

II 

<0 

t? 

III 

p. n ? = t„ 

II 

II 

T afi = 

= 0 if a ^ /3. 

(23.15d) 


See exercise 23.2 below. 

The structure of a star—i.e., the set of functions <P(r ), A(r), p(r ), p(r), n(r) —is 
determined in part by the Einstein field equations, G* lP — 87 rT ttP 9 and in part by 
the law of local conservation of energy-momentum in the fluid, T^ v ;v = 0. However, 
these are not sufficient to fix the structure uniquely. Also necessary is the functional 
dependence of pressure p and density p on number density of baryons n : 

P = pin). P~p{n). (23.16) 

Normally one cannot deduce p and p from a knowledge solely of n. One must know, 
in addition, the temperature T or the entropy per baryon then the laws of 
thermodynamics plus equations of state will determine all remaining thermodynamic 
variables: 

p - p(n, s) 9 p = p(«,5), .... 

(See §22.2 and Box 22.1 for full Track-2 discussions.) To pass from the given thermo¬ 
dynamic knowledge, p(n, s) and p(n, 5), to the desired knowledge, p{n) and p(«), one 
needs information about the star’s thermal properties, and especially about the way 
in which energy*generation plus heat flow have conspired to distribute the entropy, 
s = s(n ): 


Pi n ) = p[n, *(«)], p(n ) = p[n, $(«)]. 
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There exist three important applications of the theory of relativistic stars: neutron 
stars, white dwarfs, and supermassive stars (stars with M > 10 3 M o , which may exist 
according to theory, but the existence of which has never yet been confirmed by 
observation). In all three cases, happily, the passage from p — pin,s), p(n,s), to 
p = pin), p = pin), is trivial. 

Consider first a neutron star. Though hot by ordinary standards, a neutron star 
is so cold by any nuclear-matter scale of temperatures that essentially all its thermal 
degrees of freedom are frozen out (“degenerate gas”; “quantum fluid”). It is not 
important that a detailed treatment of the substance of a neutron star is beyond 
the capability of present theory (allowance for the interaction between baryon and 
baryon; production at sufficiently high pressures of hyperons and mesons). The 
simple fact is that one is dealing with matter at densities comparable to the density 
of matter in an atomic nucleus (2 X 10 14 g/cm 3 ) and higher. Everything one knows 
about nuclear matter [see, for example, Bohr and Mottelson (1969)] tells one that 
it is degenerate, and that one can estimate in order of magnitude its degeneracy 
temperature by treating it as though it were an ideal Fermi neutron gas. (In a normal 
atomic nucleus, a little more than 50 per cent of all baryons are neutrons, the rest 
are protons; in a neutron star, as many as 99 per cent are neutrons.) When approxi¬ 
mating the neutron-star matter as an ideal Fermi neutron gas, one considers the 
neutrons to occupy free-particle quantum states, with two particles of opposite spin 
in each occupied state, and a sharp drop from 100 per cent occupancy of quantum 
states to empty states when the particle energy rises to the level of the “Fermi energy” 
[for more on such an ideal Fermi gas, see Kittel, Section 19 (1958); or at an intro¬ 
ductory level, see Sears, Section 16-5 (1953)]. In matter at nuclear density, the Fermi 
energy is of the order 

^Fermi ~~ 30 MeV or 3 x 10 11 K; 

and at higher density the temperature required to unfreeze the degeneracy is even 
greater. In other words, for matter at and above nuclear densities, already at zero 
temperature the kinetic energy of the particles (governed by the Pauli exclusion 
principle and by their Fermi energy) is a primary source of pressure. Nuclear forces 
make a large correction to this pressure, but for T < 30 MeV = 3 x 10 11 K, energies 
of thermal agitation do not. 

A star, in collapsing from a normal state to a neutron-star state (see Chapter 24), 
emits a huge flux of neutrinos at temperatures > 10 10 K, and thereby cools to 
T < 3 X 10 11 K within a few seconds after formation. Consequently, in all neutron 
stars older than a few seconds one can neglect thermal contributions to the pressure 
and density; i.e., one can set 

Pin, s) - p{n , 5 = 0) = p{n), p(n , s) = pin , s = 0) = pin). 

A white dwarf is similar, except that here electrons rather than neutrons are the 
source of Fermi gas pressure and degeneracy. Typical white-dwarf temperatures 
satisfy 


Justification for idealized 
equation of state: 

(1) in neutron stars 


(2) in white dwarfs 


kT ^Fermi electrons» 
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(3) in supermassive stars 


EXERCISE 


Five equations needed to 
determine 5 stellar-structure 
functions: &, A, p, p, n 


the Fermi kinetic energy (Pauli exclusion principle), and not random kT energy, 
is primarily responsible for the pressure and energy density; and one can set 

p(n, s) = p(n , 5 = 0)= p(n\ p{n , s) = p(«, 5 = 0) = p(w). 

In a supermassive star (see Chapter 24), the situation is quite different. There 
temperature and entropy are almost the whole story, so far as pressure and energy 
density are concerned. However, convection keeps the star stirred up and produces 
a uniform entropy distribution 

5 = const, independent of radius; 


so one can write 

/?(/?, 5 ) = p s (n ), p(«, s) = p s («). 



functions depending on 



uniform entropy per baryon, 



_5, in the star 



In all three cases—neutron stars, white dwarfs, supermassive stars—one regards 
the relations p(n) and p(n) as “equations of state”; and having specified them, one 
can calculate the star’s structure without further reference to its thermal properties. 


Exercise 23.2. PROPER REFERENCE FRAMES OF FLUID ELEMENTS 

(a) Verify that equations (23.15a,b) define an orthonormal tetrad and its dual basis of 1-forms, 
at each event in spacetime. 

(b) Verify that the components of the fluid 4-velocity relative to these tetrads are given 
by equations (23.15c). Why do these components guarantee that the tetrads form “proper 
reference frames” for the fluid elements? 

(c) Verify equations (23.15d) for the components of the stress-energy tensor. 


§23.5. EQUATIONS OF STRUCTURE 

The structure of a relativistic star is determined by five functions of radius r: the 
metric functions <£(/•), A(r), the pressure p{r\ the density of mass-energy p(r), and 
the number density of baryons, n(r). Hence, to determine the structure uniquely, 
one needs five equations of structure, plus boundary conditions. Two equations of 
structure, the equations of state p(n) and p(n), are already in hand. The remaining 
three must be the essential content of the Einstein field equations and of the law 
of local energy-momentum conservation, T ^ v .„ = 0. 

One knows that the law of local energy-momentum conservation for the fluid 
follows as an identity from the Einstein field equations. Without loss of information, 
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one can therefore impose all ten field equations and ignore local energy-momentum 
conservation. But that is an inefficient way to proceed. Almost always the equations 
T**., = 0 can be reduced to usable form more easily than can the field equations. 
Hence, the most efficient procedure is to: (1) evaluate the four equations T^ v — 0; 
(2) evaluate enough field equations (six) to obtain a complete set (6 + 4 = 10); 
and (3) evaluate the remaining four field equations as checks of the results of 
(1) and (2). 

The Track-2 reader has learned (§22.3) that the equations T **= 0 for a perfect 
jluid take on an especially simple form when projected (1) on the 4-velocity u of 
the fluid itself, and (2) orthogonal to a. Projection along u {u li T flv . y = 0) gives the 
local law of energy conservation (22.11a), 


dr 


-(p + pyy * u = 


P 4* P dn 
n !h' 


where u = d/dr ; i.e., t is proper time along the world line of any chosen element 
of the fluid. For a static star, or for any other static system, both sides of this equation 
must vanish identically (no fluid element ever sees any change in its own density). 
Projection of ;v = 0 orthogonal to u gives the reasonable equation 

7meTtial mass \ - (4 . accderation) = _ /pressure gradient, projected\ 

Vper unit volume/ \perpendicular to u ) 

i.e., 

(p + PUu U = -Up + UuP) U l 


[see equation (22.13)]. When applied to a static star, this equation tells how much 
pressure gradient is needed to prevent a fluid element from falling. Only the radial 
component of this equation has content, since the pressure depends only on r. The 
radial component in. the Schwarzschild coordinate system says [see the fine element 
(23.7) and the 4-velocity components (23.13)], 

(P + P) U r;X = “(P + p)r a TV U a U v - -(p + p)r° r0 U 0 U° 

= (p + />)#, r = -P,r (23.17) 

(Track-1 readers can derive this from scratch at the end of the section, exercise 23.3.) 
In the Newtonian limit, <P becomes the Newtonian potential (since g 00 = 
— e 2tp ~ — 1 — 2 <P\ and the pressure becomes much smaller than the mass-energy 
density; consequently equation (23.17) becomes 


P*r= P s' (23.17N) 

This is the Newtonian version of the equation describing the balance between 
gravitational force and pressure gradient. 

The pressure gradient that prevents a fluid element from falling appears in 
Einstein’s theory as the source of an acceleration. This acceleration, by keeping the 
fluid element at a fixed r value, causes it to depart from geodesic motion (from 
“fiducial world line”; from motion of free fall into the center of the star). Newtonian 


The most efficient procedure 
for solving Einstein equations 


Equation of hydrostatic 
equilibrium derived 


Comparison of Newton and 
Einstein views of hydrostatic 
equilibrium 
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Equation for A derived 


theory, on the other hand, views as the fiducial world line the one that stays at a 
fixed r value. It regards the “gravitational force” as trying (without success, because 
balanced by the pressure gradient) to pull a particle from a fixed-r world line onto 
a geodesic world line. In the two theories the magnitudes of the acceleration, whether 
“actually taking place” (Einstein theory) or “trying to take place” (Newtonian 
theory), are the same to lowest order (but opposite in direction); so it is no surprise 
that (23.17) and (23.17N) differ only in detail. 

Turn next to the Einstein field equation. Here, as is often the case, the components 
of the field equation in the fluid’s orthonormal frame [equations (23.15a,b)] are 
simpler than the components in the coordinate basis. One already knows the stress- 
energy tensor T-q in the orthonormal frame [equation (23.15d)]; and Track-2 readers 
have already calculated the Einstein tensor (exercise 14.13; Track-1 readers will 
face the task at the end of this section, exercise 23.4). All that remains is to equate 
to 8 t 7 7^. Examine first the 00 component of the field equations: 

G 55 = r “ 2 — r~ 2 e~ 2A — r~\d/dr)(e~ 2A ) 

= r\d/dr)[r( 1 - e~ 2A )} = 8 *r w = 8 ? 7 p. 


This equation becomes easy to solve as soon as one notices that it is a differential 
equation linear in the quantity e~ 2A ; a bit of tidying up then focuses attention on 
the quantity r(l — e ~ 2A ). Give this quantity the name 2 m(r) (so far only a name!); 
thus, 

2 m = r( 1 - e~ 2A ); e 2A = (1 - 2m/r)" 1 . (23.18) 

In this notation the 00 component of the Einstein tensor becomes 


0>6 — 


2 dm(r) 
r 2 dr 


— 8 TTp. 


Integrate and find 


m(r) - 1 4?rr 2 p dr 4 - m(0). 


(23.19) 


For the constant of integration m(0), a zero value means a space geometry smooth 
at the origin (physically acceptable); a non-zero value means a geometry with a 
singularity at the origin (physically unacceptable: no local Lorentz frame at r = 0): 


ds 2 = [1 — 2 m( 0 )/r ]~ 1 dr 2 4 r 2 {d6 2 4 sin 2 8 d<f> 2 ) 
~ — [r/ 2 m( 0 )] dr 2 4 - r 2 {dd 2 4 * sin 2 # d<p 2 ) 
ds 2 — [1 — (8?r/3 )p c r 2 ]~ 1 dr 2 4 * r 2 (d6 2 4- sin 2 # d<j> 
~ dr 2 4 - r 2 (d6 2 4 * sin 2 # dcj> 2 ) 


at r ^ 0 if m(0) ^ 0; 

2) (23.20) 

at r ^ 0 if m(0) = 0. 


"Mass-energy inside radius 
r," m(r), defined 


The quantity m(r), defined by equation (23.18) and calculated from equation 
(23.19) with m(0) = 0, is a relativistic analog of the “mass-energy inside radius r.” 
Box 23.1 spells out the analogy in detail. 
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Box 23.1 MASS-ENERGY INSIDE RADIUS r 


The total mass-energy M of an isolated star is well-defined (Chapter 19). But not 
well-defined, in general, is the distribution of that mass-energy from point to point 
inside the star and in its gravitational field (no unique “gravitational stress-energy 
tensor”). This was the crucial message of §20.4 (Track 2). 

The message is tare-in general. But for the case of a spherical star—and only 
for that case—the message loses its bite. Spherical symmetry allows one to select 
a distribution of the total mass-energy that is physically reasonable. In Schwarzschild 
coordinates, it is defined by 

“total mass-energy inside radius r” = m{r) = I 4?7 r 2 pdr. (1) 

•'o 

The fully convincing argument for this definition is found only by considering a 
generalization of it to time-dependent spherically symmetric stars (pulsating, col¬ 
lapsing, or exploding stars; see Chapters 26 and 32, and especially exercise 32.7). 
For them one finds that the mass-energy m associated with a given ball of matter 
(fixed baryon number) can change in time only to the extent that locally measurable 
energy fluxes can be detected at the boundary of the ball. [Such energy fluxes could 
be the power expended by pressure forces against the moving boundary surface, 
or heat fluxes, or radiation (photon or neutrino) fluxes. But since spherically symme¬ 
tric gravitational waves do not exist (Chapters 35 and 36), neither physical intuition 
nor Einstein’s equations require that problems of localizing gravitational-wave energy 
be faced.] Thus the energy m is localized, not by a mathematical convention, but 
by the circumstance that transfer of energy (with this definition of m) is detectable 
by local measurements. [For the mathematical details of m(r, t) in the time-depend¬ 
ent case, see Misner and Sharp (1964), Misner (1965), and exercise 32.7.] 

In addition to the critical “local energy flux” property of m{r) described above, 
there are three further properties that verify its identification as mass-energy. They 
are: (1) Everywhere outside the star 


m(r) = M= 


( total mass-energy of star as measured from 
Kepler’s third law for distant planets 


( 2 ) 


see §23.6 for proof. (2) For a Newtonian star, where “mass inside radius r” has a 
unique meaning, m(r) is that mass. (3) For a relativistic star, m(r) splits nicely into 
“rest mass-energy” m 0 (r) plus “internal energy” U(r) plus “gravitational potential 
energy” &(r). 

To recognize and appreciate the split 

m(r) = m a (r) + U(r ) + Q(r), (3) 


proceed as follows. First split the total density of mass-energy, p, into a part p 0 w 
due to rest mass—where /u 0 is the average rest mass of the baryonic species pres- 


604 


23. SPHERICAL STARS 


r 


Box 23.1 (continued) 

ent—and a part p — p 0 /t due to internal thermal energy, compressional energy, etc. 
Next notice that the proper volume of a shell of thickness dr is 


d L T — 4'7rr 2 (e A dr) = 4^r 2 (l — 2m/r) 1/2 dr , 
not 4 tt/* 2 dr. Consequently, the total rest mass inside radius r is 

m 0 = f /jL 0 nd^ r = f 4flr 2 (l —2 m/r)~ 1/2 ii 0 n dr, 

J o •'o 

and the total internal energy is 

£/ = J* (p — p 0 n) = J* 4-77r 2 (l — 2m/r)“ 1/2 (p — p 0 «) dr. 


(4) 


(5) 


( 6 ) 


Subtract these from the total mass-energy, m; the quantity that is left must be 
the gravitational potential energy, 


a 


— — f p[(l — 2m/r) 1/2 — l]4flr 2 dr 
*<) 

— I (f>m/r)4'nr 2 dr. 

t ° 

•—[Newtonian limit, m/r < 1] 


(7) 


(See exercise 23.7.) 


Equation for 0 derived Turn next to the rr component of the field equations: 

G - ~ —r ~ 2 4 * r~ 2 e~ 2A 4 * 2 r~ l e~ 2A d<P/dr 

— 8-77 = 8-77/7. 

Solving this equation for the derivative of and replacing e~ 2A by 1 — 2 m/r, one 
obtains an expression for the gradient of the potential <P\ 

d<P m 4 4-77 r 3 p 
dr r(r — 2m) 

This expression reduces to the familiar formula 

d<P/dr = m/r 2 


(23.21) 

(23.2 IN) 


in the Newtonian limit. 
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In most studies of stellar structure, one replaces equation (23.17) by the equivalent 
equation obtained with the help of (23.21), 


dp _ (p 4* p)(m 4* 4^rr 3 p) 
dr r(r — 2 m) 


(23.22) 


This is called the Oppenheimer-Volkoff (OV) equation of hydrostatic equilibrium. 
Its Newtonian limit, 


dp/dr ~ —f>m/r 2 


(23.22N) 


is familiar. 

Compare two stellar models, one relativistic and the other Newtonian. Suppose 
that at a given radius r [determined in both cases by (proper area) = 47rr 2 ], the two 
configurations have the same values of p, p , and m. Then in the relativistic model 
the pressure gradient is 


dp dp 

d (proper radial distance) e A dr 

(p 4- p)(m 4* 47 Tr 3 p) 
r 2 (l — 2 m/r) 1/2 

In contrast, Newtonian theory gives for the pressure gradient 

dp dp pm 

^(proper radial distance) dr r 2 


(23.23) 


(23.23N) 


The relativistic expression for the gradient is larger than the Newtonian expression 

(1) because the numerator is larger (added pressure term in both factors) and 

(2) because the denominator is smaller [shrinkage factor (1 — 2m/r) 1/2 ]. Therefore, 
as one proceeds deeper into the star, one finds pressure rising faster than Newtonian 
gravitation theory would predict. Moreover, this rise in pressure is in a certain sense 
“self-regenerative.” The more the pressure goes up, the larger the pressure-correction 
terms become in the numerator of (23.23); and the larger these terms become, the 
faster is the further rise of the pressure as one probes still deeper into the star. The 
geometric factor [1 — 2m(r)/r] l/2 in the denominator of (23.23) further augments 
this regenerative rise of pressure towards the center. It is appropriate to summarize 
the situation in short-hand terms by saying that general relativity predicts stronger 
gravitational forces in a stationary body than does Newtonian theory. These forces, 
among their other important effects, can pull certain white-dwarf stars and super- 
massive stars into gravitational collapse under circumstances (see Chapter 24) where 
Newtonian theory would have predicted stable hydrostatic equilibrium. As the most 
elementary indication that a new factor has surfaced in the analysis of stability, note 
that no star in hydrostatic equilibrium can ever have 2 m(r)/r > 1 (see Box 23.2 
for one illustration and §23.8 for discussion), a phenomenon alien to Newtonian 
theory. 

Now in hand are five equations of structure [two equations of state (23.16); 
equation (23.19), expressing m(r) = \r( 1 — e~ 2/1 ) as a volume integral of p; the source 


Equation of hydrostatic 
equilibrium rewritten in 
"OV" form 


Comparison of pressure 
gradients in Newtonian and 
relativistic stars 


Equations of stellar structure 
summarized 
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EXERCISES 


equation (23.21) for 4>\ and the OV equation of hydrostatic equilibrium (23.22)] for 
the five structure functions p,p, n , 0, A . If the theory of relativistic stars as outlined 
above is well posed, then each of the remaining eight Einstein field equations 
Gafi — must be either vacuous (“0 = 0”), or must be a consequence of the 

five equations of structure. This is, indeed, the case, as one can verify by straightfor¬ 
ward but tedious computations. 

To construct a stellar model, one needs boundary conditions as well as structure 
equations. To facilitate the presentation of boundary conditions, the next section 
will examine the star’s external gravitational field. 


Exercise 23.3. LAW OF LOCAL ENERGY-MOMENTUM CONSERVATION 
(for readers who have not studied Chapter 22) 

Evaluate the four components of the equation T a(i = 0 for the stress-energy tensor (23.14) 
in the Schwarzschild coordinate system of equation (23.7). [Answer: only P* ;j8 = 0 gives 
a nonvacuous result; it gives equation (23.17).] 


Exercise 23.4. EINSTEIN CURVATURE TENSOR 

(for readers who have not studied Chapter 14) 

Calculate the components of the Einstein curvature tensor, G a/J , in Schwarzschild coordinates. 
Then perform a transformation to obtain G the components in the orthonormal frame 
of equations (23.15a,b). [See Box 8.6, or Box 14.2 and equation (14.7).] 


Exercise 23.5. TOTAL NUMBER OF BARYONS IN A STAR 

Show that, if r = R is the location of the surface of a static star, then the total number of 
baryons inside the star is 

A = f 4t rr 2 ne A dr. (23.24) 

[Hint: See the discussion of m 0 in Box 23.1.] 


Exercise 23.6. BUOYANT FORCE IN A STAR 

An observer at rest at some point inside a relativistic star measures the radial pressure-buoyant 
force, F buoy , on a small fluid element of volume V. Let him use the usual laboratory tech¬ 
niques. Do not confuse him by telling him he is in a relativistic star. What value will he 
find for F huoy> in terms of p,p, m, V, and dp/drl If he equates this buoyant force to an equal 
and opposite gravitational force, F^. &w , what will F^ y be in terms of p,/?, m, F, and r? (Use 
equation 23.22.) How do these results differ from the corresponding Newtonian results? 


Exercise 23.7. GRAVITATIONAL ENERGY OF A NEWTONIAN STAR 

Calculate in Newtonian theory the energy one would gain from gravity if one were to 
construct a star by adding one spherical shell of matter on top of another, working from 
the inside outward. Use Laplace’s equation (r 2 <P r ) r = 4nr 2 p and the equation of hydrostatic 
equilibrium p r — — p<P r to put the answer in the following equivalent forms: 
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(energy gained from gravity) = — (gravitational potential energy) 


jR R 

= f (pr0 T )4iTr 2 dr = f (pm/r)4i7r 2 dr 
' J o 

= — — f (p0)4?7r 2 dr = — f (0 T ) 2 4i7r 2 dr 
2 J Q 5^7 J 0 

r R 

= 3 J 477r 2 /? dr . 


§23.6. EXTERNAL GRAVITATIONAL FIELD 

Outside a star the density and pressure vanish, so only the metric parameters # 
and yl = —^ In (1 — 2m/r) need be considered. From equation (23.19) one sees that 
“the mass inside radius r,” m(r), stays constant for values of r greater than R (outside 
the star). Its constant value is denoted by M: 

m(r) = M for r > R (i.e., outside the star). (23.25) 

By integrating equation (23.21) with p = 0 and m — AT, and by imposing the 
boundary condition (23.10) on 0 at r = oo (“normalization of scale of time at 
r — oo”), one finds 


0(r) = yin (1 - 2 M/r) for r > R. (23.26) 

Consequently, outside the star the spacetime geometry (23.7) becomes 

ds^-i 1 - dt* + - + r 2 {d6 2 + sin^ d<j> 2 ). (23.27) 

\ r 1 (1 - 2 M/r) 

This is called the “Schwarzschild geometry” or “Schwarzschild gratitational field” 
or “Schwarzschild line element,” because Karl Schwarzschild (1916a) discovered it 
as an exact solution to Einstein’s field equations a few months after Einstein formu¬ 
lated general relativity theory. 

In that region of spacetime, r>2M, where the geometry is nearly flat, Newton’s 
theory of gravity is valid, and the Newtonian potential is 

0 = — M/r for r > R, r > 2M . (23.26N) 

Consequently, M is the mass that governs the Keplerian motions of planets in the 
distant, Newtonian gravitational field—i.e., it is the star’s “total mass-energy” (see 
Chapters 19 and 20). Since the metric (23.27) far outside the star is precisely diagonal 
(g t j == 0), the star’s total angular momentum must vanish. This result accords with 
the absence of internal fluid motions. 


Spacetime outside star 
possesses "Schwarzschild" 
geometry 


Total mass-energy of star 




608 


23. SPHERICAL STARS 


Equations of stellar structure 
collected together 


How to solve the equations 
of stellar structure 


§23.7. HOW TO CONSTRUCT A STELLAR MODEL 

The equations of stellar structure (23.16), (23.19), (23.21), (23.22), and associated 
boundary conditions (to be discussed below), all gathered together along with the 
line element, read as follows. 


Line Element 


ds 2 = dt 2 + 


dr 2 

1 — 2 m/r 


+ r 2 (d6 2 + sin 2 # d<j> 2 ) 



dr 2 

1 - 2 M/r 


+ r 2 (d6 2 + sin 2 # d<p 2 ) 


(23.27') 


for r > R. 


Mass Equation 


m — 


I 4?rr 2 p dr, with m(r = 0) = 0. 
‘'o 


(23.28a) 


OV Equation of Hydrostatic Equilibrium 

+ ^ , with p(r — 0) = p = central pressure. (23.28b) 

dr r(r — 2m) 


Equations of State 


p — p(n ), -- (23.28c) 

p = p(rt). (23.28d) 


Source Equation for <P 

= C + with#(f=Ji)= > ln( ,_2«/J!). (23.28e) 

dr r(r — 2m) 2 

To construct a stellar model one can proceed as follows. First specify the equations 
of state (23.28c,d) and a value of the central pressure, p c . Also specify an arbitrary 
(later to be renormalized) value, # 0 , for <P{r — 0). The boundary conditions 
p(r = 0) = p c , <P{r — 0) = <P 0 , m(r — 0) = 0 are sufficient to determine uniquely the 
solution to the coupled equations (23.28). Integrate these coupled equations outward 
from r — 0 until the pressure vanishes. [The OV equation, (23.28b), guarantees that 
the pressure will decrease monotonically so long as the equations of state obey the 
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reasonable restriction p > 0 for all p > 0.] The point at which the pressure reaches 
zero is the star’s surface; the value of r there is the star’s radius, R\ and the value 
of m there is the star’s total mass-energy, Af. Having reached the surface, renormalize 
4> by adding a constant to it everywhere, so that it obeys the boundary condition 
(23.28e). The result is a relativistic stellar model whose structure functions <P , m, 
p, p , n satisfy the equations of structure. 

Notice that for any fixed choice of the equations of state p = p(ri), p = p(n), the 
stellar models form a one-parameter sequence (parameter p c ). Once the central 
pressure has been specified, the model is determined uniquely. 

The next chapter describes a variety of realistic stellar models constructed numer¬ 
ically by the above prescription. For an idealized stellar model constructed analyti¬ 
cally, see Box 23.2. 


Exercise 23.8. NEWTONIAN STARS OF UNIFORM DENSITY EXERCISE 

Calculate the structures of uniform-density configurations in Newtonian theory. Show that 
the relativistic configurations of Box 23.2 become identical to the Newtonian configurations 
in the weak-gravity limit. Also show that there are no mass or radius limits in Newtonian 
theory. 

(continued on page 612) 


--N 

Box 23.2 RELATIVISTIC MODEL STAR OF UNIFORM DENSITY 

For realistic equations of state (see next chapter), the equations of stellar structure 
(23.28) cannot be integrated analytically; numerical integration is necessary. How¬ 
ever, analytic solutions exist for various idealized and ad hoc equations of state. One 
of the most useful analytic solutions [Karl Schwarzschild (1916b)] describes a star 
of uniform density, 

p = p 0 = constant for all p. (1) 

It is not necessary to indulge in the fiction of “an incompressible fluid” to accept 
this model as interesting. Incompressibility would imply a speed of sound, 
v = ( dp/dp ) u2 , of unlimited magnitude, therefore in excess of the speed of light, 
and therefore in contradiction with a central principle of special relativity (“principle 
of causality”) that no physical effect can be propagated at a speed v > 1. (If a source 
could cause an effect so quickly in one local Lorentz frame, then there would exist 
another local Lorentz frame in which the effect would occur before the source had 
acted!) However, that the part of the fluid in the region of high pressure has the 
same density as the part of the fluid in the region of low pressure is an idea easy 
to admit, if only one thinks of the fluid having a composition that varies from one 

v_ 





Box 23.2 (continued) 


r value to another ("hand-tailored”). Whether one thinks along this line, or simply 
has in mind a globe of water limited in size to a small fraction of the dimensions 
of the earth, one has in Schwarzschild’s model an instructive example of hydrostatics 
done in the framework of Einstein’s theory. 

The mass equation (23.28a) gives immediately 


(4sr/3 )p 0 r 3 for r < R 

M = (4ir/3)p 0 R 3 for r>R.' 


( 2 ) 


from which follows the length-correction factor in the metric 

^(proper disn.ce) = ^ „ _ 2m(r)/ri -m ( 3 ) 

dr 


When for ease of visualization the space geometry (r, <j>) of an equatorial slice through 
the star is viewed as embedded in a Euclidean 3-geometry (z,r,<j>) [see §23.8], the 
“lift” out of the plane z = 0 is 


z(r) = 


(R 3 /2M) 1/2 [l - (1 - 2Mr 2 /f? 3 ) 1/2 ] for r < R, 

(R 3 /2M) 1/2 [\ - (1 - 2 M/R) 1/2 ] + [ 8 M(r - 2M)] 1/2 - [ 8 M(R - 2M)] 1/2 

for r > R. (4) 


The knowledge of m(r) from (2) allows the equation of hydrostatic equilibrium 
(23.28b) to be integrated to give the pressure: 


f (1 - 2 Mr 2 /R 3 f' 2 - (1 - 2M/R) V2 1 
P ~ p «( 3 (i _ 2 M/R) 1/2 - (1 - 2 Mr 2 /R 3 ) 1/2 l t01 " r 


(5) 


The pressure in turn leads via (23.28e) to the time-correction factor in the metric. 
d(proper time) 


dt 



l( 

, 2M\ 1/Z 1/ 

2 Mr 2 \ 1/2 

for r < R 

e* = 

2 \ 

R ) 2 \ 

R 3 ) 



. (1 

- 2 M/r) 1/2 


for r > R. 


( 6 ) 


Several features of these uniform-density configurations are noteworthy. (1) For 
fixed energy density, p 0 , the central pressure 


f 1 - (1 - 2 M/R) 1 ' 2 1 
13(1 - 2 ~M/W F2 - 1 )’ 


(7) 


increases monotonically as the radius, R, increases—and, hence, also as the mass, 
M = (4w/3)p 0 f? 3 , and the ratio (“strength of gravity”) 

2 M/R = (8w/3 )p 0 R 2 


( 8 ) 



increase. This is natural, since, as more and more matter is added to the star, a 
greater and greater pressure is required to support it. (2) The central pressure 
becomes infinite when M, R , and 2 M/R reach the limiting values 


-ft ii m = (9/4)M Um = (3wp 0 )~ 1/2 , (9) 

(2 M/R\ m = 8/9. (10) 

No star of uniform density can have a mass and radius exceeding these limits. These 

limits are purely relativistic phenomena; no such limits occur in Newtonian theory. 
(3) Inside the star the space geometry (geometry of a hypersurface t = constant) 
is that of a three-dimensional spherical surface with radius of curvature 

a = (3/877 Po )i/2. (11) 

[See equation (4), above.] Outside the star the (Schwarzschild) space geometry 
is that of a three-dimensional paraboloid of revolution. The interior and exterior 
geometries join together smoothly. All these details are shown in the following three 
diagrams. There all quantities are given in the following geometric units (to convert 
mass in g or density in g/cm 3 into mass in cm or density in cm -2 , multiply by 
0.742 X 10 _28 cm/g): lengths, in units (3/8?7p 0 ) 1/2 ; pressure, in units p 0 ; mass, in 
units (3/32 t 7 Po ) 1/2 . 
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r 

Box 23.2 (continued) 

The mass “after assembly” is what is called M. The mass of the same fluid, 
dispersed in droplets at infinite separation, is called M before in the following table. 



^before 

small 

0.0882 

0.894 

1.0913 

1.374 

M 

small 

0.0828 

0.636 

0.729 

0.838 (critical) 

Difference 

(binding): 

tb m- v3 

0.0054 

0.258 

0.362 

0.536 


v_ 


§23.8. THE SPACETIME GEOMETRY FOR A STATIC STAR 


Surface area of spheres, A-^rr 2 

(1) increases monotonically 
from center of star 
outward 


dr — zt(l — 2 m/r) 1/2 di. (23.29) 

Note that r is zero at the center of the star (where m cc r 3 ), and note that r is always 
nonnegative by definition. Therefore r must at first increase with X as one moves 
outward from 1=0; r(X) can later reach a maximum and start decreasing only at 
a point where 1m/r becomes unity [see equation (23.29)]. Such a behavior can and 
does happen in a closed model universe, a 3-sphere of uniform density and radius 
a, where 


For a highly relativistic star, the spacetime geometry departs strongly from Euclid- 
Lorentz flatness. Consequently, there is no a priori reason to expect that the surface 
area 477r 2 , and hence also the radial coordinate r, will increase monotonically as 
one moves from the center of the star outward. Fortunately, the equations of stellar 
structure guarantee that r will increase monotonically from 0 at the star's center to 
00 at an infinite distance away from the star, so long as p > 0 and so long as the 
star is static (equilibrium). 

The monotonicity of r can be seen as follows. Introduce as a new radial coordinate 
proper distance, 1, from the center of the star. By virtue of expression (23.27 ) for 
the metric, 1 and r are related by 


r(i) = a sin (i/a) 

[see Chapter 27; especially the embedding diagram of Box 27.2(A)], However, the 
field equations demand that such a system be dynamic. Here, on the contrary, 
attention is limited to a system where conditions are static. In such a system, the 
condition of hydrostatic equilibrium (23.28b) applies. Then the pressure gradient 
is given by an expression with the factor [1 — 2 m(r)/r] in its denominator. If 2 m/r 
approaches unity with increasing X in some region of the star, the pressure gradient 
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there becomes so large that one comes to the point p = 0 (surface of the star) before 
one comes to any point where 2 m(r)/r might attain unit value. Moreover, after the 
surface of the star is passed, m remains constant, m(r) = M , and 2 m{r)/r decreases. 
Consequently, 2m/r is always less than unity; and r(X) cannot have a maximum, 
Q.e.d. (Details of the proof are left to the reader as exercise 23.9.) 

Although the radii of curvature, r, and corresponding spherical surface areas, 4 at 2 , 
increase monotonically from the center of a star outward, they do not increase at 
the same rate as they would in flat spacetime. In flat spacetime the rate of increase 
is given by dr/d (proper radial distance) = dr/di = 1. In a star it is given by 
dr/di = (1 — 2 m/r) 1/2 < 1. Consequently, if one were to climb a long ladder 
outward from the center of a relativistic star, measuring for each successive spher¬ 
ical shell its Schwarzschild r-value (“proper circumference”/^), one would find 
these r-values to increase surprisingly slowly. 

This strange behavior is most easily visualized by means of an “embedding 
diagram.” It would be too much for any easy visualization if one were to attempt 
to embed the whole curved four-dimensional manifold in some higher-dimensional 
flat space. [See, however, Fronsdal (1959) and Clarke (1970) for a global embedding 
in 5 + 1 dimensions, and Kasner (1921b) for a local embedding in 4 + 2 dimensions. 
One can never embed a non-flat, vacuum metric (G = 0) in a flat space of 5 
dimensions (Kasner, 1921c).] Therefore seek a simpler picture (Flamm 1916). Space 
at one time in the context of a static system has the same 3-geometry as space at 
another time. Therefore, depict 3-space only as it is at one time, t = constant . 
Moreover, at any one time the space itself has spherical symmetry. Consequently, 
one slice through the center, r — 0, that divides the space symmetrically into two 
halves (for example, the equatorial slice, 9 = 77/2) has the same 2-geometry as any 
other such slice (any selected angle of tilt, at any azimuth) through the center. 
Therefore limit attention to the 2-geometry of the equatorial slice. The geometry 
on this slice is described by the line element 


(2) but increases more slowly 
than in flat spacetime 


Embedding of spacetime in a 
flat space of higher 
dimensionality 


ds 2 = [1 - 2m(r)/r]~ 1 dr 2 + r 2 d<t > 2 . (23.30) 


Now one may embed this two-dimensional curved-space geometry in the flat geom¬ 
etry of a Euclidean three-dimensional manifold. 

If the curvature of the two-dimensional slice is zero or negligible, the embedding Construction of "embedding 
is trivial. In this event, identify the 2-geometry with the slice z = 0 of the Euclidean diagram" for equatorial slice 
3-space. Moreover, introduce into that 3-space the familiar cylindrical coordinates throu 9 h star 
z, r, <j>, that one employs for any problem with axial symmetry (see Fig. 23.1 and 
Box 23.2 for more detail). Then one recognizes the flat two-dimensional slice as the 

with z =- 0, with <f> running from 0 to 277, and 
r from 0 to oo. One has identified the r and <£ of the slice with the r and of the 
Euclidean 3-space. 

If the 2-geometry is curved, as it is when the equatorial section is taken through 
a real star, then maintain the identification between the r, <f>, of the slice and the 
r, <f>, of the Euclidean 3-geometry, but bend up the slice out of the plane z = 0 (except 
at the origin, r — 0). At the same time, insist that the bending be axially symmetric. 

In other words, require that the amount of the “lift” above the plane z = 0 shall 
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Geometry within (grey) and around (white) a star of radius R = 2.66A/, schematically displayed. The 
star is in hydrostatic equilibrium and has zero angular momentum (spherical symmetry). The two- 
dimensional geometry 


ds 2 = [1 — 2m(r)/r]~ l dr 2 -f r 2 d<$> 2 

of an equatorial slice through the star (8 — n/2, t = constant) is represented as embedded in Euclidean 
3-space, in such a way that distances between any two nearby points (r, <$>) and (r 4- dr, <> 4- d<*>) are 
correctly reproduced. Distances measured off the curved surface have no physical meaning; points off 
that surface have no physical meaning; and the Euclidean 3-space itself has no physical meaning. Only 
the curved 2-geometry has meaning. A circle of Schwarzschild coordinate radius r has proper circum¬ 
ference 2<nr (attention limited to equatorial plane of star, 8 = tt/ 2). Replace this circle by a sphere of 
proper area 4 tjt 2 , similarly for all the other circles, in order to visualize the entire 3-geometry in and 
around the star at any chosen moment of Schwarzschild coordinate time t . The factor [1 — 2m(r)/r] -1 
develops no singularity as r decreases within r = 2M , because m{r) decreases sufficiently fast with decreas¬ 
ing r. 


be independent of $, whatever may be its dependence on r. Thus the whole story - 
of the embedding is summarized by the single function, the lift, 

z — z(r) (“embedding formula”). 

The geometry on this curved two-dimensional locus in Euclidean space (a made-up 
3-space; it has nothing whatever to do with the real world) is to be identical with 
the geometry of the two-dimensional equatorial slice through the actual star; in other 
words, the line elements in the two cases are to be identical. To work out this 
requirement in mathematical terms, write the line element in three-dimensional 
Euclidean space in the form 

ds 2 = dz 2 + dr 2 + r 2 d<t> 2 . (23.31) 

Restrict to the chosen locus (“lifted surface”) by writing z = z(r) or dz = (dz/dr) dr. 
Thus have 




(23.32) 
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on the two-dimensional locus in the 3-geometry, to be identified with 
ds 2 = [1 - 2m(r)/r] _1 dr 2 + r 2 d<(> 2 
in the actual star. Compare and conclude 

(^5r) 2 + 1 = I 1 “ Zwto/iT 1 . (23.33) 

This equation is information enough to find the lift as a function of r; thus, 

r T dr 

z(r) = J ---— everywhere, (23.34a) 

UMr) ~ M 

z(r) = [8 M(r — 2 M)] l/2 + constant outside the star. (23.34b) 

Outside the star this embedded surface is a segment of a paraboloid of revolution. 
Its form inside the star depends on how the mass, m , varies as a function of r. Recall 
that m(r) varies as (477/3)p c r 3 near the center of the star. Conclude that the embedded 
surface there looks like a segment of a sphere of radius a — (3/877p c ) 1/2 ; thus, 

[a - z(r)] 2 + r 2 = a 2 for r < a = (3/^p c ) 1/2 . (23.34c) 

In the special case of a star with uniform density (Box 23.2), the entire interior is 
of the spherical form (23.34c); in the general case it is not. In all cases, because 
r > 2 m(r\ equation (23.34a) produces a surface with z and r as monotonically 
increasing functions of each other. This means that the embedded surface always 
opens upward and outward like a bowl; it always looks qualitatively like Figure 
23.1; it never has a neck, and it never flattens out except asymptotically at r — oo. 
At the star’s surface, even though the density may drop discontinuously to zero (p 
finite inside when p = 0; p zero outside), the interior and exterior geometries will 
join together smoothly [ dz/dr , as given by equation (23.33), is continuous]. 

It must be emphasized that only points lying on the embedded 2-surface have 
physical significance so far as the stellar geometry is concerned: the three-dimen¬ 
sional regions inside and outside the bowl of Figure 23.1 are physically meaningless. 
So is the Euclidean embedding space. It merely permits one to visualize the geometry 
of space around the star in a convenient manner. 


Exercise 23.9. GOOD BEHAVIOR OF r 

Carry out explicitly the full details of the proof, at the beginning of this section, that 2 m/r 
is always less than unity and r is a monotonic function of l. 

Exercise 23.10. CENTER OF STAR OCCUPIED BY IDEAL FERMI GAS AT 
EXTREME RELATIVISTIC LIMIT 

Opposite to the idealization of a star built from an incompressible fluid is the idealization 
in which it is built from an ideal Fermi gas [ideal neutron star; see Oppenheimer and Volkoff 
(1939)] at zero temperature, so highly compressed that the particles have relativistic energies, 


Description of embedded 
surface 


EXERCISES 
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in comparison with which any rest mass they possess is negligible. In this limit, with two 
particles per occupied cell of volume h 3 in phase space, one has 


and finally 


/number density} = „ = (2/h ^ f” F p'- dp = 8 *p//3h\ 
\of fermions / J 0 

/density of \ = = (2 /h*)4* P F cp • p* dp = 2 vcPt * 

\ mass-energy / r r 


P = 


A ener gy \ 

\per particle/ 
^/volume \ 
\per particle/ 


d(p/n) 

d(l/n) 


/h 3 


= 2ircp T 4 /3h 3 = p/3, 


as if one were dealing with radiation instead of particles (p F = Fermi momentum; momen¬ 
tum of highest occupied state). 


C Box 23.3 RIGOROUS DERIVATION OF THE SPHERICALLY SYMMETRIC LINE ELEMENT 


Section 23.2 gave a heuristic derivation of the 
general spherically symmetric line element (23.7). 
This box attempts a more rigorous derivation, 
applicable to nonstatic systems, as well as static 
ones. 

Begin with a manifold M 4 on which a metric 
ds 2 of Lorentz signature is defined. Assume M 4 
to be spherically symmetric in the sense that to 
any 3x3 rotation matrix A there corresponds a 
mapping (rotation) of M 4 , also called A (A:M 4 
— M 4 :& — t-AP), that preserves the lengths of 
all curves. Further assumptions and constructions 
will be numbered (i), (ii), etc., so one can see what 
specializations are needed to get to the line ele¬ 
ment (23.7). Daggers (f) indicate assumptions that 
are found inapplicable to some other physically 
interesting situations. 

For any point form the set s ~ S(f?) = 
{A& £ M 4 \A £ S0(3)} of all points equivalent 
to 9 under rotations. Assume (i)f that s is 
a two-dimensional surface (except for center 
points, where s is zero-dimensional), and (ii) that 
the metric on i 1 is that of a standard 2-sphere. Then 
on s one will have 

(ds\ = R\s)dS2\ ( 1 ) 

where dQ 2 is the standard metric of a unit sphere 
( dQ 2 = d8 2 -{- sin 2 0 d<t> 2 for some 8 , <f>, defined on 
s), and where IttR is the circumference of 5. If M 2 
is the set of all such surfaces then S:M 4 —>- 


M 2 :$> — y- s = S(<?) allows one to obtain, from 
R : M 2 —*- <&:s — R(s) [the “circumference” 
function on M 2 as defined by equation (1)], a 
corresponding function R:M 4 — $l'.9 —*■ 
R(S(&)) on M 4 which in some cases can eventually 
be used as a coordinate on M 4 . (Note: $ denotes 
here the real numbers.) 

Now assume (iii)f there is a spherically symme¬ 
tric 4-velocity field u , defined so that if 9 = G£x ) 
is one trajectory of u with u = d/dr , then each 
curve ~ AG(r) obtained by a rotation must also 
be a trajectory of u. The orthogonal projection of 
u onto any sphere s must then vanish, as there 
are no rotation invariant non-zero vector fields on 
2-spheres. Thus u is orthogonal to each s. Also, 
if two trajectories of u start on some same sphere 
5, so 6^(0) = AG 2 ( 0), then the same rotation^ will 
always relate them, G x {r) = AG 2 ( t), since trajec¬ 
tories are uniquely defined by any one point on 
them. Then SCG^r)) and S(G 2 (r)) are both the 
same curve in M 2 , whose tangent d/dr one can 
call also u ; in this way one obtains a vector field 
u on M 2 . Give each trajectory of u on M 2 a differ¬ 
ent label r to define a function r(^) on A/ 2 . Denote 
by r — r(S(^)) a corresponding function r on M 4 
with dr/dr — 0. Since functions and their gradi¬ 
ents on M 4 define corresponding quantities on M 2 , 
inner products such as df * dg can be defined on 
M 2 by their values on M 4 \ thus, from the metric 
on M 4 one obtains a metric on M 2 . Then by equa- 
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(a) Write out the relativistic equation of hydrostatic equilibrium for a substance satisfying 
the equation of state p — p/3. 

(b) Show that there exists a well-defined analytic solution for the limiting case of infinite 
central density, in which m{r)/r has the value 3/14. 

(c) Find p(r), p(r ), and n(r). 

(d) Show that the number of particles out to any finite r-value is finite, despite the fact 
that n(r) is infinite at the origin. 

(e) Show that the 3-geometry has a “conical singularity” at r = 0. 

(f) Make an “embedding diagram” for this 3-geometry [“lift” z(r) as a function of r from 
(23.34)]. (Note that the conical singularity at r — 0, otherwise physically unreasonable, arises 
because the density of mass-energy goes to infinity at that point. Note also that the calculated 
mass of the system diverges to infinity as r —oo. In actuality with decreasing density the 
Fermi momentum falls from relativistic to nonrelativistic values, the equation of state changes 
its mathematical form, and the total mass M converges to a finite value). 


tion (23.5) or equivalently by drawing curves in 
M 2 orthogonal to the r = const, lines, and giving 
each a different label t , one obtains coordinates 
with g rt — dr' dt ^ 0 . Both r and t labels were 
assigned arbitrarily on the corresponding curves, 
so it is clear that transformations t' = t'(t) and 
r' — r'(r) are not excluded. 

On one 2-sphere s in A/ 4 , on the t — 0 hypersur¬ 
face, choose a set of 8 , <£ coordinates by picking 
the pole {8 = 0 ) and the prime meridian (<f> = 0 ) 
arbitrarily. Then extend the definition of 8,<f>, over 
the t = 0 hypersurface by requiring 8 and <f> to be 
constant on curves orthogonal to each 2 -sphere s, 
i.e., by demanding that (d/dr) d<p be orthogonal to 
each s at t = 0. Extend the definition of 8 and $ 
to t 0 by requiring them to be constant on 
curves with tangent u , so (3/3 t) Td<t) u. But each 
s is a surface of constant r and t ; so (3/30) rt0 and 
(d/cty)r *0 are tangent to s, while u oc (d/dt) is 
orthogonal to each s. Consequently, 

gte = (d/dt)-(d/d8) = 0 (2) 

and 

*f, = (3/3O-0/a*) = O (3) 

in the tr8<(> coordinate system just constructed. The 
vector (9 /dr) te<f> does not depend on the arbitrary 
directions introduced in the original choice of 8 , 4> 
coordinates on one sphere s; it is invariant under 
transformations 8 — 8(8', <£ = <#>')• 

nothing except 8 and <f> introduced nonrotationally 
invariant elements into the discussion; so (d/dr) te<f> 
must be a rotationally invariant vector field (un¬ 


like, say, 3 /d<p); so it is, like u , orthogonal to each 
2-sphere This invariance then gives 


g r$ = (d/dr)-(d/d$) = 0, (4) 

^ r0 = (3/0r) • (a/a<>) = 0, (5) 

which, with g tr = 0 as previously established, gives 
g tr = 0. The result is a line element of the form 
(23.3). Further specialization, a change of radial 
and time coordinates to R and T, where R is de¬ 
fined by (1) above and 


dT= e* 


1 dR 

Srr dr 


dt — LM 

Stt dt 



_ /integrating 
\factor 


followed by a change of notation, leads to 
Schwarzschild coordinates and the line element 
(23.7)—though such a transformation is possible 
(i.e., nonsingular) only where dR A dT 0: 

( 7 Rf= w*i + i 

Stt Stt 

If (iv)f spacetime is asymptotically flat, so 
r —oo is a region where the metric can take on 
its special relativity values, then the arbitrariness 
in the t coordinate, t' = can be ehminated 
by requiring g tt = -las r —>• oo. Then (d/dt\ H 
is uniquely determined by natural requirements 
(independent of the arbitrary 8 , <J>, choices), and 
whenever it is desired to make the further physical 
assumption (v)f of a time-independent geometry, 
this can be appropriately restated as dg^/dt = 0. 


a 


j 
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PULSARS AND NEUTRON STARS; 
QUASARS AND 
SUPERMASSIVE STARS 


Go, wond'rous creature, mount where Science guides. 
Go, measure earth, weigh air, and state the tides ; 

Instruct the planets in what orbs to run. 
Correct old time, and regulate the sun. 

ALEXANDER POPE (1733) 


V 


§24.1. OVERVIEW 

Types of stellar configurations Five kinds of stellar configurations are recognized in which relativistic effects should 
where relativity should be be significant: white dwarfs, neutron stars, black holes, supermassive stars, and 
important relativistic star clusters. The key facts about each type of configuration are summar¬ 

ized in Box 24.1; and the most important details are described in the text of this 
chapter (white dwarfs in §24.2; neutron stars and their connection to pulsars in 
§§24.2 and 24.3; supermassive stars and their possible connection to quasars and 
galactic nuclei in §§24.4 and 24.5; and relativistic star clusters in §24.6; a detailed 
discussion of black holes is delayed until Chapter 33). 

The book Stars and Relativity by Zel’dovich and Novikov (1971) presents a clear 
and very complete treatment of all these astrophysical applications of relativistic 
stellar theory. In a sense, that book can be regarded as a companion volume to 
this one; it picks up, with astrophysical emphasis, all the topics that this book treats 
with gravitational emphasis. This chapter is meant only to give the reader a brief 
survey of the material to be found in Stars and Relativity . 


(continued on page 621 ) 
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Box 24.1. STELLAR CONFIGURATIONS WHERE RELATIVISTIC EFFECTS ARE IMPORTANT 

[For detailed analyses and references on all these topics, 
see Zel'dovich and Novikov (1971).] 


A. White Dwarf Stars 

Are stars of about one solar mass, with radii 
about 5,000 kilometers and densities about 
10 6 g/cm 3 — 1 ton/cm 3 ; support themselves 
against gravity by the pressure of degenerate 
electrons; have stopped burning nuclear fuel, 
and are gradually cooling as they radiate 
away their remaining store of thermal energy. 

Were observed and studied astronomically long 
before they were understood theoretically. 

Key points in history: 

August 1926, Dirac (1926) formulated Fermi- 
Dirac statistics, following Fermi (February). 
December 1926, R. H. Fowler (1926) used 
Fermi-Dirac statistics to explain the nature 
of white dwarfs; he invoked electron de¬ 
generacy pressure to hold the star out 
against the inward pull of gravity. 

1930, S. Chandrasekhar (193la,b) calculated 
white-dwarf models taking account of spe¬ 
cial relativistic effects in the electron-de¬ 
generacy equation of state; he discovered 
that no white dwarf can be more massive 
than —1.2 solar masses (“Chandrasekhar 
Limit”). 

1932, L. D. Landau (1932) gave an elemen¬ 
tary explanation of the Chandrasekhar 
limit. 

1949, S. A. Kaplan (1949) derived the effects 
of general relativity on the mass-radius 
curve for massive white dwarfs, and de¬ 
duced that general relativity probably in¬ 
duces an instability when the radius be¬ 
comes smaller than 1.1 x 10 3 km. 

Role of general relativity in white dwarfs: 
negligible influence on structure; 
significant influence on stability, on pulsation 


frequencies, and on form of mass-radius 
curve near the Chandrasekhar limit (i.e., in 
massive white dwarfs). Electron capture also 
significant. See, e.g., Zel’dovich and Novi¬ 
kov (1971); Faulkner and Gribbin (1968). 

B. Neutron Stars 

Are stars of about one solar mass, with radii 
about 10 km and densities about 10 14 g/cm 3 
(same as density of an atomic nucleus); are 
supported against gravity by the pressure of 
degenerate neutrons and by nucleon-nucleon 
strong-interaction forces; are not burning nu¬ 
clear fuel; the energy being radiated is the 
energy of rotation and the remaining store of 
internal thermal energy. 

Theoretical calculations predicted their exist¬ 
ence in 1934, but they were not verified to 
exist observationally until 1968. 

Key points in history: 

1932, neutron discovered by Chadwick (1932). 
1933-34, Baade and Zwicky (1934a,b,c) (1) 
invented the concept of neutron star; (2) 
identified a new class of astronomical ob¬ 
jects which they called “supemovae”; (3) 
suggested that supemovae might be created 
by the collapse of a normal star to form a 
neutron star. (See Figure 24.1.) 

1939, Oppenheimer and Volkoff (1939) per¬ 
formed the first detailed calculations of the 
structures of neutron stars; in the process, 
they laid the foundations of the general 
relativistic theory of stellar structure as pre¬ 
sented in Chapter 23. (See Figure 24.1.) 
1942, Duyvendak (1942) and Mayall and Oort 
(1942) deduced that the Crab nebula is a 
remnant of the supernova observed by Chi- 
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Box 24.1 (continued) 

nese astronomers in a.d. 1054. Baade (1942) 
and Minkowskii (1942) identified the 
“south preceding star,” near the center of 
the Crab Nebula, as probably the (col¬ 
lapsed) remnant of the star that exploded 
in 1054 (see frontispiece). 

1967, Pulsars were discovered by Hewish et 
al (1968). 

1968, Gold (1968) advanced the idea that 
pulsars are rotating neutron stars; and sub¬ 
sequent observations confirmed this sug¬ 
gestion. 

1969, Cocke, Disney, and Taylor (1969) dis¬ 
covered that the “south preceding star” of 
the Crab nebula is a pulsar, thereby clinch¬ 
ing the connection between supernovae, 
neutron stars, and pulsars. 

Role of general relativity in neutron stars: 
significant effects (as much as a factor of 2) 
on structure and vibration periods; 
gravitational radiation reaction may be the 
dominant force that damps nonradial vi¬ 
brations. 

C. Black Holes 

Are objects created when a star collapses to a 
size smaller than twice its geometrized mass 
(R < 2M— (M/M e ) X 3 km), thereby creat¬ 
ing such strong spacetime curvatures that it 
can no longer communicate with the external 
universe (detailed analysis of black holes in 
Chapters 33 and 34). 

No one who accepts general relativity has found 
any way to escape the prediction that black 
holes must exist in our galaxy. This prediction 
depends in no way on the complexity of the 
collapse that forms the black holes, or on 
unknown properties of matter at high density. 
However, the existence of black holes has not 
yet been verified observationally. 



Key points in history: 

1795, Laplace (1795) noted that, according to 
Newtonian gravity and Newton’s corpuscu¬ 
lar theory of light, light cannot escape from 
a sufficiently massive object (Figure 24.1). 
1939, Oppenheimer and Snyder (1939) calcu¬ 
lated the collapse of a homogeneous sphere 
of pressure-free fluid, using general rela¬ 
tivity, and discovered that the sphere cuts 
itself off from communication with the rest 
of the universe. This was the first calcula¬ 
tion of how a black hole can form (Figure 
24.0. 

1965, Beginning of an era of intensive theo¬ 
retical investigation of black-hole physics. 

Role of general relativity in black-hole physics: 
No sensible account of black holes possible 
in Newtonian theory. The physics of black 
holes calls on Einstein’s description of grav¬ 
ity from beginning to end. 

D. Supermassive Stars 

Are stars of mass between 10 3 and 10 9 solar 
masses, constructed from a hot plasma of 
density typically less than that in normal 
stars; are supported primarily by the pressure 
of photons, which are trapped in the plasma 
and are in thermal equilibrium with it; bum 
nuclear fuel (hydrogen) at some stages in their 
evolution. 

Theoretical calculations suggest (but not with 
complete confidence) that supermassive stars 
exist in the centers of galaxies and quasars, 
and perhaps elsewhere. Supermassive stars 
conceivably could be the energy sources for 
some quasars and galactic nuclei. However, 
astronomical observations have not yet 
yielded definitive evidence about their exist¬ 
ence or their roles in the universe if they do 
exist. 



§24.2. ENDPOINT OF STELLAR EVOLUTION 



Key points in history: 

1963, Hoyle and Fowler (1963a,b) conceived 
the idea of supermassive stars, calculated 
their properties, and suggested that they 
might be associated with galactic nuclei and 
quasars. 

1963-64, Chandrasekhar (1964a,b) and 
Feynman (1964) developed the general 
relativistic theory of stellar pulsations; and 
Feynman used it to show that supermassive 
stars, although Newtonian in structure, are 
subject to a general-relativistic instability. 
1964 and after, calculations by many workers 
have elaborated on and extended the ideas 
of Hoyle and Fowler, but have not pro¬ 
duced any spectacular breakthrough. 

Role of general relativity in supermassive stars: 
negligible influence on structure, except in the 
extreme case of a compact, rapidly rotating, 
disc-like configuration [see Bardeen and 
Wagoner (1971); Salpeter and Wagoner 
(1971)]. 

significant influence on stability. 

E. Relativistic Star Clusters 

Are clusters of stars so dense that relativistic 
corrections to Newtonian theory modify their 
structure. 

V_ 



Theoretical calculations suggest that relativistic 
star clusters might, but quite possibly do not, 
form in the nuclei of some galaxies and qua¬ 
sars; if they do try to form, they might be 
destroyed during formation by star-star colli¬ 
sions, which convert the cluster into super¬ 
massive stars or into a dense conglomerate of 
stars and gas. Astronomical observations have 
yielded no definitive evidence, as yet, about 
the existence of relativistic clusters. 

Key points in history: 

1965, ZePdovich and Podurets (1965) con¬ 
ceived the idea of relativistic star clusters, 
developed the theory of their structure 
using general relativity and kinetic theory 
(cf. §25.7), and speculated about their sta¬ 
bility. 

1968, Ipser (1969) developed the theory of 
star-cluster stability and showed (in agree¬ 
ment with the ZePdovich-Podurets specula¬ 
tions) that, when it becomes too dense, a 
cluster begins to collapse to form a black 
hole. 

Role of general relativity in star clusters: 
significant effect on structure when gravita¬ 
tional redshift from center to infinity ex¬ 
ceeds z c = A\/X — 0.05. 
induces collapse of cluster to form black hole 
when central redshift reaches z c ~ 0.50. 

_ J 


§24.2. THE ENDPOINT OF STELLAR EVOLUTION 

After the normal stages of evolution, stars “die” by a variety of processes. Some 
stars explode, scattering themselves into the interstellar medium; others contract 
into a white-dwarf state; and others—according to current theory—collapse to a 
neutron-star state, or beyond, into a black hole. Although one knows little at present 
about a star’s dynamic evolution into its final state, much is known about the final 
states themselves. The final states include dispersed nebulae, which are of no interest 
here; cold stellar configurations, the subject of this section; and “black holes,” the 
subject of Part VII. 


{continued on page 624 ) 
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On Massive Neutron Cores 
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It has been suggested that, when the pressure within stellar matter becomes high enough 
a new phase consisting of neutrons will be formed. In this paper we study the gravitational’ 
equilibrium of masses of neutrons, using the equation of state for a cold Fermi gas, and general 
relativity For masses under *© only one equilibrium solution exists, which is approximately 
described by the nonrelativistic Fermi equation of state and Newtonian gravitational theory. 

or masses i© <m<}Q two solutions exist, one stable and quasi-Newtonian, one more 
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Figure 24.1. 

Two important arrivals on the astrophysical scene: 
the neutron star (1933) and the black hole (1795, 1939). 
No proper account of either can forego general relativity. 
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catalyzed matter 
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What does one mean in principle by the term “the final state of stellar evolution”? 
Start with a star containing a given number, A , of baryons and let it evolve to 
the absolute, burned-out end point of thermonuclear combustion (minimum mass- 
energy possible for the ,4-baryon system). If the normal course of thermonuclear 
combustion is too slow, speed it up by catalysis. If an explosion occurs, collect the 
outgoing matter, extract its kinetic energy, and let it fall back onto the system. 
Repeat this operation as many times as needed to arrive at burnout (cold Fe 56 
for the part of the system under modest pressure; other nuclear species in the region 
closer to the center; “cold matter catalyzed to the end point of thermonuclear 
combustion” throughout). End up finally with the system in its absolutely lowest 
energy state, with all angular momentum removed and all heat extracted, so that 
it sits at the absolute zero of temperature and has zero angular velocity. Such a 
“dead” system, depending upon its mass and prior history (two distinct energy 
minima for certain A-v alues), ends up as a cold stellar configuration (neutron star, 
or “white” dwarf), or as a “dead” black hole. 

The analysis of a cold stellar configuration demands an equation of state. The 
temperature is fixed at zero; the nuclear composition in principle is specified 
uniquely by the density; and therefore the pressure is also fixed uniquely once the 
density has been specified [equation of state p(p) for “cold catalyzed matter”]. 

The white dwarfs and neutron stars observed by astronomers are not really built 
of cold catalyzed matter. However, the matter in them is sufficiently near the end 
point of thermonuclear evolution and sufficiently cold that it can be idealized with 
fair accuracy as cold and catalyzed (see §23.4). 

The equation of state, p(/?), for cold catalyzed matter is shown graphically in 
Figure 24.2. This version of the equation of state was constructed by Harrison and 
Wheeler in 1958. Other versions constructed more recently [see Cameron (1970) 
and Baym, Bethe, and Pethick (1971) for references] are almost identical to the 
Harrison-Wheeler version at densities well below nuclear densities, p < 3 X 10 13 
g/cm 3 . At nuclear and supernuclear densities, all versions differ because of differing 
assumptions about nucleon-nucleon interactions. Along with the equation of state, 
in Figure 24.2 are shown properties of the models of cold stars constructed from 
this equation of state by integrating numerically the equations of structure (23.28). 

The equation of state can be understood by following the transformations that 
occur as a sample of cold catalyzed matter is compressed to higher and higher 
densities. At each stage in the compression, each possible thermonuclear reaction 
is to be catalyzed to its endpoint and the resultant thermal energy is to be removed. 

When the sample is at zero pressure, it is a ball of pure, cold Fe 56 , since Fe 56 
is the most tightly bound of all nuclei. It has the density 7.86 g/cm 3 . As the sample 
is compressed, its internal pressure is provided at first by normal solid-state forces; 
but the atoms are soon squeezed so closely together that the electrons become quite 
oblivious of their nuclei, and begin to form a degenerate Fermi gas. By the time 
a density of p = 10 5 g/cm 3 has been reached, valence forces are completely negligi¬ 
ble, the degenerate electron pressure dominates, and the compressibility index, y 
(see legend for Figure 24.2), is 5/3, the value for a nonrelativistically degenerate 
Fermi gas. Between 10 5 and 10 7 g/cm 3 , the pressure-providing electrons gradually 



Equation of state 


A 



Stellar models 



Figure 24.2. 

The Harrison-Wheeler equation of state for cold matter at the absolute end point of thermonuclear 
evolution, and the corresponding Harrison-Wakano-Wheeler stellar models. The equation of state is 
exhibited in the form of a plot of “compressibility index,” 

P + p dp 

y =- 

P dp 

as a function of density of mass-energy, p. (Small y corresponds to easy compressibility.) The curve 
is parameterized by the logarithm of the pressure, log l0 /?, in units of g/cm 3 [same units as p; note that 
p{ g/cm 3 ) = (1/c 2 ) X p{ dyne/cm 2 )]. The chemical composition of the matter as a function of density 
is indicated as follows: Fe, Fe 56 nuclei; A, nuclei more neutron rich than Fe 56 ; e, electrons; n, free 
neutrons; p, free protons. 

The first law of thennodynamics [equation (22.6)], when applied to cold matter (zero entropy) says 
dp/(p + p) - dn/n ; i.e., 

n = _p±£ exp (_r’_4M 

/W 56 \ J 0 p + p) 

Here /x Fe , the rest mass of an Fe 56 atom, is the ratio between p + p ~ p and n/ 56 in the limit of zero 
density. From this equation and a knowledge of p(p )—(see Figure)—one can calculate n(p). 

The equilibrium configurations are represented by curves of total mass-energy, M, versus radius, R. 
(R is defined such that 4ttR 2 is the star’s surface area.) The M(R) curve is parameterized by the logarithm 
of the central density, log l0 p c , measured in g/cm 3 . Only configurations along two branches of the 
curve are stable against small perturbations and can therefore exist in nature: the white dwarfs, with 
log 10 Pc < 8.38, and the neutron stars, with 13.43 < log 10 p c < 15.78 (see Box 26.1). 

For greater detail on both the equation of state and the equilibrium configurations, see Harrison, 
Thome, Wakano, and Wheeler (1965); also, for an updated table of the equation of state, see Hartle 
and Thome (1968). 








626 


24. PULSARS, NEUTRON STARS; QUASARS. SUPERMASSIVE STARS 


Equilibrium configurations 
cold, catalyzed matter: 


(1) forms and stability 


become relativistically degenerate, and y approaches 4/3. Above p = 1.4 X 10’ 
g/cm 3 , the rest mass of 62 Fe:^ nuclei, plus the rest mass of 44 electrons, plus the 
rather large Fermi kinetic energy of 44 electrons at the top of the Fermi sea, exceeds 
the rest mass of 56 Ni|| nuclei. Consequently, as the catalyzed sample of matter 
is compressed past p = 1.4 X 10 7 g/cm 3 , the nuclear reaction 


62 Fegj 


56 Nig 



/highly compressed 
\neutral atoms 


(24.1) 


goes to its end point, with a release of energy. As the compression continues beyond 
this point, the rising Fermi energy of the electrons induces new nuclear reactions 
similar to (24.1), but involving different nuclei. In these reactions more and more 
electrons are swallowed up to form new nuclei, which are more and more neutron- 
rich. When the density reaches p = 3 X 10 11 g/cm 3 , the nuclei are so highly neutron- 
rich (Y|| 2 ) that neutrons begin to drip off them. The matter now becomes highly 
compressible for a short time (3 x 10 11 < p < 4 X 10 11 ), since most of the re¬ 
maining electrons are swallowed up very rapidly by the dripping nuclei. Above 
p — 4 x 10 11 g/cm 3 free neutrons become plentiful and their degeneracy pressure 
exceeds that of the electrons. Further compression to p — 10 13 g/cm 3 completely 
disintegrates the remaining nuclei, leaving the sample almost pure neutrons with 
y = 5/3, the value for a nonrelativistically degenerate Fermi gas. Intermixed with 
the neutrons are just enough degenerate electrons to prevent the neutrons from 
decaying, and just enough protons to maintain charge neutrality. Compression 
beyond p — 10 13 g/cm 3 pushes the sample into the domain of nuclear densities where 
the physics of matter is only poorly understood. This Harrison-Wheeler version of 
the equation of state ignores all nucleon-nucleon interactions at and above nuclear 
densities; it idealizes matter as a noninteracting mixture of neutrons, protons, and 
electrons with neutrons dominating; and it shows a compressibility index of 5/3 while 
the neutrons are nonrelativistic, but 4/3 after they attain relativistic Fermi energies. 
Other versions of the equation of state attempt to take into account the nucleon- 
nucleon interactions in a variety of ways [see Cameron (1970), Baym, Bethe, and 
Pethick (1971), and many references cited therein], 
for Corresponding to each value of the central density, p c , there is one stellar equilib¬ 
rium configuration. Equilibrium, yes; but is the equilibrium stable? Stability studies 
(Chapter 26, especially Box 26.1) show that many of the models are unstable against 
small radial perturbations, which lead to gravitational collapse. Only white-dwarf 
stars in the range log 10 p c < 8.4 and neutron stars in the range 13.4 < log 10 p c < 15.8 
are stable. Instability for the region of log 10 p c values between 8.4 and 13.4 is caused 
by a combination of (1) relativistic strengthening of the gravitational forces, and 
(2) high compressibility of the matter due to electron capture and neutron drip by 
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the atomic nuclei. Neutron stars are stable for a simple reason. Neutron-dominated 
matter is so difficult to compress that even the relativistically strengthened gravita¬ 
tional forces cannot overcome it. Above log 10 p c — 15.8, the gravitational forces 
become strong enough to win out over the pressure of the nuclear matter, and the 
stars are all unstable. [See Gerlach (1968) for the possibility—which, however, he 
rates as unlikely—that there might exist a third family of stable equilibrium con¬ 
figurations, additional to white dwarfs and neutron stars.] 

The white-dwarf stars have masses below 1.2 M 0 and radii between —3000 and (2) white-dwarf stars 
—20,000 km. They are supported almost entirely by the pressure of the degenerate 
electron gas. Relativistic deviations from Newtonian structure are only a fraction 
of a per cent, but relativistic effects on stability and pulsations are important from 
P c ^ 10 s g/cm 3 to the upper limit of the white-dwarf family at p c = 10 8 * 4 g/cm 3 [see, 
e.g., Faulkner and Gribbin (1968)]. The properties of white-dwarf models are fairly 
independent of whose version of the equation of state is used in the calculations. 

The properties of neutron stars are moderately dependent on the equation of state (3) neutron stars 
used. However, all versions lead to upper and lower limits on the mass and central 
density. The correct lower limits probably he in the range 



13.4 < log 10 p cmin < 14.0, 
0.05 M q < M mln < 0.2 M q \ 

the correct upper limits are probably in the range 


(24.2) 


15.0 < log 10 p cmax < 16.0, 

(24.3) 

0.5 M© < M max < 3 M© 

[see Rhoades (1971)]. Neutron stars typically have radii between —6 km and —100 
km. Relativistic deviations from Newtonian structure are great, sometimes more than 
50 per cent. 

It appears certain that no cold stellar configuration can have a mass exceeding 
—5 M 0 [Rhoades (1971)] (1.2 M Q according to the Harrison-Wheeler equation of 
state, Figure 24.2). Any star more massive than this must reduce its mass below 
this limit if it is to fade away into quiet obscurity, otherwise relativistic gravitational (4) black holes 
forces will eventually pull it into catastrophic gravitational collapse past white-dwarf 
radii, past neutron-star radii, and into a black hole a few kilometers in size (see 
Part VII). 
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Theory predicts that, when a star more massive than the Chandrasekhar limit of 
1.2 M q has exhausted the nuclear fuel in its core and has compressed its core to 
white-dwarf densities, an instability pushes the star into catastrophic collapse. The 
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Birth of a neutron star by 
stellar collapse 


Dynamics of a newborn 
neutron star 


Neutron star as a pulsar 


Pulsar radiation as a tool 
studying neutron stars 


core implodes upon itself until nucleon-nucleon repulsion halts the implosion. The 
result is a neutron star, unless the core’s mass is so great that gravity overcomes 
the nucleon-nucleon repulsion and pulls the star on in to form a black hole. Not 
all the star’s mass should become part of the neutron star or black hole. Much of 
it, perhaps most, can be ejected into interstellar space by the violence that accom¬ 
panies the collapse—violence due to flash nuclear burning, shock waves, and energy 
transport by neutrinos (“stick of dynamite in center of star, ignited by collapse ). 

The collapsed core holds more interest for gravitation theory than the ejected 
envelope. That core, granted a mass small enough to avoid the black-hole fate, will 
initially be a hot, wildly pulsating, rapidly rotating glob of nuclear matter with a 
strong, embedded magnetic field (see Figure 24.3). The pulsations must die out 
quickly. They emit a huge flux of gravitational radiation, and radiation reaction 
damps them in a characteristic time of ~~1 second [see Wheeler (1966); Thorne 
(1969a)]. Moreover, the pulsations push and pull elementary particle reactions back 
and forth by raising and lowering the Fermi energies in the core’s interior; these 
particle reactions can convert pulsation energy into heat at about the same rate as 
the pulsation energy is radiated by gravity. [See Langer and Cameron (1969); also 
§11.5 of Zel’dovich and Novikov (1971) for details and references.] 

The result, after a few seconds, is a rapidly rotating centrifugally flattened neutron 
star with a strong (perhaps 10 12 gauss) magnetic field; all the pulsations are gone. 
If the star is deformed from axial symmetry (e.g., by centrifugal forces or by a 
nonsymmetric magnetic field), its rotation produces a steady outgoing stream of 
gravitational waves, which act back on the star to remove rotational energy. Whether 
or not this occurs, the rotating magnetic field itself radiates electromagnetic waves. 
They slow the rotation and transport energy into the surrounding, exploding gas 
cloud (nebula). [See Pacini (1968), Goldreich and Julian (1968), and Ostriker and 
Gunn (1969) for basic considerations.] 

Somehow, but nobody understands in detail how, the rotating neutron star beams 
coherent radio waves and light out into space. Each time the beam sweeps past the 
Earth optical and radio telescopes see a pulse of radiation. The light is emitted 
synchronously with the radio waves, but the light pulses reach Earth earlier (~~1 
second for the pulsar in the crab nebula) because of the retardation of the radio 
waves by the plasma along the way. This is the essence of the 1973 theory of pulsars, 
accepted by most astrophysicists. 

for Although the mechanism of coherent emission is not understood, the pulsar 
radiation can nevertheless be a powerful tool in the experimental study of neutron 
stars. Anything that affects the stellar rotation rate, even minutely (fractional changes 
as small as 10" 9 ) will produce measurable irregularities in the timing of the pulses 
at Earth. If the star’s crust and mantle are crystalline, as 1973 theory predicts, they 
may be subject to cracking, faulting, or slippage (“starquake’) that changes the 
moment of inertia, and thence the rotation rate. Debris falling into the star will also 
change its rotation. Whichever the cause, after such a disturbance the star may rotate 
differentially for awhile; and how it returns to rigid rotation may depend on such 
phenomena as superfluidity in its deep interior. Thus, pulsar-timing data may 
eventually give information about the interior and crust of the neutron star, and 
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Figure 24.3. 

“Collapse, pursuit, and plunge scenario” [schematic from Ruffini and Wheeler (1971b)]. 

• A star with white-dwarf core (A), slowly rotating, 

• evolves by straightforward astrophysics, 

• arrives at the point of gravitational instability, 

• collapses, and 

• ends up as a rapidly spinning neutron-star pancake (B,B'). 

• It then fragments (C) because it has too much angular momentum to collapse into a single stable 

object. If the substance of the neutron-star pancake were an incompressible fluid, the fragmentation 
would have a close tie to wellrknown and often observed phenomena (“drop formation”). However, 
the more massive a neutron star is, the smaller it is, so one’s insight into this and subsequent stages 
of the scenario are of necessity subject to correction or amendment. One can not today guarantee 
that fragmentation takes place at all; nevertheless, fragmentation will be assumed in what follows. 

• The fragments dissipate energy and angular momentum via gravitational radiation. 

• One by one as they revolve they coalesce (“pursuit and plunge scenario”). 

• In each such plunge a pulse of gravitational radiation emerges. 

• Fragments of debris fall onto the coalesced objects (neutron stars or black holes, as the case may 

be), changing their angular momenta. 

• Eventually the distinct neutron stars or black holes or both unite into one such collapsed object with 

a final pulse of gravitational radiation. 

• The details of the complete scenario differ completely from one evolving star to another, depending 

on 

• the mass of its core, and 

• the angular momentum of this core. 

• An entirely different kind of picture therefore has to be drawn for altered values of these two 

parameters. 

• Even for the values of these parameters adopted in the drawing, the present picture can at best possess 

only qualitative validity. 

• Detailed computer analysis would seem essential for any firm prediction about the course of any selected 

scenario. 
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thence (by combination with theory) about its mass and radius. These issues are 
discussed in detail in a review article by Ruderman (1972) as well as in Zel’dovich 
and Novikov (1971). 


§24.4. SUPERMASSIVE STARS AND 
STELLAR INSTABILITIES 


Theory of the stability of 
Newtonian stars 


When a Newtonian star of mass M oscillates adiabatically in its fundamental mode, 
the change in its radius, SR, obeys a harmonic-oscillator equation, 

MSR = - kSR , (24.4) 

with a “spring constant” k that depends on the star’s mean adiabatic index [recall: 

= (n/p)(dp/dn) constentrovy ], on its gravitational potential energy on the trace 
I — fpr 2 d‘Y of the second moment of its mass distribution, and on its mass M, 

k = 3 M(F l - 4/3)|0|/7 (24.5) 

(See Box 24.2). If T l > 4/3 the Newtonian star is stable and oscillates; if T l < 4/3 
the star is unstable and either collapses or explodes, depending on its initial condi¬ 
tions and overall energetics. This result is a famous theorem in Newtonian stellar 
theory—but it is relevant only for adiabatic oscillations. 


\ 

Box 24.2 OSCILLATION OF A NEWTONIAN STAR 

The following is a volume-averaged analysis of the lowest mode of radial oscillation. 

Such analyses are useful in understanding the qualitative behavior and stability of 
a star, [See Zel’dovich and Novikov (1971) for an extensive exploitation of them.] 
However, for precise quantitative results, one must perform a more detailed analysis 
[see, e.g., Ledoux and Walraven (1958); also Chapter 26 of this book]. 

1. Let M — star’s total mass 

R — star’s radius 
p — mean density = (3/47 t)M/R 3 
p — mean pressure 

r i = mean adiabatic index = («/^)(^/3^)adiabatic 

= (WpXdp/dp)^^ in Newtonian limit, where p — const. X n. 

2. Then the mean pressure-buoyancy force F buoy and the counterbalancing gravita¬ 
tional force in the equilibrium star are 


F buoy = Pj R 

= iv ftV = pM/R 2 = (477/3 )p 2 R. 
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3. When the oscillating star has expanded or contracted so its radius is R + SR, 
then its mean density will have changed to 

P + Sp = (3/4 it)M[R- 3 + S(R~ 3 )] = p- 3 (p/R) SR, 

and its mean pressure will be 

p + 8p=p + (p/p)T 1 Sp =p- 3(T 1 p/R)8R. 

The corresponding changes in the forces will be 


"W = T~Rt SR= “ (3r > + l)J ZT = " (3A + 1)F ^(^)’ 

*4 av = (y-)(2pi?6p + P 2 5/?) = (yP 2 ^)(- 5 f) = -54av(^). 

Consequently, the restoring force will be (recall: F buoy = Fg,. av ) 

8F — SF — 3 ( T — —I F — 

°^grav 0/ buoy “ J l 1 1 3 / S 1 *^ R ' 

4. This restoring force produces an acceleration, 

SF^ - SF hxloy = -pSR. 

Hence, the equation of motion for the oscillations is 

SR = -3(7\ - 4/3)(4tt/3)p <5F, 

corresponding to a “spring constant” k and angular frequency of oscillation to, 
given by to 2 = 47 t(/\ — 4/3)p, and k = Mto 2 . 

5. A more nearly exact analysis (see exercise 39.7 for details, or Box 26.2 for an 
alternative derivation) yields the improved formula 

W 2 = 3 (r x - 4/3)|J2|//, 

S =( s,ar ' s «lf-gravi»tional\ = If p0dr =--( Si dr dr\ 
\energy / 2 J 2 J \x - x’\ 

T /trace of second moment of\ r 9 
/ - , . = pr 2 dT, 

\ star s mass distnbution / J 


for the square of the oscillation frequency. 

6. Note that 7\ > 4/3 corresponds to stable oscillations; J\ < 4/3 corresponds to 
exponentially developing collapse or explosion. 
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Stability theory predicts 
"engine-driven oscillations" 
and quick death for stars of 
M > 60 M q 


Possible existence of 
supermassive stars 


Relativistic instabilities in a 
supermassive star 


In a real star no oscillation is precisely adiabatic. The oscillations in temperature 
cause corresponding oscillations in the stellar opacity and in nuclear burning rates. 
These insert energy into or extract energy from the gas vibrations. 

All main-sequence stars thus far observed and studied have masses below 60 Af©. 
For such small masses, theory predicts low enough temperatures that gas pressure 
dominates over radiation pressure, and the adiabatic index is nearly that of nonrela- 
tivistic gas, F 1 ^ 5/3. Such stars vibrate stably. The net effect of the oscillating 
opacity and burning rate is usually to extract energy from the vibrations. Thus, they 
damp. (The vibrations of Cepheid variable stars are a notable exception.) 

No one has yet seen a main-sequence star with mass above about 60 A/©. This 
is explained as follows. For masses above 60 A/©, the temperature should be so high 
that radiation pressure dominates over gas pressure, and the adiabatic index r i is 
only slightly above the value 4/3 for pure radiation. Consequently the “spring 
constant” of the star, although positive, is very small. On the inward stroke of an 
oscillation, the central temperature rises, and nuclear burning speeds up. (The 
nuclear burning rate goes as a very high power of the central temperature; for 
example, in a massive star HCNO burning releases energy at a rate e HCN0 ^ T c 11 .) 
Because the spring constant is so small, the inward stroke lasts for a long time, and 
the enhanced nuclear burning produces a significant excess of thermal energy and 
pressure. Hence, on the outward stroke the star expands more vigorously than it 
contracted (“engine”). Successive vibrations are driven to higher and higher ampli¬ 
tudes. Eventually, calculations suggest, the star either explodes, or it ejects enough 
mass by its vigorous vibrations to drop below the critical limit of M — 60 A/©. Hence, 
stars of mass above 60 A/© should not live long enough that astronomers could have 
a reasonable probability of discovering them. 

Of course, this “engine action” does not prevent massive stars from forming, living 
a short time, and then disrupting themselves. Such a possibility is particularly 
intriguing for supermassive stars [M between 10 3 Af© and 10 9 Af© — 0.01 X (mass 
of a galaxy)]. Although such stars may be exceedingly rare, by their huge masses 
and huge release of explosive energy they might play an important role in the 
universe. Moreover, it is conceivable that the oscillations of such stars, like those 
of Cepheid variables, might be sustained at large amplitudes for long times (a million 
years?), with nonlinear damping processes preventing their further growth. 

Theory predicts that general relativistic effects should strongly influence the 
oscillations of a supermassive star. The increase in “gravitational force,” acting 

on a shell of matter on the inward stroke is greater in general relativity than in 
Newtonian theory, and the decrease on the outward stroke is also greater. Conse¬ 
quently the “effective index” /\ crit of gravitational forces is increased above the 
Newtonian value of 4/3; thus. 


/fractional increase in \ 
“pressure-like force of 
gravity” per unit fractional 
change in baryon-number 

/ 



= A cm = (4/3) + a(M/R) + 0(M 2 /R 2 ), (24.6) 
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where a is a constant of the order of unity that depends on the structure of the 
star (see Box 26.2). To resist gravity, one has only the elasticity of the relativistic 
material of the star: 

/ fractional increase in 
“pressure-like resisting 
force” per unit fractional 
change in baryon number 
\ density 

The effective spring constant for the vibrations of the star is governed by the delicate 
margin between these two indices: 




/ contribution \ 

/ effective \ 

= 1 of “elastic ) — ( 

\ spring constant / 

V forces” / ' 


/ contribution 
\ of gravity 


= 3 M(r x -r lcrit ) 


1421 


(24.8) 


(derivation in Chapter 26). The relativistic rise in the effective index of gravity above 
4/3 [equation (24.6)] brings on the transition from stability (positive k ; vibration) 
to instability (negative k ; explosion or collapse) under conditions when one otherwise 
would have expected stability. For supermassive stars, Fowler and Hoyle (1964) show 
that 

A = 4/3 + f(A//M 0 )“ 1/2 , 


where f is a constant of order unity. As a newly formed supermassive star contracts 
inward, heating up, but not yet hot enough to ignite its nuclear fuel, it approaches 
nearer and nearer to instability against collapse. Unless burning halts the contraction, 
collapse sets in at a radius R crit given by 

A - 4/3 + f(M/M o r 1/2 = r lcrit = 4/3 4 aM/R; 


i.e., 


R = (a/2£)(M/M 0 ) 1/2 X (Schwarzschild Radius) 

— 10 4 x (Schwarzschild Radius) if M — 10 8 M 0 . 

The relativistic instability occurs far outside the Schwarzschild radius when the star 
is very massive. Relativity hardly modifies the star’s structure at all; but because 
of the delicate balance between 5 A av and 5F buoy m the Newtonian oscillations (Box 
24.2), tiny relativistic corrections to these forces can completely change the stability. 

In practice, the story of a supermassive star is far more complicated than has been 
indicated here. Rotation can stabilize it against relativistic collapse for a while. 
However, after the star has lost all angular momentum in excess of the critical value 


Temporary stabilization 
rotation 
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Possible scenarios for 
evolution and death of a 
supermassive star 


Supermassive stars as 
possible energy sources for 
quasars and galactic nuclei 


Other possible energy 
sources: dense star clusters; 
black holes 


Relativistic star clusters 


J crit = M 2 (“extreme Kerr limit”; see Chapter 33), and after it has contracted to 
near the Schwarzschild radius, rotation is helpless to stave off implosion. Depending 
on its mass and angular momentum, the star may ignite its fuel before or after 
relativistic collapse begins, and before or after implosion through the Schwarzschild 
radius. When the fuel is ignited, it can wreak havoc, because even if the star is not 
then imploding, its adiabatic index will be very near the critical one, and the burning 
may drive oscillations to higher and higher amplitudes. These processes are so 
complex that in 1973 one is far from having satisfactory analyses of them, but for 
reviews of what is known and has been done, the reader can consult Fowler (1966). 
Thorne (1967), and Zel’dovich and Novikov (1971). 

The theory of stellar pulsations in general relativity is presented for Track-2 readers 
in Chapter 26 of this book. 


§24.5. QUASARS AND EXPLOSIONS IN GALACTIC NUCLEI 

Supermassive stars were first conceived by Hoyle and Fowler (1963a,b) as an expla¬ 
nation for explosions in the nuclei of galaxies. Shortly thereafter, when quasars were 
discovered, Hoyle and Fowler quite naturally appealed to their supermassive stars 
for an explanation of these puzzles as well. Whether galactic explosions or quasars 
are driven by supermassive stars remains a subject of debate in astronomical circles 
even as this book is being finished, in 1973. Hence, this book will avoid the issue 
except for the following remark. 

Whatever is responsible for quasars and galactic explosions must be a machine 
of great mass (Af ~~ 10 6 to 10 10 M 0 ) and small radius (light-travel time across the 
machine, as deduced from light variations, is sometimes less than a day). The 
machine might be a coherent object, i.e., a supermassive star; or it might be a dense 
mixture of ordinary stars and much gas. Actually these two possibilities may not 
be distinct. Star-star collisions in a dense cluster can lead to stellar coalescence and 
the gradual building up of one or more supermassive stars [Sanders (1970); Spitzer 
(1971); Colgate (1967)]. Thus, at one stage in its life, a galactic nucleus or quasar 
might be driven by collisions in a dense star cluster; and at a later stage it might 
be driven by a supermassive star; and at a still later stage that star might collapse 
to leave behind a massive black hole (10 6 -10 9 Af 0 ), but a black hole that is still 
“live” and active (Chapter 33). 


§24.6. RELATIVISTIC STAR CLUSTERS 

The normal astrophysical evolution of a galactic nucleus is estimated [Sanders (1970); 
Spitzer (1971)] to lead under some circumstances to a star cluster so dense that 
general relativity influences its structure and evolution. The theory of relativistic 
star clusters is closely related to that of relativistic stars, as developed in Chapter 
23. A star is a swarm of gas molecules that collide frequently; a star cluster is a 
swarm of stars that collide rarely. But the frequency of collisions is relatively unim- 
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portant in a steady state. For the theory of relativistic star clusters, see: §25.7 of 
this book; Zel’dovich and Podurets (1965); Fackerell, Ipser, and Thome (1969); 
Chapter 12 of Zel’dovich and Novikov (1971); and references cited there. A relativ¬ 
istic star cluster is a latent volcano. No future is evident for it except to evolve with 
enormous energy release to a massive black hole, either by direct collapse (possibly 
a star at a time) or by first coalescing into a supermassive star that later collapses. 




CHAPTER 



THE "PIT IN THE POTENTIAL" AS THE 
CENTRAL NEW FEATURE OF MOTION 
IN SCHWARZSCHILD GEOMETRY 


" Eccentric, intervolved , yet regular 
Then most , when most irregular they seem; 
And in their motions harmony divine" 
MILTON. 1665 




§25.1. FROM KEPLER'S LAWS TO THE EFFECTIVE POTENTIAL 
FOR MOTION IN SCHWARZSCHILD GEOMETRY 

No greater glory crowns Newton’s theory of gravitation than the account it gives 
of the principal features of the solar system: a planet in its motion sweeps out equal 
areas in equal times; its orbit is an ellipse, with one focus at the sun; and the cube 
of the semimajor axis, a, of the ellipse, multiplied by the square of the average 
angular velocity of the planet in its orbit (co = 27r/period) gives a number with the 
dimensions of a length, the same number for all the planets (Box 25.1), equal to 
the mass of the sun: 

M — co 2 a 3 . 

Exactly the same is true for the satellites of Jupiter (Figure 25.1), and of the Earth 
(Box 25.1), and true throughout the heavens. What more can one possibly expect 
of Einstein’s theory of gravity when it in its turn grapples with this centuries-old 
theme of a test object moving under the influence of a spherically symmetric center 
of attraction? The principal new result can be stated in a single sentence: The particle 
Overview of this chapter is governed by an “effective potential” (Figure 25.2 and §§25.5, 25.6) that possesses 

not only (1) the long distance — M/r attractive behavior and (2) the shorter distance 


/-\ 

This chapter is entirely Track 2, 
except for Figures 25.2 and 
25.6, and Boxes 25.6 and 25.7 
(pp. 639, 660, 674, and 677), 
which Track-1 readers should 
peruse for insight and flavor. 

No earlier Track-2 material is 
needed as preparation for it. 

§25.2 (symmetries) is needed 
as preparation for Box 30.2 
(Mixmaster cosmology). The 
rest of the chapter is not 
essential for any later chapter, 
but it will be helpful in 
understanding 

(1) Chapters 31-34 
(gravitational collapse 
and black holes), and 

(2) Chapter 40 (solar-system 
experiments). 

_ J 
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(angular momentum) 2 /;- 2 repulsive behavior of Newtonian gravitational theory, but 
also (3) at still shorter distances a pit in the potential, which (1) captures a particle 
that comes too close; (2) establishes a critical distance of closest approach for this 
black-hole capture process; (3) for a particle that approaches this critical point 
without crossing it, lengthens the turn-around time as compared to Newtonian 
expectations; and thereby (4) makes the period for a radial excursion longer than 
the period of a revolution; (5) causes an otherwise Keplerian orbit to precess; and 
(6) deflects a fast particle and a photon through larger angles than Newtonian theory 
would predict. 

The pit in the potential being thus the central new feature of motion in Schwarzs- 
child geometry and the source of major predictions (Box 25.2), it is appropriate to 
look for the most direct road into the concept of effective potential and its meaning 
and application. In this search no guide is closer to hand than Newtonian mechanics. 

Analytic mechanics offers several ways to deal with the problem of motion in a 
central field of force, and among them are two of central relevance here: (1) the 
world-line method, which includes second-order differential equations of motion, 
Lagrange’s equations, search for constants of integration, reduction to first-order 
equations, and further integration in rather different ways according as one wants 
the shape of the orbit, 9 = 9(r), or the time to get to a given point on the world 
line, t = t(r); and (2) the wave-crest method, otherwise known as the “eikonal 
method” or “Hamilton-Jacobi method,” which gives the motion by the condition 
of “constructive interference of wave crests,” thus making a single leap from the 
Hamilton-Jacobi equation to the motion of the test object. Both methods are em- 

(continued on page 641) 



West East West East 



Figure 25.1. 

Jupiter’s satellites, as followed from night to night with field glasses 
or telescope, provide an opportunity to check for oneself the central 
ideas of gravitation physics in the Newtonian approximation (dis¬ 
tances large compared to Schwarzschild radius). For the practically 
circular orbits of these satellites, Kepler’s law becomes A/ 1 = oi 2 r 3 
(“1-2-3 principle'’) and the velocity in orbit is/? = ur. Out of observa¬ 
tions on any two of the quantities /?, A/, w, r , one can find the other 
two. (In the opposite limiting case of two objects, each of mass M, 
going around their common center of gravity with separation r, one 
has A/ = u-r 3 /!. /? = cor/2). The configurations of satellites I-IV of 
Jupiter as given here for December 1964 (days 0.0, 1.0, 2.0, etc. in 
“universal time,” for which see any good dictionary or encyclopedia) 
are taken from The American Ephemeris and Nautical Almanac for 
1964 [U.S. Government Printing Office (1962)]. 




638 


25. PARTICLE MOTION IN SCHWARZSCHILD GEOMETRY 


r 




Box 25.1 MASS FROM MEAN ANGULAR FREQUENCY AND 
SEMIMAJOR AXIS: M = <o 2 a 3 

App ro pri atenes s of Newtonian analysis shown by smallness of mass (or “half- 
Schwarzschild radius” or “extension of the pit in the potential”) as listed in last 
column compared to the semimajor axis a in the next-to-last column. Basic data 
from compilation of Allen (1963). 


Object 

Period* {days) 

u{cm~ l ) 

a{cm) 

ura l {cm) 

Planets 

Mercury 

87.9686 

275.8 X lO’ 19 

0.5791 x 10 13 

1.477 x IO 5 

Venus 

224.700 

107.95 

1.0821 

1.477 

Earth 

365.257 

66.41 

1.4960 

1.477 

Mars 

686.980 

35.31 

2.2794 

1.477 

Jupiter 

4332.587 

5.599 

7.783 

1.478 

Saturn 

10759.20 

2.255 

14.27 

1.477 

Uranus 

30685 

0.7905 

28.69 

1.476 

Neptune 

60188 

0.4030 

44.98 

1.478 

Pluto 

90700 

0.2674 x IO" 19 

59.00 x 10 13 

1.469 x 10 5 


Major satellites of Jupiter 


lo 

Europa 

Ganymede 

Callisto 

'l .769 138 

3.551 181 

7.154 553 

16.689 018 

13.711 x 10- 16 

6.831 

3.391 

1.454 x 10- 16 

0.422 x I0 11 

0.671 

1.070 

1.883 x I0 11 

141.3 

141.0 

140.8 

141.1 

Two satellites of Earth 




OSO 5 b 

95.6 min. 

3.65 x 10~ 14 

6.916 X IO 8 

0.442 

Moon 

27.32 

0.888 x IO" 16 

3.84 x 10 10 

0.446 


* Sidereal period: time to make one revolution relative to fixed stars. 

b Orbiting scientific observatory launched Jan. 22, 1969, to observe x-rays and ultraviolet radiation from the sun. 
Perigee 531 km, apogee 560 km, above earth. 


SOME TYPICAL MASSES AND TIMES IN CONVENTIONAL AND 
GEOMETRIC UNITS. Conversion factor for mass, 

G/c 2 = 0.742 X 10~ 28 cm/g 


Mass 

Galaxy 

Sun 

Jupiter 

Earth 

(g) 

M{ cm) 

2.2 x 10 44 

1.6 x 10 16 

1.989 x 10 33 

1.47 x 10 5 

1.899 x 10 30 

112 

5.977 x 10 27 
0.444 

Conversion factor for time, c = 2.998 x IO 10 cm/sec. One sidereal year = 365.256 days or 3.1558 X 10 7 
sec. 

Period 

1 sec 

7 min 

7 hr 

7 day 

“conv (Sec' 1 ) 

os (cm L ) 

6.28 

2.09 x IO" 10 

1.046 x I0' 1 

3.48 X 10“ 12 

1.75 x IO* 3 

5.80 x IO’ 14 

7.28 X IO' 5 
2.42 x IO" 15 


7 week 

7 month 

7 year 



j 


v. 


1.04 x I0- 5 
3.46 x lO" 16 


2.39 x 10" 6 
7.95 X 10~ 17 


1.99 X IO* 7 
6.63 x lO’ 18 



Figure 25.2. 

Effective potential for motion of a test particle in the Schwarzschild 
geometry of a concentrated mass M . Energy, in units of the rest mass 
jx of the particle, is denoted E = E/jn; angular momentum, Z = L/ju. 
The quantity r denotes the Schwarzschild r coordinate. The effective 
potential (also in units of ju) is defined by equation (25.16) or, equiva¬ 
lently, by the equation 

(see also §25.5) and has the value 

P= [(1 - 2M/r)(l + L 2 /r 2 ) ] 1/2 . 

It represents that value of E at which the radial kinetic energy of 
the particle, at r, reduces to zero (E- value that makes r into a “turning 
point”: V{r) = E . Note that one could equally well regard V*{r) as 
the effective potential, and define a turning point by the condition 
V 2 = E 2 . Which definition one chooses depends on convenience, on 
the intended application, on the tie to the archetypal differential 
equation Jx 2 + V{x) = £, and on the stress one wishes to put on 
correspondence with the effective potential of Newtonian theory). 
Stable circular orbits are possible (representative point sitting at mini¬ 
mum of effective potential) only for Z values in excess of 2\/3 M. 
For any such fixed Z value, the motion departs slightly from circu¬ 
larity as the energy is raised above the potential minimum (see the 
two heavy horizontal lines for Z = 3.75 M). In classical physics, the 
motion is limited to the region of positive kinetic energy. In quantum 
physics, the particle can tunnel through the region where the kinetic 
energy, as calculated classically, is negative (dashed prolongations of 
heavy horizontal lines) and head for the “pit in the potential” (capture 
by black hole). Such tunneling is absolutely negligible when the center 
of attraction has any macroscopic dimension, but in principle becomes 
important for a black hole of mass 10 17 g (or 10~ n cm) if such an 
object can in principle exist. 

The diagram at the right gives values of the minimum and maxi¬ 
mum of the potential as they depend on the angular momentum of 
the test particle. The roots oidV/dr are given in terms of the “reduced 
angular momentum parameter” V — Z/A/ = L/Mfi by 

__ 6M _ 

' 1 + (1 - 12 /L n ) u * ’ 

p, _ (l t2 + 36) + (I« - 12X1 - 12/Z. t2 ) l/ ' 2 

54 

[= 8/9 for L* = (12) 1/2 ; 1 forZ/ = 4; (L? 2 /21) + (1/3) + (1 /V 2 ) 

+ ■ • • for V —*- oo j 

(plus root for maximum of the effective potential; minus root for 
minimum; see exercise 25.18). 
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Box 25.2 MOTION IN SCHWARZSCHILD GEOMETRY REGARDED AS A CENTRAL POINT OF 
DEPARTURE FOR MAJOR APPLICATIONS OF EINSTEIN'S GEOMETRODYNAMICS 


1. Newtonian effect of sun on planets and of earth 
on moon and man. 

2. Bending of light by sun. 

3. Red shift of light from sun. 

4. Precession of the perihelion of Mercury around 
the sun. 

5. Capture of a test object by a black hole as 
simple exemplar of gravitational collapse. 

6. Dynamics of Friedmann universe derived from 
model of Schwarzschild “lattice universe.” Lat¬ 
tice universe is constructed from 120 or some 
other magic number of concentrations of mass, 
each mass in an otherwise empty lattice cell of 
its own. Each lattice cell, though actually po¬ 
lygonal, is idealized (see Wigner-Seitz approxi¬ 
mation of solid-state physics) as spherical. A 
test object at the interface between two cells 
falls toward the center of each [standard radial 
motion in Schwarzschild geometry; see discus¬ 
sion following equation (25.27). Therefore the 
two masses fall toward each other at a calcula¬ 
ble rate. From this simple argument follows the 
entire dynamics of the closed 3-sphere lattice 
universe, in close concord with the predictions 
of the Friedmann model [see Lindquist and 
Wheeler (1957)]. 

7. Perturbations of Schwarzschild geometry, I. 
Gravitational waves are incident on, scattered 
by, and captured into a black hole. Waves with 
wavelength short compared to the Schwarzs¬ 
child radius can be analyzed to good approxi¬ 
mation by the methods of geometric optics (ex¬ 
ercises 35.15 and 35.16), as employed in this 
chapter to treat the motions of particles and 
photons. For longer wavelengths, there occur 
important physical-optics corrections to this 


geometric-optics idealization (see §35.8 and ex¬ 
ercises 32.10, 32.11). Similar considerations 
apply to electromagnetic and de Broglie waves. 

8. Lepton number for an electron in its lowest 
quantum state in the geometry (“gravitational 
field of force”) of a black hole is calculated to 
be transcended (capture of the electron!) or not 
according as the mass of this black hole is large 
or small compared to a certain critical mass 
NU e — M* 2 /m e (—10 17 g or 10" 11 cm) [Hartle 
(1971, 1972); Wheeler (1971b,c); Teitelboim 
(1972b,c)]. Similarly (with another value for the 
critical mass) for conservation of baryon num¬ 
ber [Bekenstein (1972a,b), Teitelboim (1972a)]. 
To analyze “transcendence or not” one must 
solve quantum-mechanical wave equations, of 
which the Hamilton-Jacobi equation for parti¬ 
cle and photon orbits is a classical limit. These 
quantum wave equations contain effective po¬ 
tentials identical—-aside from spin-dependent 
and wavelength-dependent corrections—to the 
effective potentials for particle and photon mo¬ 
tion. 

9. Perturbations of Schwarzschild geometry, II. 
Those small changes in standard Schwarzschild 
black-hole geometry which remain stationary in 
time describe the alterations in a “dead” black 
hole that make it into a “live” black hole, one 
endowed with angular momentum as well as 
mass (see Chapter 33). To analyze such changes 
in black-hole geometry, one must again solve 
wave equations, but wave equations which are 
now classical. Once more the wave equations 
are closely related to the Hamilton-Jacobi 
equation, and their effective potentials are close 
kin to those for particle motion. 
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ployed here in turn because each gives special insights. The Hamilton-Jacobi method 
(Box 25.3) leads quickly to the major results of interest (Box 25.4), and it has a 
close tie to the quantum principle. The world-line method (§§252, 25.3, 25.4) starts 
with the geodesic equations of motion themselves. It provides a more familiar way 
into the subject for a reader not acquainted with the Hamilton-Jacobi approach. 
Moreover, in attempting to solve the geodesic equations of motion, one must analyze 
symmetry properties of the geometry, an enterprise that continues to pay dividends 
when one moves from Schwarzschild geometry to Kerr-Newman geometry (Chap¬ 
ter 33), and from Friedmann cosmology (Chapter 27) to more general cosmologies 
(Chapter 30). 

{continued on page 650 ) 


/ \ 

Box 25.3 THE HAMILTON-JACOBI DESCRIPTION OF MOTION: 

NATURAL BECAUSE RATIFIED BY THE QUANTUM PRINCIPLE 

1. Purely classical (nonquantum). 

2. Originated with William Rowan Hamilton out of conviction that mechanics is 
similar in its character to optics; that the “particle world line” of mechanics is 
an idealization analogous to the “light ray” of geometric optics. Localization 
of energy of light ray is approximate only. Its spread is governed by wavelength 
of light (“geometric optics”). Hamilton had glimmerings of same idea for parti¬ 
cles: “quantum physics before quantum physics.” The way that he and Jacobi 
developed to analyze motion through the Hamilton-Jacobi function S(x , /)—to 
take the example of a dynamic system with only one degree of freedom, x — 
makes the leap from classical ideas to quantum ideas as short as one knows 
how to make it. Moreover, the real world is a quantum world. Classical me¬ 
chanics is not born out of a vacuum. It is an idealization of and approximation 
to quantum mechanics. 

3. Key idea is idealization to a particle wavelength so short that quantum-mechan¬ 
ical spread or uncertainty in location of particle (or spread of configuration 
coordinates of more complex system) is negligible. No better way was ever 
discovered to unite the spirit of quantum mechanics and the precision of location 
of classical mechanics. 

4. Call Hamiltonian //(/?, x) = p 2 /2m + V{x). Call energy of particle E. Then there 
is no way whatever consistent with the quantum principle to describe the motion 
of the particle in space and time. The uncertainty principle forbids (sharply 
defined energy AE —0, in AE At > fi/ 2, implies uncertainty At —oo; also 
Ap —0 in Ap Ax > fi/2 implies Ax —oo). The quantum-mechanical wave 
function is spread out over all space. This spread shows in the so-called semi- 
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Box 25.3 (continued) 


classical or Wentzel-Kramers-Brillouin [“WKB”; see, for example, Kemble 
(1937)] approximation for the probability amplitude function, 



slowly varying \ 
amplitude function/ 


( 1 ) 



5. It is of no help in localizing the probability distribution that h = 1.054 X 
10“ 27 g cm 2 /s [or ^ = (1.6 X 10~ 33 cm) 2 in geometric units] is very small com¬ 
pared to the “quantities of action” or “magnitudes of the Hamilton-Jacobi 
function, S” or “dynamic phase, S” encountered in most everyday applications. 


6. It is of no help in localizing the probability distribution that this dynamic phase 
obeys the simple Hamilton-Jacobi law of propagation, 




+ V(x). 


( 2 ) 


7. It is of no help in localizing the probability distribution that the solution of this 
equation for a particle of energy E is extraordinarily simple, 

S(x, /)=-£/ + f 1 {2 m[E - V(x)]} 1/2 dx + 8 E (3) 

*0 

(with 8 e an arbitrary additive phase constant). The probability amplitude is still 
spread all over everywhere. There is not the slightest trace of anything like a 
localized world line, x = x(t). 

8. To localize the particle, build a probability- 
amplitude wave packet by superposing mono¬ 
frequency (monoenergy) terms, according to a 
prescription qualitatively of the form 

t(x, 0 = 0 + 0 “b ’ * * * W 



Monoenergy wave 

Superposition of 
monoenergy waves 
to give wave packet 
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Destructive interference takes place almost 
everywhere. The wave packet is concentrated 
in the region of constructive interference. 
There the phases of the various waves agree; 
thus 


SjpCXj 0 — Se+ae( x * 0* (“0 


At last one has moved from a wave spread 
everywhere to a localized wave and thence, in 
the limit of indefinitely small wavelength, to 
a classical world line. This one equation of 
constructive interference ties together x and t 
(locus of world line in x, t, diagram). Smooth 
lines —20, —19, —18, etc. are wave crests of 
E ; dashed lines, wave crests for ^e+ae- 
Shaded area is region of constructive interfer¬ 
ence (wave packet). Black dots mark locus of 
classical world line, 


Lim 

AE^O 


^ E + AE^ X > 0 Sjg(x, 0 



*5 


9. The Newtonian course of the world line through spacetime follows at once from 
this condition of constructive interference when one goes to the classical limit 
(h negligible compared to amounts of action involved; hence wavelength negli¬ 
gibly short; hence spread of energies AE required to build well-localized wave 
packet also negligible); thus 

Se+ae( x > 0 ~ Se( x > 0 _ q 
AE “ 


reduces to 


dSjg(*> t) _ n 

J 


dE 
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Box 25.3 ( continued ) 

This condition in turn^ applied to expression (3), gives the time required to travel 
to the point x; thus. 




{(2 /m)[E - K(x)]} 


172 + ‘o - °» 


where / 0 is an abbreviation for the quantity 

t 0 ~ d S E /dE 

(“difference in base value of dynamic phase per unit difference of energy”). 

10. Not one trace of the quantum of action comes into this final Newtonian result, 
for a simple reason: fi has been treated as negligible and the wavelength has 
been treated as negligible. In this limit the location of the wave “packet” re¬ 
duces to the location of the wave crest. The location of the wave crest is precisely 
what is governed by S E (x, /); and the condition of “constructive interference” 
dS E (x, t)/dE = 0 gives without approximation the location of the sharply 
defined Newtonian world line x = x(/). 

11. Relevance in the context of motion in a central field of force? Quickest known 
route to the concept of effective potential (Box 25.4). 


Box 25.4 MOTION UNDER GRAVITATIONAL ATTRACTION OF A CENTRAL MASS 
ANALYZED BY HAMILTON-JACOBI METHOD 


A. Newtonian Theory of Gravitation 


Hamiltonian 


, El . El M 

2r 2 2r 2 sin 2 # r 


(tildes over energy, momentum, etc., refer to test object of unit mass; test particle 
of mass /x follows same motion with energy E — fiE, momentum p = j up, etc.). 
Equation of Hamilton-Jacobi for propagation of wave crests: 

_ M = ±(21) 2 , 1 (dS\ 2 1 (cS\ 2 _ M m 

dt 2 \dr) 2r 2 \dd) 2r 2 sin 2 # V d<p/ r 
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Box 25.4 ( continued) 

Solve by “method of separation of variables” with convention that Vo 2 = ±a. 


s-St+r^+f (i 2 -^) 

+r[*( 


1/2 

de 


„ M 

,E + - 

r 


L 2 \1 1/2 

2^)1 dr + S W ' 


(3) 


(Check by substituting into Hamilton-Jacobi equation. Solution as sum of four 
terms corresponding to the four independent variables goes hand in hand with 
expression of probability amplitude in quantum mechanics as product of four 
factors, because iS/ti — ipS/fi is exponent in approximate expression for the 
probability amplitude.) 

Constructive interference of waves: 

(1) with slightly different E values (impose “condition of constructive interfer¬ 
ence” dS~^z %(t, r,9,<p)/dE = 0) tells when the particle arrives at a given r 
(that is, gives relation between t and r); 

(2) with slightly different values of the “parameter of total angular momen¬ 
tum per unit mass,” L (impose condition of constructive interference 
dS~^z z(t, r, 0, <J>)/SL = 0) tells correlation between 9 and r (a major feature 
of the shape of the orbit); 

(3) with slightly different values of the “parameter of azimuthal angular mo¬ 
mentum per unit mass,”/? 0 (impose condition dS/dp ^ = 0) gives correlation 
between 9 and <J>, 



o =-£=*_ r_ qyfr* „ ( 4) 

cp^ J sin e (sin 2 # - p 2 /l 2 ) V2 

Planar character of the orbit. 

Puzzle out the value of this last integral with the help of a table of integrals? 
It is quicker and clearer to capture the content without calculation: the particle 
moves in a plane. The vector associated with the angular momentum L stands 
perpendicular to this plane. The projection of this angular momentum along 
the z -axis is 'p^ = L cos a (definition of orbital inclination, a). Straight line 
connecting origin with particle cuts unit sphere in a point As time runs on, 
SP traces out a great circle on the unit sphere. The plane of this great circle 
cuts the equatorial plane in a “line of nodes,” at which “hinge-line” the two 
planes are separated by a dihedral angle, a. The orbit of the point 9 is described 
by x — r cos = r sin \j/ 9 z — 0 in a Cartesian system of coordinates in which 
y runs along the line of nodes and in which x lies in the plane of the orbit. 

v_ 
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Box 25.4 (continued) 


In a coordinate system in which y runs along the line of nodes and x lies in 
the plane of the equator, one has: 


r cos 9 = z — z cos a + x sin a = r cos ^ sin a ; 

r sin 9 cos 6 — x — — £ sin a + x cos a — r cos cos a; 

r sin 9 sin <> = y — y — r sin \p. 

Eliminate reference to the Cartesian coordinates and, by taking ratios, also 

eliminate reference to r. Thus find the equation of the great circle route in 

parametric form, 

tan<J> = tan^/cos a 


and 


cos 9 = cos i p sin a. 


Here increasing values of ^ spell out successive points on the great circle. 
Eliminate ip via the relation 

sec 2 \p — tan 2 ^ = 1 


to find 


or, more briefly, 


^4-tan^cos 2 « = l 
cos 2 0 Y 

sec<j> = tan a tan 9 . 


(5) 


One verifies that <p as calculated from (5) provides an integral of (4), thus 
confirming the physical argument just traced out. Moreover, the arbitrary 
constant of integration that comes from (4), left out for the sake of simplicity 
from (5), is easily inserted by replacing <j> there by <p — <p 0 (rotation of line of 
nodes to a new azimuth). The kind of physics just done in tracing out the relation 
between 9 and <f> is evidently elementary solid geometry and nothing more. The 
same geometric relationships also show up, with no relativistic corrections 
whatsoever (how could there be any?!) for motion in Schwarzschild geometry. 
Therefore it is appropriate to drop this complication from attention here and 
hereafter. Let the particle move entirely in the direction of increasing 0, not 
at all in the direction of increasing <J>; that is, let it move in an orbit of zero 
angular momentum p 0 (total angular momentum vector L inclined at angle 
a = tt/ 2 to z~axis). Consequently the dynamic phase S (to be divided by h to 
obtain phase of Schrodinger wave function when one turns from classical to 
quantum mechanics) becomes 
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r 

Shape of orbit: 

n - 95 _ „ _ r 1 drh f _ 

IT ~ J [2(£ + M/r - L 2 /2r 2 )] 1/2 ’ 


whence 


r _ lvm 

1 -j- e cos 0 



( 8 ) 


Here e is an abbreviation for the eccentricity of the orbit, 

e = (1 4- 2 EL 2 /M 2 ) 1/2 (9) 

(greater than 1 for positive £, hyperbolic orbit; equal to 1 for zero £, parabolic 
orbit; less than 1 for negative £, elliptic orbit). A constant of integration has 
been omitted from (8) for simplicity. To reinstall it, replace 6 by 6 — 6 0 (rotation 
of direction of principal axis in the plane of the orbit). Other features of the 
orbit: 


( semimajor axis of \ 
orbit when elliptic/ 

( semiminor axis of \ 
orbit when elliptic/ 


_ _ L 2 /M _ M . 

1 -e 2 (^ 21 )’ 

h - l2 / M = \ ♦ 

(1 -e 2 f /2 (-2E) 1 ' 2 ’ 


( 10 ) 


( 11 ) 


( “impact parameter” \ 
for hyperbolic orbit, \ 
or “distance of closest I 
approach in / 

absence of deflection’/ 


b = 


(angular momentum per unit mass) 
(linear momentum per unit mass) 

L . 

(2£) 1/2 5 


( 12 ) 


( actual distance of\ 
closest approach / 

( angle of deflection 
in hyperbolic orbit, 


_ L 2 /M _. 

111111 (1 + 2 EL 2 /M 2 ) 1/2 + 1 ’ 

0 = 7r — 2 arc cos ( l/e ) 

= 2 arc tan [M/(2E) 1/2 L] 

= 2 arc tan [M/2Eb]\ 


(13) 


(14) 


( 



differential scattering 
cross section 


do __ 27ib db 
dS2 277 sin 0 d0 


= (4d^72? (R “ ,h " f ° rd) ' 


(15) 
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Box 25.4 ( continued) 


Time as correlated with position: 




K®+?-£)]■ 


r —-— (1 — e cos u) 

(-2 E) 


to simplify the integration. Get 


t rr--- (u — e sin u), 

(-2 Ef /2 


fmean circularX 2-7T 
^ frequency / (period) 


(-2 Ef n (M \ l/2 


-( 5 ) 


Here w is the so-called “mean eccentric anomaly” (Bessel’s time parameter). 
In terms of this quantity, one has also: 

(1 — e 2 ) 1/2 sin 9 

sin u - ■— --—; 

1 + e cos 9 

COSK = JS*l±JL; 

1 + e cos 9 

cos 0 = 

1 — e cos u 


sin 9 — 


(1 — e 2 ) 1/2 sin u 
1 — e cos u 


x — r cos 9 — -(cos u — e): 

(- 2E) y } 


y = r sin 9 = 


( —2£) 1/2 


These expressions lend themselves to Fourier analysis into harmonic functions 
of the time, with coefficients that are standard Bessel functions: 


1 r 77 

J n ( Z ) = — e i<zsmu-nu) du . 9 

2 7T J .. 


<./a — — — e + ^ k~ 1 J k _ 1 {ke) cos kut; 


y/a — (1 — e 2 ) 1/2 ^ k~ l J k _ 1 (ke) sin kut 



§25.1. FROM KEPLER'S LAWS TO EFFECTIVE POTENTIAL 


649 


[for these and further formulas of this type, see, for example, Wintner (1941), 
Siegel (1956), and Siegel and Moser (1971)]. Via such Fourier analysis one is 
in a position to calculate the intensity of gravitational radiation emitted at the 
fundamental circular frequency co and at the overtone frequencies (see Chap¬ 
ter 36). 


B. Einstein's Geometric Theory of Gravitation 


Connection between energy and momentum for a test particle of rest mass ju, 
traveling in curved space, 

g ap PaPp + M 2 = 0. (25) 

Gravitation shows up in no way other than in curvature of the geometry, in 
which the particle moves as free of all “real” force. Refer all quantities to basis 
of a test object of unit rest mass by dealing throughout with p = p/p. Also 
write p a = dS/dx a . Thus Hamilton-Jacobi equation for propagation of wave 
crests in Schwarzschild geometry (external field of a star; §23.6) becomes 


(1 - 2 M/r) 


m + (i - 2M/ r) m + ±m 2 + 

\ dt) \dr I r 2 \dd) r 2 sin 2 0 \ 06 / 


\ 2 e \d4>/ 

+ 1=0. (26) 


Solve Hamilton-Jacobi equation. As in Newtonian problem, simplify by eliminat¬ 
ing all motion in direction of increasing <p. Thus set 0 = = dS/d<t> (dynamic 

phase independent of <J>) and have 


5 = -St + L6 + f [£ 2 - (1 - 2M/r)(l + L 2 /r 2 )} 


1/2- ( rj \ 

(1-2 M/r)' K } 


Find shape of orbit by “principle of constructive interference”; thus, 


dS 

0 = =g 
dL 


L dr/r 2 

lE 2 - (1 - 2M/r)(\ + Z 2 A 2 )] 1/2 ‘ 


[See equation (25.41) and associated discussion in text; also Figure 25.6.] 
Find time to get to given r by considering “interference of wave crests” belonging 
to slightly differenC£_valucs: 




[. E 2 - (1 - 2M/r)(\ + L 2 /r 2 )] 1/2 (1 - 2 M/r) 


■ (29) 


[See equation (25.32) and associated discussion in text; also Figure 25.5 and 
exercise 25.15.] 
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From symmetries to 
conservation laws by: 

(1) Lagrangian or 

Hamiltonian approach 


(2) Killing-vector approach 


Killing vector, (, defined 


Killing's equation derived 


§25.2. SYMMETRIES AND CONSERVATION LAWS 

In analytic mechanics, one knows that symmetries of a Lagrangian or Hamiltonian 
result in conservation laws. Exercises 25.1 to 25.4 describe how these general prin¬ 
ciples are used to deduce, from the symmetries of Schwarzschild spacetime, constants 
of motion for the trajectories (geodesics) of freely falling particles in the gravitational 
field outside a star. The same constants of motion are obtained in a different language 
in differential geometry, where a “Killing vector” is the standard tool for the de¬ 
scription of symmetry. This section considers the general question of metric symme¬ 
tries before proceeding to Schwarzschild spacetime. 

Let the metric components gfi¥ relative to some coordinate basis dx a be independ¬ 
ent of one of the coordinates x K , so 

d gtlv /dx« = 0 for a-K. (25.1) 

This relation possesses a geometric interpretation. Any curve x a = c a (\) can be 
translated in the x K direction by the coordinate shift Ax K — e to form a “congruent” 
(equivalent) curve: 


x« = c a ( A) for a ^ K and x K = c K (\) + e. 

Let the original curve run from A = A x to A = A 2 and have length 

L = f X \g,Mmdx»/d\)(dx''/d\)]V* d\. 

\ 

Then the displaced curve has length 

m = f (dx'/j\Xdx’/d\)J' S d\. 


But the coefficient of e in the integrand is zero. Therefore the length of the new 
curve is identical to the length of the original curve: dL/de = 0. 

The vector 


f =E d/de = (d/dx K ) (25.2) 

provides an infinitesimal description of these length-preserving “translations.” It is 
called a “Killing vector.” It satisfies Killing’s equation* 

= 0 (25.3) 

(condition on the vector field f necessary and sufficient to ensure that all lengths 
are preserved by the displacement ef). This condition is expressed in covariant form. 


* Historical note: Wilhelm K. J. Killing, bom May 10, 1847, in Burbach, Westphalia, died February 
11, 1923 in Mllnster, Westphalia; Professor of Mathematics at the University of Milnster, 1892-1920, 
The key article that gives the name “Killing vector” to the kind of isometries considered here appeared 
almost a century ago [Killing (1892)]. 
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Therefore it is enough to establish it in the preferred coordinate system of (25.1) 
in order to have it hold in every coordinate system. In that preferred coordinate 
system, the vector field, according to (25.2), has components 

¥ = SV 

Therefore the covariant derivative of this vector field has components 


£ = e £“ = £T (——h r a £ ff ) 

Spa** ;v ofia y ^ v ' va** J 

_ Q ra — r _ I / _ ^gvK \ 

Sfia vK 2 \ dx v dx K dx* J 


(25.4) 


— 2 8»k,ii)- 


One sees that is antisymmetric in the labels p and v, as stated in Killing’s equa¬ 
tion (25.3). 

The geometric significance of a Killing vector is spelled out in Box 25.5. 

From Killing’s equation, = 0, and from the geodesic equation V p p = 0 for 
the tangent vector p = d/dX to any geodesic, follows an important theorem: In any 
geometry endowed with a symmetry described by a Killing vector field motion along 
any geodesic whatsoever leaves constant the scalar product of the tangent vector with 
the Killing vector: 


p K = p • ( = constant. (25.5) 

In verification of this result, evaluate the rate of change of the constant p K along 
the course of the typical geodesic (affine parameter A; result therefore as applicable 
to light rays—with zero lapse of proper time—as to particles); thus, 

dp K /d\ = (p%), v p v = + p { Y%w\ = °- ( 25 - 6 ) 

Turn back from a general coordinate system to the coordinates of (25.1), where 
the Killing vector field of the symmetry lets itself be written f = d/dx K . Then the 
scalar-product of (25.5) becomes constant = p a £ a = p a b a K — p K . The symmetry of 
the geometry guarantees the conservation of the K-th covariant coordinate-based 
component of the momentum. 

On a timelike geodesic in spacetime, the momentum of a test particle of mass 
p is 

p = d/d\ = pu = pd/dr. (25.7) 

Thus the affine parameter A most usefully employed in the above analysis, when 
it is concerned with a particle, is not proper time r but rather the ratio A = t/ju. 

When the metric g^ v is independent of a coordinate x K , that coordinate is called 
cyclic, and the corresponding conserved quantity, p K ,, is called the “momentum 
conjugate to that cyclic coordinate” in a terminology borrowed from nonrelativistic 
mechanics. 


Conservation of p * f for 
geodesic motion 


Terminology: 

"cyclic coordinate," 
"conjugate momentum" 


(continued on page 654) 
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r 


Box 25.5 


KILLING VECTORS AND ISOMETRIES (Illustrated by a Donut) 


A. On a given manifold (e.g., spacetime, or the donut 
pictured here), in a given coordinate system, the 
metric components are independent of a particular 
coordinate x K . Example of donut: 

ds 2 = b 2 dO 2 + (a - b cos 6) 2 d$ 2 ; 

g^ v independent of x K = 

B. Translate an arbitrary curve 6 through the infinitesi¬ 
mal displacement 

ef = e(d/dx K ) = e(d/d<>>) 9 £ < 1 

to form a new curve G'. In coordinate language 6 
is 0 = 6( A), <j> = <f>(A); while G' is 0 = 6( A), <> = 
<>(A) + e. (Translation of all points through A}> = e.) 
Because dg^/d# = 0, the curves G and G’ have the 
same length (see text). 




C. Pick a set of neighboring points d, G, and 
translate each of them through to obtain points 
d\ G\ Since the length of every curve is 
preserved by this translation, the distances between 
neighboring points are also preserved: 

(distance between (T and $') = 

(distance between d and $). 



But geometry is equivalent to a table of all infinitesi¬ 
mal distances (see Box 13.1). Thus the geometry of 
the manifold is left completely unchanged by a trans¬ 
lation of all points through e( . [This is the coordinate- 
free version of the statement dg^/dfy = 0.] One says 
that ( — d/d<j> is the generator of an ''isometry* y (or 
"group of motions'’) on the manifold. 

D. In general (see text), a vector field ((&) gener¬ 
ates an isometry if and only if it satisfies Killing’s 
equation ^ a . p) - 0. 
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'\ 


E. If generates an isometry (i.e, if £ is a “Killing 
vector”), then the curves 


9 ’(**, «!, •••,< ) 

[ parameter Tj ^ labels to tell 1 

on curve j L“which” curvej 


to which (is tangent [{ = are ca ^ ed 

“trajectories of the isometry.” 


Three different trajectories 
on a donut 



F. The geometry is invariant under a translation of all 
points of the manifold through the same Ax K along 
these trajectories [<?(x K , a v ..., a n ) — 6>{x K + 

~- * • > <* a ); “finite motion” built up from many 

“infinitesimal motions” e (.] 


G. This isometry is described in physical terms as fol¬ 
lows. Station a family of observers throughout the 
manifold. Have each observer report to a central 
computer (1) all aspects of the manifold’s geometry 
near him, and (2) the distances and directions to all 
neighboring observers (directions relative to “pre¬ 
ferred” directions that are determined by anisotro¬ 
pies in the local geometry). Through each observer’s 
position passes a unique trajectory of the isometry. 
Move each observer through the same fixed Ax K 
(e.g., Ax K = 17) along his trajectory, leaving the 
manifold itself unchanged. Then have each observer 
report to the central computer the same geometric 
information as before his motion. The information 
received by the computer after the motion will be 
identical to that received before the motion. There 
is no way whatsoever, by geometric measurements, 
to discover that the motion has occurred! This is the 
significance of an isometry. 
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EXERCISES 


Exercise 25.1. CONSTANT OF MOTION OBTAINED FROM 
HAMILTON'S PRINCIPLE 

Prove the above theorem of conservation of p K = p- f from Hamilton’s principle (Box 
13.3) 

8 f jg'JxXdxWXdx’/dK) d\= 0 (25.8) 


as applied to geodesic paths. Recall: In this action principle, g is to be regarded as a known 
function of position, along the path; and the path itself, is to be varied. 


Exercise 25.2. SUPER-HAMILTONIAN FORMALISM 
FOR GEODESIC MOTION 

Show that a set of differential equations in Hamiltonian form results from varying and 
x* independently in the variational principle 81 = 0, where 

7 = f( Pll dx» - X d\) (25.9) 


and 

X = jg‘“(x)p ll p,. 


(25.10) 


Show that the “super-Hamiltonian” DC is a constant of motion, and that the solutions of 
these equations are geodesics. What do the choices DC = + DC — 0, DC — — ^ 2 , or DC = — \ 
mean for the geodesic and its parametrization? 

Exercise 25.3. KILLING VECTORS IN FLAT SPACETIME 

Find ten Killing vectors in flat Minkowski spacetime that are linearly independent. (Restrict 
attention to linear relationships with constant coefficients). 

Exercise 25.4. POISSON BRACKET AS KEY TO CONSTANTS OF MOTION 

If f is a Killing vector, show that p K = commutes (has vanishing Poisson bracket) with 
the Hamiltonian DC of the previous problem, [DC,p K ] — 0, so dp K /d\ = 0. (Hint: Use a 
convenient coordinate system.) 

Exercise 25.5. COMMUTATOR OF KILLING VECTORS IS A KILLING VECTOR 

Consider two Killing vectors, f and q, which happen not to commute [as differential opera¬ 
tors; i.e., the commutator of equations (8.13) does not vanish; consider rotations about 
perpendicular directions as a case in point]; thus, 


[f,/7] = C 5*0. 


(a) Show that no single coordinate system can be simultaneously adapted, in the sense 
of equation (25.1), to both the ( and q symmetries (see exercise 9.9). 

(b) Let p(. — p fi ^ fi ,p v = and p$ = p^^y and derive the Poisson-bracket relationship 
[p&Pt}] — ~~P$- * n a geometry, the symmetries of which are related in this way, show that 
p£ is also a constant of motion. 

(c) In a coordinate system where ( = (d/dx K ), define DC as in (25.10) and show from 
[DC,p^] =r 0 that C is a Killing vector. 

Thus the commutator of two Killing vectors is itself a Killing vector . 
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Exercise 25.6. EIGENVALUE PROBLEM FOR KILLING VECTORS 

Show that any Killing vector satisfies = 0, and is an eigenvector with eigenvalue k = 0 
of the equation 

(25.11) 

Find a variational principle (Raleigh-Ritz type) for this eigenvalue equation. 


§25.3. CONSERVED QUANTITIES FOR MOTION 
IN SCHWARZSCHILD GEOMETRY 


Consider a test particle moving in the Schwarzschild geometry, described by the 
line element 


ds 2 = —(1 — 2 M/r)dt 2 + 


dr 2 

(1 - 2 M/r) 


+ r 2 (d8 2 + sin 2 0 d<f> 2 ). 


(25.12) 


This expression for the geometry applies outside any spherically symmetric center 
of attraction of total mass-energy M It makes no difference, for the motion of the 
particle outside, what the geometry is inside, because the particle never gets there; 
before it can, it collides with the surface of the star—if the center of attraction is 
a star, that is to say, a fluid mass in hydrostatic equilibrium. At each point throughout 
such an equilibrium configuration, the Schwarzschild coordinate r exceeds the local 
value of the quantity 2m(r)\ see §23.8. Therefore the Schwarzschild coordinate R 
of the surface exceeds 2M. Consequently, expression (25.12) applies outside any 
equilibrium configuration, no matter how compact (r > R > 2 M implies that one 
need not face the issue of the “singularity” at r — 2M). The more compact the 
configuration, however, the more of the Schwarzschild geometry the test particle 
can explore. The ideal limit is not a star in hydrostatic equilibrium. It is a star that 
has undergone complete gravitational collapse to a black hole. Then (25.12) applies 
arbitrarily close to r — 2M. This idealization is assumed here (“black hole”), because 
the analysis can then cover the maximum range of possible situations. 

Wherever the test particle lies, and however fast it moves, project that point and 
project that 3-velocity radially onto a sphere of some fixed r value, say, the unit 
sphere r — 1. The point and the vector together define a point and a vector on the 
surface of the unit sphere; and they in turn mark the beginning and define the totality 
of a great circle. As the particle continues on its way, the radial projection of its 

_position, will continue to lie o n that gTeat circle. To depart from the great circle on 

one side or the other would be to give preference to the one hemisphere or the other 
of the unit sphere, contrary to the symmetry of the situation. 

Orient the coordinate system so that the radial projection of the orbit coincides 
with the equator, 8 — n/2, of the polar coordinates (see Box 25.4 for the spherical 
trigonometry of a more general orientation of the orbit, and for eventual specializa- 


Why attention focuses on 
particle orbits around a black 
hole 


Choice of coordinates to 
make particle orbit lie in 
"equator/' 8 — tt/2 
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Conserved quantities for 
particle motion: 

(D £ 

(2) L 


(3) ju 


(4) E = E/jx 

(5) I = L/ix 


Effective potential V. and 
equations for orbit when 


tion to a polar orbit, in contrast to the equatorial orbit considered here). In polar 
coordinates as so oriented, the particle has at the start, and continues to have, zero 
momentum in the 0 direction; thus. 

p 6 = dO/dX = 0. 

The expression (25.12) for the line element shows that the geometry is unaffected 
by the translations t —t + At, <p —>- <f> + A$. Thus the coordinates t and 4> are 
“cyclic.” The conjugate momenta p 0 = —E and p$ = =tL (L > 0) are therefore 
conserved. This circumstance allows one immediately to deduce the major features 
of the motion, as follows. 

The magnitude of the 4-vector of energy-momentum is given by the rest mass 
of the particle, 


or 


g a pP“P^ + M 2 = g af> PaP)3 + M 2 = 0 


(25.13) 


E 2 


+ 


1 


(1 - 2 M/r) (1 - 2M/r) 


(i) ! 


+ 4 + ( . s =o. 


(25.14) 


Moreover, one knows from the equivalence principle that test particles follow the 
same world line regardless of mass. Therefore what is relevant for the motion of 
particles is not the energy and angular momentum themselves, but the ratios 


Recall also 


£ =£/ (energy per unit\ 

J \rest mass /’ 

2 * _ j^j _ /angular momentumX 
\per unit rest mass / 


A = r//x = fP r °P er time P er \ 

\unit rest mass / 


(25.15) 


Then (25,14) becomes an equation for the change of r-coordinate with proper time 
in which the rest mass makes no appearance: 


(~f = E 2 - (l - 2M/r)(\ + L 2 /r 2 ) 
= E 2 - F2(r). 


(25.16a) 


Here 


V(r) - [(1 - 2M/r)(l + I 2 A 2 )] 1/2 (25.16b) 


is the “effective potential” mentioned in §25.1 and Figure 25.2 and to be discussed 
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in §25.5. For the rate of change of the other two relevant coordinates with proper 
time, one has, assuming a “direct” orbit ( d<j>/dr > 0; = +L rather than — L), 


d<p 

dr 


\_tfy 
H dX 




L_ 

72 


(25.17) 


and 


&__}g°°E _ E 
dr n d\ ~ g. ~ n ~ 1 - 2M/r' 


(25.18) 



Knowing rasa function of t from (25.16), one can find <j> and t in their dependence 
on t from (25.17) and (25.18). Symmetry considerations have in effect reduced the 
four coupled second-order differential equations p^.^p" = 0 of geodesic motion to 
the single first-order equation (25.16). 

For objects of zero rest mass, it makes no sense to refer to proper time, and a 
slightly different treatment is appropriate (§25.6). 

Before looking, in §25.5, at the motions predicted by equations (25.16) to (25.18), 
it is useful to analyze the physical significance of the constants p 0 and p^ and to 
identify other physically significant quantities whose values will be of interest in 
studying these orbits. One calls E = —p 0 the “energy at infinity”; and L = |p \, 
for equatorial orbits, the “total angular momentum.” To justify these names, compare 
them with standard quantities measured by an observer at rest on the equator of 
the Schwarzschild coordinate system as the test particle flies past him in its orbit. 
Let 

£ Iocal =p 6 = <w\p) = <|g 00 l 1/2 d‘,P> = \g 00 \ 1 / 2 P° 

= \g 00 \ l ^dt/dX = (1 - 2M/r) iri dt/dX { ' ' 


Interpretation of E as 
"energy at infinity" and L as 
"angular momentum" 


be the energy he measures in his proper reference frame, and let 

<w*,p) <| g^J^d/dX) 

^ mO P 77 

P ^local ^local 

= rjd^/dX) _ p ± 

^local ^^local 


(25.20) 


be the tangential component of the ordinary velocity he measures. [Note: cu a are 
the basis one-forms of the observer’s proper reference frame; see equations 
(23.15a,b).] In terms of these locally measured quantities, the energy-at-infinity is 

E = -Po = -gooP° = l^ool 1/2 ^iocai = (1 - 2A/ A) 1/2 ^iocai = constant. (25.21) 

It therefore represents the locally measured energy £ local , corrected by a factor 
|g 00 | 1/2 . For any particle that flies freely (geodesic motion) from this observer to 
r = oo, the correction factor reduces to unity, and E local (as measured by a second 
observer, this time at infinity) becomes identical with E. Similarly the angular 
momentum from (25.20) is 


P 0 = ^iocaiV = constant. 


(25.22) 
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EXERCISES 


This, like E = —p 0 , represents a quantity that is conserved, and whose interpretation 
for r —oo on any orbit is familiar. Finally, recall that the total 4-momentum 
of two colliding particles p x 4- p 2 or ( P/Ji + (p^) 2 is conserved in a point collision 
(at any r). Therefore the totals (E) l 4 (E) 2 = (~p 0 ) l 4- (-p 0 ) 2 and i + (p 0 ) 2 
are also conserved. One of the colliding particles may be on an orbit that could 
never reach out to r — oo, but this makes no difference. This conservation principle 
allows and forces one to take over the terms E = “energy at infinity” and L — 
“angular momentum,” valid for orbits that do reach to infinity, for an orbit that 
does not reach to infinity. 


Exercise 25.7. RADIAL VELOCITY OF A TEST PARTICLE 

Obtain a formula for the radial component of velocity v; that an observer at r would measure 
[see (25.20) for v ^]. Express £ locaI , and v$ in terms of r and the constants E, p 

Exercise 25.8. ROTATIONAL KILLING VECTORS FOR 
SCHWARZSCHILD GEOMETRY 

(a) Show that in the isotropic coordinates of exercise 23.1, the metric for the Schwarzschild 
geometry takes the form 

ds 2 = -(1 - M/2r) 2 (l -f M/27)~ 2 dt 2 + (1 + M/2r) 4 (dr 2 + 7‘ 2 d$2 2 ). (25.23) 

(b) Exhibit a coordinate transformation that brings this into the form 

ds 2 = -(1 - M/2r) 2 (\ -f M/27)~ 2 dt 2 4 (1 4 M/2r)\dx 2 4 dy 2 4 dz 2 ), (25.24) 
with 7 = ( x 2 4 y 2 4 z 2 ) 1/2 . 

(c) Show that f x — y(d/Zz) - z{d/dy) and similar vectors ( y and are each Killing 
vectors, by verifying (see exercise 25.5c) that the Poisson brackets [3f, L K \ vanish for each 
Lk ~ P ‘ K = x, y, z. 

(d) Show that ( z = (d/d<j>) t r e \ and show that for orbits in the equatorial plane L z = 

L x = L y = 0. 

Exercise 25.9. CONSERVATION OF TOTAL ANGULAR MOMENTUM 
OF A TEST PARTICLE 

Prove by a Poisson-bracket calculation that the total angular momentum squared, L 2 = 
p e 2 4 (sin 6y 2 p^ 2 is a constant of motion for any Schwarzschild geodesic. 

Exercise 25.10. SELECTING EQUATION BY SELECTING WHAT IS VARIED 

Write out the integral I that is varied in (25.8) for the special case of the Schwarzschild 
metric (25.12). What equation results from the demand 81 = 0 if only <>(\) is varied? If 
only /(A)? 

Exercise 25.11. MOTION DERIVED FROM SUPER-HAMILTONIAN 

Write out the super-Hamiltonian (25.10) for the special case of the Schwarzschild metric. 
Deduce from its form that p Q and p^ are constants of motion. Derive (25.14), (25.17), and 
(25.18) from this super-Hamiltonian formalism. 



§25.5. ORBITS OF PARTICLES 659 

§25.4. GRAVITATIONAL REDSHIFT 

The conservation law |gool 1/2 £i OC ai = constant (equation 2521), which is valid in this 
form for any time-independent metric with g 0j = 0 and for particles with both zero 
and non-zero rest mass, is sometimes called the “law of energy red-shift.” It describes 
how the locally measured energy of any particle or photon changes (is “red-shifted” 
or “blue-shifted”) as it climbs out of or falls into a static gravitational field. For 
a particle of zero rest mass (photon or neutrino), the locally measured energy E local , 
and wavelength \, ocal (not to be confused with affine parameter!), are related by 
•^locai =: V^iocai> where h is Planck’s constant. Consequently, the law of energy 
red-shift can be rewritten as 


x iocail&ool 1/2 = constant. (25.25) 

A photon emitted by an atom at rest in the gravitational field at radius r, and received 
by an astronomer at rest at infinity is red-shifted by the amount 

Z ~ (^received ^emittedV^emitted = l£oo( r )l 1 > 

z = (1 - IM/r)- 1 ' 2 - 1, (25.26) 

z zz M/r in Newtonian limit. (25.26N) 

Note that these expressions are valid whether the photon travels along a radial path 
or not. 


Exercise 25.12. REDSHIFT BY TIMED PULSES 

Derive expression (25.26) for the photon redshift by considering two pulses of light emitted 
successively by an atom at rest at radius r. [Hint: If dr em is the proper time between pulses 
as measured by the emitting atom, and dr rec is the proper time separation as measured by 
the observer at r = oo, then one can idealize X em as dr em and X rec as dr rec .] 


§25.5. ORBITS OF PARTICLES 

Turn attention now from energy red-shift to the orbit of a particle in the Schwarzs- 
child geometry. The position as a function of proper time follows upon solving 
(25.16a), 

(rfr) 2 + P2(r) = E2 ’ (25 ' 16a ) 

where V is the “effective potential” defined by 

V°-(r) = (1 - 2M/r){\ + L°-/r 2 ) 


Law of "energy redshift" 
("gravitational redshift") 


EXERCISE 


(25.16b) 
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Qualitative features of orbits 
diagnosed from 
effective-potential diagram 



and illustrated in Figure 25.2 and Box 25.6. The first diagram in Box 25.6 gives 
V\r) as a function of r. It is relevant even in the “domain inside the black hole” 
(r < 2 A/), where V 1 2 is negative (see Chapter 31). It serves as a model for, and is 
closely related to, the “effective potential” B~ 2 (r) used in §25.6 to analyze photon 
orbits. The final diagram in Box 25.6 gives V(r) itself as a function of r. Energy 
levels in this diagram or in Figure 25.2 can be interpreted as in any conventional 
energy-level diagram. The difference in energy between two levels represents energy, 
as measured at infinity, of the photon given off in the transition from the one level 
to the other. Whether one plots V(r) or V 2 (r) as a function of r is largely a matter 
of convenience. The important point is this: a value of r where V{r) becomes equal 
to the available energy £, or V 2 (r) becomes equal to E 2 , is a turning point . A particle 
that was moving to larger r values, once arrived at a turning point, turns around 
and moves to smaller r values. Or when a particle moving to smaller r values comes 
to a turning point, it reverses its motion and proceeds to larger r values. A turning 
point is not a point of equilibrium. A stone thrown straight up does not sit at a 
point of equilibrium at the top of its flight. However, when E — K(r), or E 2 — V 2 (r), 
instead of having a single root, has a double root, then one does deal with a point 
of equilibrium (only possible because of “centrifugal force” fighting against gravity). 
When this equilibrium occurs at a minimum of K(r), it is a stable equilibrium; at 
a maximum, an unstable equilibrium. Thus all the major features of the motion 
in the r direction can be read from a plot of the effective potential as a function 
of r (plot depends on value of L) and from a knowledge of the E value (Figure 
25.2, with further details in Box 25.6). 


Box 25.6 


A. Equations Governing Orbit 
(see text for derivation) 

1. Effective-potential equation for radial part 
of motion: 

(dr/dr ) 2 + F2(Z, r) = E 2 , 

V 2 (L, r) = (1 - 2M/r)(l + Z 2 /r 2 ), 

E = (energy at infinity per unit rest mass), 
L = (angular momentum per unit rest 
mass). 

2. Supplementary equations for angular and 
time motion for “direct” orbit, dfy/dr > 0: 

d<t>/dr = Z/r 2 , 
dt = E 
dr 1 — 2 M/r 



~\ 



“Turning points” of orbit occur where 
horizontal line of height E 2 crosses V 2 



QUALITATIVE FEATURES OF ORBITS OF A PARTICLE 
MOVING IN SCHWARZSCHILD GEOMETRY 
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Newtonian Limit, |£ — 1| 1, 

M/r 4 1, 1/r < 1 

1. Speak not about “energy-at-infinity per 
unit rest mass,” E = E/p = (1 — v 2 )~ 1/2 , 
but instead about the “nonrelativistic en- 
er gy per unit rest mass,” 

£=1(£ 2 - \)^E- 

2. Speak not about r) but instead about 
the Newtonian effective potential, 

F iV (Z,r) = i(F2_ + 

2 r 2r 2 

3. Rewrite effective-potential equation in the 
form 


}(£)’ +'>=«• 

4. From the effective-potential diagram and 
the subsidiary equation dfy/dr = L/r 2 , 
conclude that: 

a. Particles with e > 0 (E > 1) come in 
from r = oo along hyperbolic or pa¬ 
rabolic orbits, are reflected off the 
effective potential at e = pyi: 2 = F 2 ; 
“turning point”; {dr/dr) 2 = 0], and 
return to r — oo. 

b. Particles with e < 0 (£ < 1) move 
back and forth in an effective potential 
well between periastron (inner turning 
point of elliptic orbit) and apastron 
(outer turning point). 


Relativistic Orbits 

Use the effective-potential diagram of part A 
(reproducedliefelor various L), in the same 
way one uses the Newtonian diagram of part 
B, to deduce the qualitative features of the 
orbits. The main conclusions are these. 
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Box 25.6 (continued) 

1. Orbits with periastrons at r > M are Kep- 
lerian in form, except for the periastron 
shift (exercise 25.16; §40.5) familiar for 
Mercury. 

2. Orbits with periastrons at r < 10 M differ 
markedly from Keplerian orbits. 

3. For L/M < 2^/3 there is no periastron; 
any incoming particle is necessarily pulled 
into r = 2 A/. 

4. For 2 \/3 < L/M < 4 there are bound 
orbits in which the particle moves in and 
out between periastron and apastron; but 
any particle coming in from r — oo (un¬ 
bound; E 2 > 1) necessarily gets pulled into 
r = 2M. 

5. For V = L/M > 4, there are bound 
orbits; particles coming in from r = oo 
with 

E 2 < VmJ = 0 - 20(1 + ^ t2 0. 

1 + Vl - 12/£ t2 

= g 

reach periastrons and then return to r = 
oo; but particles from r — oo with £ 2 > 
K max 2 get pulled into r — 2M. 

6. There are stable circular orbits at the mini¬ 
mum of the effective potential; the mini¬ 
mum moves inward from r = oo for L = 
oo to r = 6M for L 1 * = Z/M = 2 y/3* The 
most tightly bound, stable circular orbit 
(L/M = 2 y/3, r = 6 M) has a fractional 
binding energy of 

tzJL = 1 -^= 1 - V879" = 0.0572. 

7. There are unstable circular orbits at the 
maximum of the effective potential; the 
maximum moves outward from r — 3M for 
L — oo to r = 6 M for L/M = 2 \ // 3. A 
particle in such a circular orbit, if per¬ 
turbed inward, will spiral into r — 2 M. If 
perturbed outward, and if it has E 2 > 1, 
it will escape to r = oo. If perturbed out¬ 



ward, and if it has E 2 < 1, it will either 
reach an apastron and then enter a spiral¬ 
ing orbit that eventually falls into the star 
(e.g., if SE > 0, with unchanged angular 
momentum); or it will move out and in 
between apastron and periastron, in a sta¬ 
ble bound orbit (e.g., if SE < 0, again with 
unchanged angular momentum). 
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When one turns from qualitative features to quantitative results, one finds it 
appropriate to write down explicitly the proper time At required for the particle 
to augment its Schwarzschild coordinate by the amount Ar; thus (with the convention 
that square roots may be negative or positive, \/a 2 = ±a) 


T = f dT = f 


dr 

[E 2 - (1 - 2M/r){ 1 + Z 2 /r 2 )] 172 * 


(25.27) 


The integration is especially simple for a particle falling straight in, or climbing 
straight out, for then the angular momentum vanishes and the integral can be written 
in an elementary form that applies (with the change r — i) even in Newtonian 
mechanics, 


T = f dT = f 


_ dr _ 

[2M/r — 2M/R] 1/2 * 


(25.27') 


Here R == 2M/(1 — E 2 ) is the radius at which the particle has zero velocity (“apas- 
tron”). The motion follows the same “cycloid principle” that is so useful in nonrela- 
tivistic mechanics (Figure 25.3). Thus, in parametric form, one has 


r = y (1 + COST)), 


R ( R \ 1/2 
2 \2m) 


(t) + sin tj), 


(25.28) 


with the total proper time to fall from rest at r — R into r = 0 given by the expression 


= ?Lr(A -\ 1/2 
2 R \2m) 


(25.29) 


(shorter by a factor l/\/2 than the time for fall under pull of the same mass, 
distributed over a sphere of radius see dotted curve in Figure 25.3). 

What about the Schwarzschild-coordinate time taken for a given motion? Take 
equation (25.16a) for general motion (radial or nonradial), and where dr/dr appears, 
replace it by 

dr _ dr dt 
dr 


dt dr 


dr E _ £ dr* 
dt 1 — 2 M/r dt 


(25.30) 


Here r* is an abbreviation for a new “tortoise coordinate,” 


'’=S d '’=Sj^MTr-' + lma { JM-'Y 


(25.31) 


which was introduced by Wheeler (1955) and popularized by Regge and Wheeler 
(1957). Thus find the equation 


(e^) 2 + V 2 = E 2 . 


(25.32) 



Radial orbits: 


(1) “cycloidal'' form of r(r) 
for radial bound orbits 


(2) “tortoise" radial 

coordinate as function of 
coordinate time, r*{t) 
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Figure 25.3. 

A cycloid gives the relation between proper time and Schwarschild r coordinate for a test particle falling 
straight in toward center of gravitational attraction of negligible dimensions. The angle of turn of the 
wheel as it rolls on the base line and generates the cycloid is denoted by tj. In terms of this parameter, 
one has 


— (1 H- cos i]) 


(Schwarzschild r-coordinate) 


T 


R / R \ 1/2 
2 \ 2 M! 


(tj 4- sin tj) 


(proper time) 


(note difference in scale factors in expressions for r and for r). The total lapse of proper time to fall 
from r=/?tor = 0isT = (v/2)(R 3 /2M) 1/2 . The same cycloid relation and the same expression for 
time to fall holds in Newton's nonrelativistic theory of gravitation, except that there the symbol r is 
to be replaced by the symbol t (ordinary* time). Were one dealing in Newtonian theory with the same 
attracting mass M spread uniformly over a sphere of radius R , with a pipe thrust through it to make 
a channel for the motion of the test particle, then that particle would execute simple harmonic oscillations 
(dotted curve above). The angular frequency to of these vibrations would be identical with the angular 
frequency of revolution of the test particle in a circle just grazing the surface of the planet, a frequency 
given by Kepler’s law M = to 2 /? 3 . In this case, the time to fall to the center would be (tt/ 2)(/? 3 /A/) 1/2 , 
longer by a factor 2 1/2 than for a concentrated center of attraction (concentrated mass: stronger accelera¬ 
tion and higher velocity in the later phases of the fall). The expression for the Schwarzschild-coordinate 
time t required to reach any point r in the fall under the influence of a concentrated center of attraction 
is complicated and is not shown here (see equation 25.37 and Figure 25.5). 

The same cycloidal relation that connects r with time for free fall of a particle also connects the radius 
of the “Friedmann dust-filled universe” with time (see Box 27.1), except that there the cycloid diagram 
applies directly, without any difference in scale between the two key variables: 


/radius of\ _ £. (j __ CO stj) ~ — tj 2 (for small tj), 

\3-sphere / 2 4 

/ coordinate time \ 

identical with \ 

proper time as I = y (tj — sin-rj) ~ — tj 3 (for small tj). 
measured on dust I 
\ particle / 

The starting point of tj is renormalized to time of start of expansion; see Lindquist and Wheeler (1957) 
for more on correlation between fall of particle and expansion of universe. 


Here the effective potential is the same effective potential that one dealt with before, 

V = [(1 - 2M/r)(l + L 2 /r 2 )] 1/2 . (25.33) 

Moreover, the E on the righthand side is the same E that appeared in the earlier 
equation for (dr/dr) 2 . Therefore the turning points and the qualitative description 
of the motion are both the same as before. “A turning point is a turning point is 
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a turning point.” Right? Right about turning points; wrong about the conclusion. 

The story has it that Achilles never could pass the tortoise. Whenever he caught (3) details of the approach to 
up with where it had been, it had moved ahead to a new location; and when he the Schwarzschild radius 

got there, it was still further ahead; and so on ad infinitum. Imagine the race between ^ ~~ 

Achilles and the tortoise as running to the left and the expected point of passing 
as lying at r — 2M. The r-coordinate has no inhibition about passing through the 
value r — 2M. Not so r*, the “tortoise coordinate.” It can go arbitrarily far in the 
direction of minus infinity (corresponding to the infinitely many times when Achilles 
catches up with where the tortoise was) and still r remains outside r = 2M: 


r/2M 

1.000001 

1.0001 

1.01 

1.278465 

2 

5 

10 

10,000 

r*/2M 

-12.8155 

-8.2102 

-3.5952 

0 

2 

6.386 

12.303 

10,009.210 


It follows that there is a great difference between the description of the motion 
in terms of the proper time t of a clock on the falling particle (r goes all the way 
from r = R down to r = 0 in the finite proper time of 25.29) and a description of 
the motion in terms of the Schwarzschild-coordinate time t appropriate for the 
faraway observer (r* goes all the way from r* = R* down to r* = -oo; infinite 
t required for this; but even in infinite time, as r* goes down to — oo, r is only brought 
asymptotically down to r ~~ 2 M). Thus the second description of the motion leaves 
out, and has no alternative but to leave out, the whole range of r values from r — 2M 
down to zero: perfectly good physics, and physics that the falling particle is going 
to see and explore, but physics that the faraway observer never will see and never 
can see. If the tortoise coordinate did not exist, it would have to be invented. It 
invests each factor ten of closer approach to r = 2 M with the same interest as the 
last factor ten and the next to come. It proportions itself in accord with the amount 
of Schwarzschild-coordinate time available to the faraway observer to study these 
more and more microscopic amounts of motion in more and more detail. 

_F igure 2 5.4 shows the effective potential V of (25.33) and of Figure 25.2 replotted 

as a function of the tortoise coordinate. The approach of V to zero at r — 2M shows 
up as an exponential approach of V to zero as r* goes to minus infinity. Thus in 
moving “towards the black hole” ( r = 2M, r* = — oo), the particle, as described 
in coordinate time t , soon casts off any effective influence of any potential, and moves 
essentially freely toward decreasing r*, in accordance with the equation 



(25.34) 


that is, “with the speed of light” (dr*/dt ~ — 1). This dependence of r* on t implies 
at once an asymptotic dependence of r itself on Schwarzschild-coordinate time t , 
of the form 

r = 2 M + (constant x e~ t/2M ). (25.35) 

This result is independent of the angular momentum of the particle and independent 
also of the energy, provided only that the energy-per-unit-mass E is enough to 
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— r*/M 

Figure 25.4. 

Effective potential for motion in Schwarzschild geom¬ 
etry, expressed as a function of the tortoise coordinate, 
for selected values of the angular momentum of the test 
particle. The angular momentum L is expressed in units 
A/jn, where M is the mass of the black hole and fi the 
mass of the test particle. The effective potential (includ¬ 
ing rest mass) is expressed in units /a; thus, V = V/p. 
The tortoise coordinate r* = r + 2M In (r/2M — 1) is 
given in units M. — - - 


surmount the barrier (Figure 25.4) of the effective potential-per-unit-mass V (More 
will be said on the approach to r — 2 M in Chapter 32, on gravitational collapse.) 

To replace the asymptotic formula (25.35) by a complete formula requires one 
to integrate (25.32); thus, 


-/*“/ 7r 
= / 


E dr* 


[E 2 - V 2 ] 1/2 


dr 


(25.36) 


[E 2 - (1 - 2M/r)(l + L 2 /r 2 )] 1/2 (1 - 2 M/r) 


The integration here is not easy, even for pure radial motion (L — 0), as is seen 
in the complication of the resulting expression (Khuri 1957): 


t 


Kf +2 ")(2s- i r]’ i+ f(B?-T si "’’ 

1 (R/2M - 1 ) 1/2 + tan (tj/2) I 
+ I (R/2M — 1) 1/2 — tan(r)/2) | 


(25.37) 


Here rj is the same cycloid parameter that appears in equation (25.28) and Figure 
25.3 (see the detailed plot in Figure 25.5 of the correlation between r and t y illustrat- 
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Figure 25.5. 

Fall toward a Schwarzschild black hole as described (a) by a comoving observer (proper time t) and 
(b) by a faraway observer (Schwarzschild-coordinate time /). In the one description, the point r — 0 
is attained, and quickly [see equation (25.28)]. In the other description, r — 0 is never reached and even 
r = 2 M is attained only asymptotically [equations (25.35) and (25.37)]. The qualitative features of the 
motion in both cases are most easily deduced by inspection of the “effective potential-per-unit-mass” 
V in its dependence on r (Figure 25.2) when one is interested in proper time; or the same effective 
potential V in its dependence on the “tortoise coordinate” r* [Figure 25.4 and equation (25.31)] when 
one is interested in Schwarzschild-coordinate time t. 


ing the asymptotic approach to r — 2 M). The difficulty in the integration for f, as 
compared to the ease of the integration for r (25.28), has a simple origin. Only two 
r-values appear in (25.27a) as special points when Z is zero: the starting point, r = R, 
where the velocity vanishes, and the point r = 0, where dr/dr becomes infinite. In 
contrast (25.36), rewritten as 

f r [! -2 Mimn d± 

J J [2M/r - 2 M/R ] 1/2 (1 - 2 M/r) V ' 

contains three special points: r — R, r — 0, and the added point with all the new 
physics, r — 2M. To admit angular momentum is to increase the number of special 
points still further, and to make the integral unmanageable except numerically or 
qualitatively (via the potential diagram of Figure 25.4), or in terms of elliptic 
functions [Hagihara (1931)]. 

It is often convenient to abstract away from the precise value r = R at the start (4) free-fall from r = co 
of the collapse. In this event, one deals with the limit R —oo. Then it is convenient 
to displace the zero of proper time to the instant of final catastrophe. In this limit, 
one has 

r/2M= -(2/3)(r/2M) 3/2 , 

t/2M = — (2/3)(r/2M) 3/2 - 2(r/2M)"* + In + \ . (25.38) 

{r/2M) 1/z — 1 

At very large negative time, the particle is far away and approaching only very 
slowly. Then one can write 

r = (9 Mt 2 /2) 1/3 - (9M( 2 /2) in 


(25.39a) 




668 


25 . PARTICLE MOTION IN SCHWARZSCHILD GEOMETRY 


whether one refers to coordinate time or to proper time. However, the final stages 
of infall are again very different, when expressed in terms of proper time (r —0, 
r - —0), from what they are as expressed in terms of Schwarzschild-coordinate 
time, 

r/2M = 1 + 4e- 8/3 e-‘' 2M . (25.39b) 

Nonradial orbits: Turning from pure radial motion to motion endowed with angular momentum, 

(1) Fourier analysis one has a situation where one would like to express the principal quantities of the 

motion (components of displacement, velocity, and acceleration) in Fourier series 
(in Schwarzschild-coordinate time), these being so convenient in the Newtonian limit 
in analyzing radiation and perturbations of one orbit by another and tidal perturba¬ 
tions of the moving particle itself by the tide-producing action of the center of 
attraction. Any exact evaluation of these coefficients would appear difficult. For the 
time being, the values of the Fourier amplitudes would seem best developed by 
successive approximations starting from the Newtonian analysis (see Box 25.4 and 
references cited there). 

In connection with any such Fourier analysis, it is appropriate to recall that the 
fundamental frequency alone appears, and all higher harmonics have zero amplitude, 
when the motion takes place in an exactly circular orbit (opposite extreme from 
the pure radial motion of L = 0). Therefore it is of interest to note (exercise 25.19) 
that the circular frequency to of this motion, as measured by a faraway observer, 
is correlated with the Schwarzschild r-value of the orbit by exactly the Keplerian 
formula of non-relativistic physics: 

co 2 r 3 = M (exact; general relativity). (25.40) 

(2) details of angular motion Turn now from the correlation between r and time to the correlation between 

r and angle of revolution (<f> in the analysis here; 9 in the Hamilton-Jacobi analysis 
of Box 25.4; this difference in name is irrelevant in what follows). Return to equation 
(25.16), 

iff + p2(r >=**■ 

and recall also equation (25.17) 

d±_L_ 

dr r 2 ' 

Solve the second equation for dr, and substitute into the first to find 

+ - *■ <25 - 41) 

or equivalently, with u = M/r and U = L/M = L/Mfi, 

(du\ 2 E 2 - (1 - 2«)(1 + L t2 u 2 ) 

[d$/ ~ V 2 


(25.42) 
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Exercise 25.16 presents an alternative differential equation derived from this for¬ 
mula, and uses it to obtain the following expression for the angle swept out by the 
particle or planet, moving in a nearly circular orbit, between two successive points 
of closest approach: 


Af> = 


_2w_ 

(1 — 6M/r 0 ) 1/2 ' 


(25.43) 


The radial motion turns around from ingoing to outgoing, or from outgoing to 
ingoing, whenever the quantity E 2 - V 2 (r), or E - V(r), plotted as a function of 
r, undergoes a change of sign, and this as clearly here in the correlation between 
r and 4> as in the earlier correlation between r and time. Recall again the curves 
of Figure 25.2 for V(r ) as a function of r for selected Z values. From them one can 
read out, without any calculation at all, the principal features of typical orbits (Box 
25.6) obtained by detailed numerical calculation. Characteristic features are 


(1) circular orbit when E coincides with a minimum of the effective potential V(r), 

(2) precession when E is a little more than F min , 

(3) temporary “orbiting” (many turns around the center of attraction) when E 
is close to a maximum V max of the effective potential, 

(4) “capture into the black hole” when E exceeds F max . 


A more detailed analysis appears in Box 25.6. [For explicit analytic calculation of 
orbits in the Schwarzschild geometry, see Hagihara (1931), Darwin (1959 and 1961), 
and Mielnik and Plebanski (1962).] 

For orbits of positive energy, no feature of the inverse-square force is better known 
than the Rutherford scattering formula. It gives the “effective amount of target area” 
presented by the center of attraction for throwing particles into a faraway receptor 
that picks up everything coming off into a unit solid angle at a specified angle of 
deflection 0: 


~dn = [4( 1- 1) sin 2 6>/2] 2 ( Rutherford ; nonrelativistic) (25.44) 

(derivation in equations 8 to 15 of Box 25.4). When one turns from the Newtonian 
analysis to the general-relativity treatment, one finds two striking new features of 
the scattering associated with the phenomenon of orbiting. (1) The particles that 
come off at a given angle of deflection 0 now include not only those that have really 
been deflected by 0 (the only contribution in Rutherford scattering), but also those 
that have been deflected by 0 -f 2-77, 0 + 4-77, ... etc. (an infinite series of contribu¬ 
tions). (2) These supplementary contributions, while finite in amount, and even finite 
in amount “per unit range of <9,” are not finite in amount when expressed “per 
unit of solid angle d& = 2-77 sin 0 d0 ” in either the forward direction (0 = 0) or 
the backward direction (0 = 77). This circumstance produces no spectacular change 
in the forward scattering, for that is already infinite in the nonrelativistic approxi¬ 
mation (infinity in Rutherford value of do/dS2 as 0 = 0 is approached, arising from 


(3) nearly circular orbits: 
periastron shift 


(4) qualitative features of 
angular motion 


Scattering of incoming 
particles: 


(1) Rutherford 

(nonrelativistic) cross 
section 


(2) new features due to 
relativistic gravity 
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particles flying past with large impact parameters and experiencing small deflections; 
see exercise 25.21). In contrast, the backward scattering, which was perfectly finite 
in the Rutherford analysis, acquires also an infinity: 

l do \ constant (25.45) 

\dO/ e ^ v sin 0 

This concentration of scattering in the backward direction is known as a “glory.” 
The effect is most readily seen by looking at the brilliant illumination that surrounds 
the shadow of one’s plane on clouds far below (180° scattering of light ray within 
waterdrop). It is also clearly seen in observations on the scattering of atoms by atoms 
near <9 = 180°. No dwarf star, not even any neutron star, is sufficiently compact 
to be out of the way of a high-speed particle trying to make such a 180° turn. Only 
a black hole is compact enough to produce this effect. 

Further interesting features of motion in Schwarzschild geometry appear in the 
exercises below. 


Exercise 25.13. QUALITATIVE FORMS OF PARTICLE ORBITS 

Verify the statements about particle orbits made in part C of Box 25.6. 

Exercise 25.14. IMPACT PARAMETER 

For a scattering orbit (i.e., unbound orbit), show that L = where b is the impact 

parameter and v x the asymptotic ordinary velocity; also show that 

b = Z/(£ 2 - 1) 1/2 . (25.46) 

Draw a picture illustrating the physical significance of the impact parameter. 

Exercise 25.15. TIME TO FALL TO r = 2 M 

Show from equation (25.16) and the first picture in Box 25.6 that orbits (general L value!) 
which approach r — 2M do so in a finite proper time, but (equation 25.32) an infinite 
coordinate time t. For equilibrium stars, which must have radii R > 2 M, the coordinate time 
t to fall to the surface is finite, of course. 

Exercise 25.16. PERIASTRON SHIFT FOR NEARLY CIRCULAR ORBITS 

Rewrite equation (25.42) in the form 

(du/d<t>) 2 + (1 - 6 u 0 )(u - u 0 f - 2 (u - u 0 ) 3 = (. E 2 - E$)/V 2 . (25.47) 

Express the constant u 0 = M/r 0 in terms of L/M , and express E 0 in terms of u 0 . Show for 
a nearly circular orbit of radius r 0 that the angle swept out between two successive periastra 
(points of closest approach to the star) is 

A<t> = 2ir(l - 6M/r 0 )~ 1/2 . (25.48) 


Sketch the shape of the orbit for r 0 = 8M. 



§25.5. ORBITS OF PARTICLES 


671 


Exercise 25.17. ANGULAR MOTION DURING INFALL 

From equation (25.42), deduce that the total angle A<j> swept out on a trajectory falling into 
r— 0 is finite. The computation is straightforward; but the interpretation, in view of the 
behavior of t(X) on the same trajectory (equation 25 32 and exercise 25.15), is not. The 
interpretation will be elucidated in Chapter 31. 

Exercise 25.18. MAXIMUM AND MINIMUM OF EFFECTIVE POTENTIAL 

Derive the expressions given in the caption of Figure 25.2 for the locations of the maximum 
and the minimum of the effective potential as a function of angular momentum. Determine 
also the limiting form of the dependence of barrier height on angular momentum in the 
limit in which Z is very large compared to M. 

Exercise 25.19. KEPLER LAW VALID FOR CIRCULAR ORBITS 

From dfy/dr of (25.17) and dt/dr of (25.18), deduce an expression for the circular frequency 
of revolution as seen by a faraway observer; and from the results of exercise 25.18 (or 
otherwise) show that it fulfills exactly the Kepler relation 

w 2 r 3 = M 

for any circular orbit of Schwarzschild r-value equal to r, whether stable (potential minimum) 
or unstable (potential maximum). 

Exercise 25.20. HAMILTON-JACOBI FUNCTION 

Construct the locus in the r, 6 diagram of points of constant dynamic phase S(t, r, 9) = 0 
for / = 0 and for values Z = 4Af, E = 1 (or for Z = 2 \/3Af, E ~ (8/9) 1/2 , or for some other 
equally simple set of values for these two parameters). Show that the whole set of surfaces 
of constant S can be obtained by rotating the foregoing locus through one angle, then another 
and another, and recopying or retracing. Interpret physically the principal features of the 
resulting pattern of curves. 

Exercise 25.21. DEFLECTION BY GRAVITY CONTRASTED WITH 
DEFLECTION BY ELECTRIC FORCE 

A test particle of arbitrary velocity ft flies past a mass M at an impact parameter b so great 
that the deflection is small. Show that the deflection is 

e = w (1 + n (25A9) 

Derive the deflection according to Newtonian mechanics for a particle moving with the speed 
of light. Show that (25.49) in the limit /? —>- 1 is twice the Newtonian deflection. Derive 
also (flat-space analysis) the contrasting formula for the deflection of a fast particle of rest 
mass jit and charge e by a nucleus of charge Ze, 

8 = ^('-P 2 ) 1/2 - (25-50) 

How feasible is it to rule out a “vector” theory of gravitation [see, for example, Brillouin 
(1970)], patterned after electromagnetism, by observations on the bending of light by the 
sun? [Hint: To simplify the mathematical analysis, go back to (25.42). Differentiate once 
with respect to to convert into a second-order equation. Rearrange to put on the left all 
those terms that would be there in the absence of gravity, and on the right all those that 
originate from the —2 u term (gravitation) in the factor (1 — 2 m). Neglect the right-hand 
side of the equation and solve exactly (straight-line motion). Evaluate the perturbing term 
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Orbits for particles of zero 
rest mass: 


(1) impact parameter defined 


(2) shape of orbit 


on the right as a function of <j> by inserting in it the unperturbed expression for u($). Solve 
again and get the deflection.] 

Exercise 25.22. CAPTURE BY A BLACK HOLE 

Over and above any scattering of particles by a black hole, there is direct capture into the 
black hole. Show that the cross section for capture is ^b\ txV with the critical impact parameter 
6 crit gi ven ^ ^crit/(^ 2 “ M 2 ) 1/2 * From the formulas in the caption of Fig. 25.2 or otherwise, 
show that for high-energy particles this cross section varies with energy as 

%, = 2 7 ^ 2 (l+^+-") (25.51) 

(photon limit for E —>* oo) and for low energies as 

o capt = \(,kMVP\ (25.52) 

where /? is the velocity relative to the velocity of light [Bogorodsky (1962)]. 


§25.6. ORBIT OF A PHOTON, NEUTRINO, OR 

GRAVITON IN SCHWARZSCHILD GEOMETRY 


The concepts of “energy per unit of rest mass” and “angular momentum per unit 
of rest mass” make no sense for an object of zero rest mass (photon, neutrino, even 
the graviton of exercise 35.16). However, there is nothing about the motion of such 
an entity that cannot be discovered by considering the motion of a particle of finite 
rest mass /u and going to the limit /i —*- 0. In this limit the quantities 

E= E/n 

and 

L — L/fi 


individually go to infinity; but the ratio 


( impact para¬ 
meter 


= b 


/ angular \ 
Vmomentum/ 

( linear \ 

momentum/ 


L _ L 
( E 2 - [i 2 ) 1/2 (E 2 - 1) 1/2 


(25.53) 


goes to the finite value 


Lim 

n-*o 



b . 


(25.54) 


In this limit, equation (25.41) for the shape of the orbit reduces at once to the simple 
form 
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/ 1 dr \ 2 1 - 2 M/r 1 

\r 2 d<j>/ + r 2 ~ b 2 ’ 

or 

(p 0 + ""w = ^ 

or 

Whichever way the differential equation for the orbit is written, one term in it 
depends on the choice of orbit (the term 1 /ft 2 ) the other on the properties of the 
Schwarzschild geometry, but not on the choice of orbit. This second term defines 
a kind of effective potential, 

“effective \ 

potential for j = B~ 2 (r) ==—- (25.58) (3) effective potential 

photon” / r 

No attempt is made here to take the square root, as was done for a particle of finite 
rest mass. There one took the root in order to have a quantity that reduced to the 
Newtonian effective potential (plus the rest mass) in the nonrelativistic limit; but 
for light (v = 1) there is no nonrelativistic limit. Therefore the effective potential 
(25.58) is plotted directly in Box 25.7, and used there to analyze some of the principal 
features of the orbits of a photon in Schwarzschild geometry. 

On occasion it has proved useful to plot as a function of r, not the “effective 
potential” of (25.58), but the “potential impact parameter B(r )” calculated from that 
formula [see, for example, Power and Wheeler (1957), Zel’dovich and Novikov 
(1971)]. This potential impact parameter has the following interpretation: A ray, 
in order to reach the point r, must have an impact parameter b that is equal to 
or less than B(r): 


(25.55) 


(25.56) 


b < B(r) (“condition of accessibility”). (25.59) 

A ray with zero impact parameter (head-on impact), or any impact parameter less 

than b crit = min[i?(r)] = 3\/3Af, can get to any and all r values. (4) critical impact parameter 

The beautifully simple “effective potential” defined by (25.58) is used in (25.56) 
to deterinme_the__shape of an orbit; rhai is, the azimuth <f> that the photon has when 
it gets to a given r-value. In other connections, it can be equally interesting to know 
when, or at what Schwarzschild coordinate time, the photon gets to a given r value. 

More broadly, the geodesic of a photon, for which proper time has no meaning, (5) affine parameter 
admits of analysis from first principles by way of an affine parameter A, as contrasted 
with the device of first considering a particle and then going to the limit fi — y- 0. 

(comin tied on page 676) 
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Box 25.7 


QUALITATIVE ANALYSIS OF ORBITS OF A PHOTON 
IN SCHWARZSCHILD GEOMETRY 


A. Equations Governing Orbit 

1. Effective-potential equation for radial part of motion: 

B~\r) — r -2 ( 1 — 2 M/r)\ 
b — (impact parameter). 

2. Supplementary equations to determine angular and time motion: 

d<j>/d\ = 1/r 2 ; 
dt/dX = b-\ 1 - 2M/r)~ 1 . 



B. Qualitative Features of Orbits 

(deduced from effective-potential diagram) 

1. A zero-mass particle with b > 3 \/3M, which falls in from r — oo, is “reflectedr 
off the potential barrier” (periastron; b — B\ dr/dX = 0) and returns to 
infinity. 

a. For b > 3 \/3M, the orbit is a straight line, except for a slight deflection of 
angle 4 M/b (exercise 25.21; §40.3). 

b. For 0 < b — 3 \/3M < M, the particle circles the star many times (“un¬ 
stable circular orbit) at r tz 3 M before flying back to r — oo. 
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2. A zero-mass particle with b < 3 V3Af, which falls in from r — oo, falls into 
r — 2M (no periastron). 

3. A zero-mass particle emitted from near r = 2M escapes to infinity only if 
it has b < 3\/3Af; otherwise it reaches an apastron and then gets pulled 
back into r — 2M. 


C. Escape Versus Capture as a Function of Propagation Direction 

An observer at rest in the Schwarzschild gravitational field measures the ordi¬ 
nary velocity of a zero-mass particle relative to his orthonormal frame [equa¬ 
tions (23.15)]: 

jgJ 1 ' 2 dr/d^ _ b 2/ B 2y/2. 

T \ g 00 \v*dt/dk ,a) ' 

, I gjWdk 

(Vf) 2 + (^) 2 = 1; 

8 = (angle between propagation direction and radial direction) 

= cos” 1 v- = sin -1 

To be able to cross over the potential barrier, the particle must have b < 3 \/3 M, 
or v^ 2 B 2 < 27Af 2 , or sin 2 8 < 21M 2 /B 2 . This result, restated: 

1. A particle of zero rest mass at r < 3 M will eventually escape to infinity, rather 
than be captured by a black hole at r — 2 M if and only if v r is positive and 

sin S < 3 V3MB-\r). 

2. A particle of zero rest mass at r > 3 M will eventually escape to infinity if and 
only if: (1) v T is positive, or (2) v T is negative and 

sin 8 > 3 VlMB-'ir). 


White, escape; black, to black hole; 
directions in proper reference frame 


= r~ l d/d$ 


•* 


L 


e T = (1 - 2 M/r) 1 ' 2 d/dr \ 
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Return to the statement of the conservation laws (25.17) and (25.18) in the form 
that makes reference to the affine parameter X but no reference to the rest mass 
Ia; thus 


d<j> L 

dX r 2 


(25.60) 


(6) equations for orbit 


and 


dt_ _ E 
dX 1 - 2 M/r ' 


(25.61) 


Recall that the course of a photon in a gravitational field is governed by its direction 
but not by its energy. Therefore neither E nor L individually are relevant but only 
their ratio, the impact parameter b = L/E of (25.54) and exercise 25.14. This 
circumstance leads one to replace the affine parameter X by a new affine parameter, 


= LX, 


(25.62) 


that is equally constant along the world line of the photon. In this notation (drop 
the subscript “new” hereafter), the conservation laws take the form 


d$__ 1 
~dX~7 2 ' 


(25.63) 


dt _ 1 

dX b( 1 - 2 M/r)' 


(25.64) 


The statement that the world line of the photon is a line of zero lapse of proper 
time, 


dxL/_ dx^ 
gap dX dX 


= 0 


(25.65) 


leads to the “radial equation” 

(|0 2 + B-\r) = b~ 2 . (25.66) 

Here one encounters again the “effective potential” B~ 2 (r) of (25.58). The present 
fuller set of equations for the geodesic of a photon have the advantage that they 
reach beyond space to a description of the world line in spacetime. 

Return to space! Figure 25.6 shows typical orbits for a photon in Schwarzschild 
geometry. Figure 25.7 shows angle of deflection as a function of impact parameter. 
(7) scattering cross section From the information contained in this curve, one can evaluate the contributions 

to the differential scattering cross section 


do 

~d@ 


2 

“branches” 


2t rb db 
2m sin 0 dB 


(25.67) 


from the various “branches” of the scattering curve of Figure 25.7 [one turn around 
the center of attraction, two turns, etc.; for more on these branches and the central 




Figure 25.6. 

The orbit of a photon in the “equatorial plane” of a black hole, 
plotted in terms of the Schwarzschild coordinates r and <£, for selected 
values of the turning point of the orbit, r TP /Af = 2.99, 3.00 (unstable 
circular orbit), 3.01, 3.5, 4, 5, 6, 7, 8, 9. The impact parameter is given 
by the formula b = r TP (l — 2 Af/r TP )“ 1/2 . In none of the cases shown, 
even for the inward plunging spiral, is the impact parameter less than 
6 crit = (27) 1/2 M, nor are any of these orbits able to cross the circle 
r = 3 M. That only happens for orbits with b less than b^ For such 
orbits there is no turning point; the photon comes in from infinity 
and ends up at r = 0: straight in for b = 0 (head-on impact); only 
after many loops near r = 3M, when b/M = (27) 1/2 - e, where e is 
a very small quantity. Appreciation is expressed to Prof. R. H. Dicke 
for permission to publish these curves, which he had a digital calcu¬ 
lator compute and plot out directly from the formula d 2 u/d^ 2 = 
3w 2 — «, where u = M/r, 


role of the deflection function 0 = 0(b) in the analysis of scattering, see, for example, 
Ford and Wheeler (1959a,b)]. For small angles the “Rutherford” part of the scatter¬ 
ing predominates. The major part of the small-angle scattering, and in the limit 
0 —^ 0 all of it, comes from large impact parameters, for which one has 



(25.68) 


(see exercises 25.21 and 25.24). It follows that the limiting form of the cross section 
is 



% - (W < SmaU < 25 ' 69 > 

Also, at 0 — 77 one has a singularity in the differential scattering cross section, with 
Ihe^char^ter^of a 'glory [see discussion following equation (25.44)]. Writing down 
the contributions of the several branches of the scattering function to the differential 
cross section, and summing them, one has, near 0 — n r, 

A- = -df— (1.75 + 0.0029 + 0.0000055 +•••) = 1.75 (25.70) 

aJ/ 7 T — fj - 77 — 0 

Thus, in principle, if one shines a powerful source of light onto a black hole, one 
gets a direct return of a few photons from it. Equation (25.70) provides a means 
to calculate the strength of this return. See exercise 2526. 
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Figure 25.7. 

Deflection of a photon by a Schwarzschild black hole, or by any spherically symmetric center of attraction 
small enough not to block the trajectory of the photon. The accurate calculations (smooth curves) are 
compared with formulas (dashed curves) valid asymptotically in the two limiting cases of an impact 
parameter, b : (1) very close to b crit = 3 3/2 M (many turns around the center of attraction); and (2) very 
large compared to b crlt (small deflection). The algorithm for the accurate calculation of the deflection 
proceeds as follows (all distances being given, for simplicity, in units of the mass value, M). (1) Choose 
a value, r = R, for the Schwarzschild coordinate of the point of closest approach. (2) Calculate the im¬ 
pact parameter, b, from b 2 = R 3 /(R - 2). (3) Calculate Q from Q 2 = (R - 2 )(R + 6). (4) Determine 
the modulus, k, of an “elliptic integral of the first kind” from sin 2 0 = k 2 = (Q — R + 6)/2Q. (5) De¬ 
termine the so-called amplitude $ — <p miD of the same elliptic function from sn 2 w min = sin 2 £ min = 
(2 + Q — R)/(b + Q — R). (6) Then the total deflection is 

0 = 4(R/Q)^lF^/2 t 6) - F(* min , *)]-*. 

The values plotted here were kindly calculated by James A. Isenberg on the basis of the work of C. G. 
Darwin (1959, 1961). 


gravitational lens effect When the source of illumination, instead of being on the observer’s side of the 

black hole, is on the opposite side, then in addition to the “lens effect” experienced 
by photons flying by with large impact parameter [literature too vast to summarize 
here, but see, e.g., Refsdal (1964)], and subsumed in equation (25.68), there is a 
glory type of illumination (intensity — 1/sin G , with now, however, G close to zero) 
received from photons that have experienced deflections G = 2tt, 4-77,.... This 
illumination comes from “rings of brightness” located at impact parameters given 

by b/M — 3 3/2 = 0.0065, 0.000012,-Interesting though all these optical effects 

are as matters of principle, they are, among all the ways to observe a black hole, 
the worst; see part VI, C, of Box 33.3 for a detailed discussion. 
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Exercise 25.23. QUALITATIVE FEATURES OF PHOTON ORBITS 

Verify all the statements about orbits for particles of zero rest mass made in Box 25.7. 

Exercise 25.24. LIGHT DEFLECTION 

Using the dimensionless variable u — M/r in place of r itself, and u b = M/b in place of 
the impact parameter, transform (25.55) into the first-order equation 

+ (1 ~ 2u)u2 = (25.71) 

and thence, by differentiation, into 


+ u = 3 u 2 . (25.72) 

(a) In the large-impact-parameter or small-w approximation, in which the term on the 
right is neglected, show that the solution of (25.72) yields elementary rectilinear motion (zero 
deflection). 

(b) Insert this zero-order solution into the perturbation term 3 u 2 on the righthand side 
of (25.72), and solve anew for u (“rectilinear motion plus first-order correction”). In this 
way, verify the formula for the bending of light by the sun given by putting /? = 1 in equation 
(25.49). 

Exercise 25.25. CAPTURE OF LIGHT BY A BLACK HOLE 

Show that a Schwarzschild black hole presents a cross section a capt = IItiM 2 for capture 
of light. 

Exercise 25.26. RETURN OF LIGHT FROM A BLACK HOLE 

Show that flashing a powerful pulse of light onto a black hole leads in principle to a return 
from rings of brightness located at b/M — 3 3/2 = 0.151, 0.00028,.... How can one evaluate 
the difference in time delays of these distinct returns? Show that the intensity / of the return 
(erg/cm 2 ) as a function of the energy £ 0 (erg/steradian) of the original pulse, the mass M( cm) 
of the black hole, the distance R to it, and the lateral distance r from the “flashlight” to 
the receptor of returned radiation is 

I — -Esl y 

R 3r 9 = <2lf+Xh t 

under conditions where diffraction can be neglected. 


2b db 


dO 


R 3 r 


-(1.75 + 0.0029 + 0.0000055 + .. • ) 


§25.7. SPHERICAL STAR CLUSTERS 

By combining orbit theory, as developed in this chapter, with kinetic theory in curved 
spacetime as developed in §22.6, one can formulate the theory of relativistic star 
clusters. 

Consider, for simplicity, a spherically symmetric cluster of stars (e.g., a globular 
cluster, but one so dense that relativistic gravitational effects might be important). 


EXERCISES 


Static, spherical star clusters: 
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Demand that the cluster be static, in the sense that the number density in phase 
space 91 is independent of time. (New stars, flying along geodesic orbits, enter a 
fixed region in phase space at the same rate as “old” stars leave it.) Ignore collisions 
and close encounters between stars; i.e., treat each star’s orbit as a geodesic in the 
spherically symmetric spacetime of the cluster as a whole. 

With these idealizations accepted, one can write down a manageable set of equa¬ 
tions for the structure of the cluster.* Since the cluster is static and spherical, so 
(1) foundations for analysis must be its gravitational field. Consequently, one can introduce the same kind of 

coordinate system (“Schwarzschild coordinates”) as was used for a static spherical 
star in Chapter 23: 

ds 2 — — e 2<p dt 2 + e 2A dr 2 + r 2 dQ 2 \ <p — <P(r\ A — A(r). (25.73) 

In the tangent space at each event in spacetime reside the momentum vectors of 
the swarming stars. For coordinates in this tangent space (“momentum space”), it 
is convenient to use the physical components of 4-momentum, p a —i.e., components 
on the orthonormal frame 

uj 1 — e* dt , (ju t — e A dr , cu § = rdQ, <v* = r sin 9 d§. (25.74) 

Then the number density of stars in phase space is a spherically symmetric, static 
function 

% = 9l[r , p^ 9 p\ ( p° 2 + p* 2 ) 1 ' 2 ]. (25.75) 



(2) solution of Viasoff 
equation 


[91 is independent of t because the cluster is static; and independent of 9 , <f>, and 
angle 0 — tan“ 1 (/?V/ , 0 because of spherical symmetry.] 

The functions describing the structure of the cluster, <P, A , and % are determined 
by the kinetic (also, in this context, called the Viasoff) equation (§22.6) 

d9l/d\ — 0, i.e., % conserved along orbit 

of each star in phase space; ^ * ' 

and by the Einstein field equations 

G a!i = 8ir = SttJ (dCptpfy-i dp 6 dp*dp* dp*. (25.76b) 

[The Viasoff equation for Newtonian star clusters is treated by Ogorodnikov (1965). 
The above expression for the stress-energy tensor of a swarm of particles (stars) was 
derived in exercise 22.18. Here, as in exercise 22.18, the particles (stars) are assumed 
not all to have the same rest mass. Note that rest mass is here denoted /a, but in 
Chapter 22 it was denoted m.] 

To solve the Viasoff equation, one need only note that % is conserved along stellar 
orbits and therefore must be a function of the constants of the orbital motion. There 
is a constant of motion corresponding to each Killing vector in the cluster’s static, 
spherical spacetime (see exercise 25.8): 


These equations were first derived and explored by Zel’dovich and Podurets (1965). 
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E = “energy at infinity” = —p • (3/0/) = —p 0 , 

L z = “z-component of angular momentum” = p * f x = p • (d/d</>) = p^ 

L y = “/-component of angular momentum” = p • f y , (25.77a) 

L x = “x-component of angular momentum” = p • ( x . 

In addition, each star’s rest mass 

n = (p 62 - p r2 - p i2 - p* 2)1/2 (25.77b) 

is a constant of its motion. The general solution of the Vlasoff equation, then, has 
the form 

% = H(E, L t , L y ,4,ja). 

But this general solution is not spherically symmetric. For example, the distribution 
function 


a = H(E,p,L z )8(L v ) 5(4), 

corresponds to a cluster of stars with orbits all in the equatorial plane 6 = 
tt/ 2 (4 = 4 = 0 for all stars in cluster). To be spherical the cluster’s distribution 
function must depend only on the magnitude 

L = (4 2 + 4 2 4 4 2 ) 1/2 


of the angular momentum, and not on its direction (not on the orientation of a star’s 
orbital plane). Thus, the general spherical solution to the Vlasoff equation in a static, 
spherical spacetime must have the form 

% = F(E 9 L,p). (25.78) 

To use this general solution, one must reexpress the constants of the motion E, 
L, ju, in terms of the agreed-on phase-space coordinates ( t , r, 0, <$>,p®,p r ,p* y p*). The 
rest mass of a star is given by (25.77b). The energy-at-infinity is obtained by red- 
shifting the locally measured energy 

E= -p 0 = <?*/• (25.79a) 

For an orbit in the equatorial plane (p e = p e = p e = 0; L x = L y = 0), the total 
angular momentum has the form 

L = |4I = \p<p\ — | rp*\ = r X (“tangential” component of 4-momentum). 

By symmetry, the equation L = r X (“tangential” component of p) must hold true 
also for orbits in other planes; it must be perfectly general: 

L = rp f , (25.79b) 

p* = (tangential component of 4-momentum) = [{p' e ) 2 + (p*) 2 ] y/2 (25.80) 


(see exercise 25.9). 
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(3) "smeared-out" Before solving the Einstein field equations, one finds it useful to reduce the 

tTstars"^^ tenS ° r dUe stress-energy tensor to a more explicit form than (25.76b). The off-diagonal compo¬ 
nents T®’ and T& (j ^ k) all vanish because their integrands are odd functions of 
pK The integrands for the diagonal components T®®> T™, and -f T^) are 
independent of angle 6 = tan _1 (/?^/p*) in the tangential momentum plane; so the 
momentum volume element can be rewritten as 


dp® dp r dp e dp^ — I'lrpf dp^ dp T dp®. 

Changing variables from (p f ,p T ,p®) to (p*,p,p®) where 

M = i(P 6 ) 2 ~ (pi 2 ~ (. p*) 2 ) 1 ' 2 , 

and recognizing that two values of p r (±:/? r ) correspond to each value of j a, one 
brings the volume element into the form 


IrrpT dp f dp r dp® — 4ir(p*ft/p*) dp f dp® dfx. 

The diagonal components of T [equation (25.76b)] then read 
p = T°® = (total density of mass-energy) 

= 4 "f F(e*p°, rp f , p)(p 02 p t /p r ) dp f dp° dp, 

P t = — (T^ + T**) = 7'*® = T** = (tangential pressure) 




= 2 t rf F(e‘ t p°, rp f , p)[(p i ) z /p r ] dp f dp 0 dp, 

P r = 7’ rT = (radial pressure) 

= 4 tt f F(e*p 6 , rp*, p)(p T p^) dp f dp° dp. 


(25.81a) 


-[by spherical symmetry] (25.81b) 


(25.81c) 


When performing these integrals, one must express p T in terms of the variables of 
integration, 

P r = [(p 6 ) 2 ~ (p 1 ) 2 ~ P 2 V 2 - (25.8Id) 

(4) solution of field equations The Einstein field equations for this stress-energy tensor and the metric (25.73), 

after use of expressions (14.43) for G a & and after manipulations analogous to those 
for a spherical star (§23.5), reduce to 


e 2A = (1 — 2 m/r)' 1 , m = f 4irr 2 pdr; 

J o 

d<P m + 4 rrr 3 P T 
dr r{r — 2m) 


(25.82a) 

(25.82b) 


These equations, together with the assumed form F{E, L, p) of the distribution 
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function and the integrals (25.81) for p, P r and determine the structure of the 
cluster. Box 25.8 gives an overview of these structure equations, and specializes them 
for an isotropic velocity distribution. Box 25.9 presents and discusses the solution 
to the equations for an isothermal star cluster (truncated Maxwellian velocity distri¬ 
bution). 


Exercise 25.27. ISOTROPIC STAR CLUSTER EXERCISES 

For a cluster with distribution function independent of angular momentum, derive properties 
B.l to B.6 of Box 25.8. 

Exercise 25.28. SELF-SIMILAR CLUSTER [See Bisnovatyi-Kogan and Zei'dovich 
(1969), Bisnovatyi-Kogan and Thorne (1970).] 

(a) Find a solution to the equations of structure for a spherical star of infinite central density, 
with the equation of state P — yp, where y is a constant (0 < y < 1/3). 

(b) Find an isotropic distribution function F(E , p) that leads to a star cluster with the 
same distributions of p, P, m , and 0 as in the gas sphere of part (a). (See Box 25.8.) [Answer: 

y 2 J 

P ^ 1 *+■ 6y 4- y 2 lirr 2 ' 

e 2A = (1 - 2 m/r)- 1 = (1 + 6y + y 2 )/(l + y) 2 , 
e 2<p _ Br 4y/(i + y) 9 B _ const; 

F = AiE/B^r^y 5(p - jt 0 ) = ^”2(£ local) -a + Y)/y A = const] 

Exercise 25.29. CLUSTER WITH CIRCULAR ORBITS 

What must be the form of the distribution function to guarantee that all stars move in circular 
orbits? Specialize the equations of structure to this case. Analyze the stability of the orbits 
of individual stars in the cluster, using an effective-potential diagram. What conditions must 
the distribution function satisfy if all orbits are to be stable? [See Einstein (1939), Zapolsky 
(1968).] 


- ^ 

Box 25.8 EQUATIONS OF STRUCTURE FOR A SPHERICAL STAR CLUSTER 

A. To Build a Model for a Star Cluster, Proceed as Follows 

1. Specify the distribution function % = F(E, L, ja), where 


E = energy-at-infinity of a star, 

L = angular momentum of a star, 
ja = rest mass of a star. 


2. Solve the following two integro-differential equations for the metric functions 
m = \r( 1 — e~ 2A ) and 0 of the line element 



Box 25.8 (continued) 


ds 2 = —e 2<p dt 2 + e 2A dr 2 + r 2 dQ 2 : 


B. 


m = I 4wr 2 p rfr, 

J o 

d0 m + 4irr 3 P r 
dr ~ r(r - 2m) ' 

where 

p = 4 tt J F(e t ’p°, rp t ,p)[{p°) 2 p f /p r ] dp f dp 0 dp, 

P t -2ttJ F(e 0 p°, rp f ,p)[{p i ) 3 /p T ] dp f dp 0 dp, 

P r = 4w J F(e 0 p°, rp f , p)(p r p f ) dp f dp 0 dp, 
pr = [(p*f - {ptf - P 2 Y' 2 . 

The integrations for p, P^ and P r go over all positive p f , p°, p for which 

O' 6 ) 2 - ip 1 ) 2 - p 2 > 0. 


If the Distribution Function is Independent 

of Angular Momentum, Then 

1 .F = F(E,p). 

2. The distribution of stellar velocities at each point in the cluster is isotropic. 

3. p = 4 «SF{*p\Mp*f ~ P 2 ] 1/2 (p 6 ) 2 dp 6 dp. 

4. The pressure is isotropic: 

p r = p T =p=iff F( e y, p)(p ° 2 - p 2 r /2 d P o dp. 

5. The total density of mass-energy p, the pressure P, and the metric functions 
<P and m = |r(l — e~ 2A ) satisfy the equations of structure for a gas sphere 
(“star”), 

m = I 4flr 2 p dr, 

d<P m + 4trr 3 P 
~dr = r(r — 2m) ’ 

dP _ (p + P)(m + 4-nr 3 P) 
dr ~ r(r — 2m) 

6. Thus, to every static, spherical star cluster with isotropic velocity distribution, 
there corresponds a unique gas sphere that has the same distributions of 
p, P, m, and 0. 

7. Conversely [see Fackerell (1968)], given a gas sphere (solution to equations 
of stellar structure for p, P, m, and <P), one can always find a distribution 
function F(E, p) that describes a cluster with the same p, P, m, and 0. But 
for some gas spheres F is necessarily negative in part of phase space, and 
is thus unphysical. 

_> 
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Box 25.9 ISOTHERMAL STAR CLUSTERS 


~\ 


A. Distribution Function 


1. In any relativistic star cluster, one might expect that occasional close en¬ 
counters between stars would “thermalize” the stellar distribution function. 
This suggests that one study isotropic, spherical clusters with the Boltzmann 
distribution function (tacitly assumed zero for p° = Ee~ 0 < ju 0 ) 

* = F(E , L, ju) = Ke~ E/T S(fi - /to). (1) 

Here AT is a normahzation constant, T is a constant “temperature,” and for 
simplicity the stars are all assumed to have the same rest mass ju 0 . 

2. In such a cluster, an observer at radius r sees a star of energy-at-infinity 
E to have locally measured energy 


p® — (rest mass-energy) -f (kinetic energy) = —- ^ 2 y ./2 = (2) 

Consequently, the stars in his neighborhood have a Boltzmann distribution 
d zp d £i dll = % = K exp (~p 6 /T ]0C ) % - N ) (3) 

with locally measured temperature 


7 IOC (r) = Te-^\ (4) 

Thus, the temperature of the cluster is subject to identically the same red- 
shift-blueshift effects as photons, particles, and stars that move about in the 
cluster. (For a derivation of this same temperature-redshift law for a gas 
in thermal equilibrium, see part (e) of exercise 22.7.) 

3. Actually, the Boltzmann distribution (1) can never be achieved. Stars with 
E > n 0 are gravitationally unbound from the cluster and will escape. The 
Boltzmann distribution presumes that, as such stars go zooming off toward 
r — oo, an equal number of stars with the same energies come zooming in 
from r — oo to maintain an unchanged distribution function. Such a situation 
is clearly unrealistic. Instead, one expects the escape of stars to truncate the 
distribution at some energy £ max slightly less than p 0 . The result, in idealized 
form, is the “truncated Boltzmann distribution” 




% 


= F(E, L,n) = 


Ke~ E ' T Kii - N ), E < £ max . 


0 , 


E > £, 


'max' 


(5) 
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Box 25.9 (continued) 


B. Structure and Stability of Cluster Models 


1. Models for star clusters with truncated Boltzmann distributions have been 
constructed by Zel’dovich and Podurets (1965), by Fackerell (1966), and by 
Ipser (1969), using the procedure of Box 25.8. Ipser has analyzed the colli¬ 
sionless radial vibrations of such clusters. 

2. In general, these clusters form a 4-parameter family (AT, T, /u 0 , £ max ). Replace 
the parameter K by the total rest mass of the cluster, M 0 — \i 0 N , where N 
is the total number of stars. Replace T by the temperature per unit rest 
mass, T = T/\i 0 . Replace £ max by the maximum energy per unit rest mass, 
£ max = E mBX /fi 0 . Then the clusters are parametrized by (A/ 0 , f, p 0 , £ max ). 
When one now doubles j u 0 , holding A/ 0 , 7, £ max fixed (and thus halving the 
total number of stars), all macroscopic features of the cluster remain un¬ 
changed. In this sense ju 0 is a “trivial parameter” and can henceforth be 
ignored or changed at will. The total rest mass of the cluster M 0 can be 
regarded as a “scaling factor”; all dimensionless features of the cluster are 
independent of it. For example, if p c is the central density of mass-energy 
[equation (25.81a), evaluated at r — 0], then p c Af 0 2 is dimensionless and is 
thus independent of M 0 , which means that p c oc Af 0 ~ 2 . Only two nontrivial 
parameters remain: T and £ max . 

3. Consider as an instructive special case [Zel’dovich and Podurets (1965)] the 
one-parameter sequence with £ max = 1 —\T. The following figure, com¬ 
puted by Ipser (1969), plots for this sequence the fractional binding energy, 

E hind /M 0 = (M 0 -M)/M 0 (6) 

(here M is total mass-energy); the square of the angular frequency for 
collisionless vibrations (vibration amplitude oc e“ itJ *) divided by central 
density of mass-energy, w 2 /p c ; and the redshift, z c , of photons emitted from 
the center of the cluster and received at infinity. All these quantities are 
dimensionless, and thus depend only on the choice of T — T/fx 0 . 

4. Notice that all models beyond the point of maximum binding energy 
(z c > 0.5) are unstable against collisionless radial perturbations (a? imagi¬ 
nary; amplitude of perturbation oc e jto1 *). When perturbed slightly, such 
clusters must collapse to form black holes. (See Chapter 26 for an analysis 
of the analogous instability in stars). 

5. These results suggest an idealized story of the evolution of a spherical cluster 
[Zel’dovich and Podurets (1965); Fackerell, Ipser, and Thome (1969)]. The 
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r ~\ 



cluster would evolve quasistatically along a sequence of spherical equilibrium 
configurations such as those of the figure. The evolution would be driven 
by stellar collisions and by the evaporation of stars. When two stars collide 
and coalesce, they increase the cluster's rest mass and hence its fractional 
binding energy. When a star gains enough energy from such encounters to 
escape from the cluster, it carries away excess kinetic energy, leaving the 
cluster more tightly bound. Thus, both collisions and evaporation should 
drive the cluster toward states of tighter and tighter binding. When the cluster 
reaches the point, along its sequence, of maximum fractional binding energy, 
it can no longer evolve quasistatically. Relativistic gravitational collapse sets 
in: the stars spiral inward through the gravitational radius of the cluster 
toward its center, leaving behind a black hole with, perhaps, some remaining 
stars orbiting it. 

It is tempting to speculate that violent events in the nuclei of some galaxies 
and in quasars might be associated with the onset of such a collapse, or 
with encounters between an already collapsed cluster (black hole) and 
surrounding stars. 


V 





CHAPTER 


STELLAR PULSATIONS 




§26.1. MOTIVATION 

In relativistic astrophysics, as elsewhere in physics, most problems of deep physical 
interest are too difficult and too complex to be solved exactly. They can be solved 
only by use of approximation techniques. And of all approximation techniques, the 
one that has the widest range of application is perturbation theory. 

Perturbation calculations are typically long, tedious, and filled with complicated 
The raison d'etre of this mathematical expressions. Therefore, they are not appropriate for a textbook such 

cha P ter as this. Nevertheless, because it is so important that aspiring astrophysicists know 

how to set up and carry out perturbation calculations in general relativity, the authors 
have chosen to present one example in detail. 

The example chosen is an analysis of adiabatic, radial pulsations of a nonrotating, 
relativistic star. Two features of this example are noteworthy: (1) it is sufficiently 
complex to be instructive, but sufficiently simple to be presentable; (2) in the results 
of the calculation one can discern and quantify the relativistic instability that is so 
important for modem astrophysics (see Chapter 24). 

The calculation presented here is patterned after that of Chandrasekhar (1964a,b), 
which first revealed the existence of the relativistic instability. For an alternative 
calculation, based on the concept of “extremal energy,” see Appendix B of Harrison, 
Thome, Wakano, and Wheeler (1965); and for a calculation based on extremal 
entropy, see Cocke (1965). 


This chapter is entirely 
Track 2 , but it neither depends 
on nor prepares for any other 
chapter. 


The authors are deeply indebted to Mr. Carlton M. Caves, who found and corrected many errors in 
the equations of this chapter and of a dozen other chapters. 
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§26.2. SETTING UP THE PROBLEM 

The system to be analyzed is a sphere of perfect fluid, pulsating radially with very 
small amplitude. To analyze the pulsations one needs (a) the exact equations gov¬ 
erning the equilibrium configuration about which the sphere pulsates; (b) a coordi¬ 
nate system for the vibrating sphere that reduces, for zero pulsation amplitude, to 
the standard Schwarzschild coordinates of the equilibrium sphere; (c) a set of small 
functions describing the pulsation (radial displacement and velocity, pressure and 
density perturbations, first-order changes in metric coefficients), in which to linearize; 
and (d) a set of equations governing the evolution of these “perturbation functions.” 


a. Equilibrium Configuration 

The equations of structure for the equilibrium sphere are those summarized in §23.7. 
It will be useful to rewrite them here, with a few changes of notation (use of subscript 
"o n to denote “unperturbed configuration”; use of A = —^ln(l — 2m/r) in place 
of m in all equations; use of a prime to denote derivatives with respect to r): 

ds 2 = -e 2 *’ dt 2 + e 2A o dr 2 + r\d0 2 + sin 2 # d<f> 2 ) y (26.1a) 

A 0 ' - ^(1 - e 2A <>) 4- 4? Trp 0 e 2A o, (26.1b) 

Po = -(Po + Po)*o> ( 261c ) 

<p o ' = - -L (l - (? A o) + 4irrp 0 e? Ao . (26.1 d) 


b. Coordinates for Perturbed Configuration 

-The gas sphere pulsates in a radial, i.e., spherically symmetric, manner. Conse¬ 
quently, its spacetime geometry must be spherical. In Box 23.3 it is shown that for 
any spherical spacetime, whether dynamic or static, one can introduce Schwarzschild 
coordinates with a line element 

ds 2 = — e 2<p dt 2 -f e 2A dr 2 4- r 2 (dd 2 + sin 2 # d<f> 2 ), (262) 

<p — <p(t, r), A = A(t, r). 

One uses these coordinates for the^idsating^hexe because they reduce to the 
--^iftpex^urb«4-coordinates'wliefi the pulsations have zero amplitude. 


c. Perturbation Functions 

When the pulsations have very small amplitude, the metric coefficients, 0 and A, 
and the thermodynamic variables p , p, and n as measured in the fluid’s rest frame 


Setting up the analysis of 
stellar pulsations 


Equilibrium configuration of 
star 


Coordinates for perturbed 
configuration 
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Perturbation functions 


How to derive equations 
governing the perturbation 
functions 


Eulerian perturbations 
defined 

Lagrangian perturbations 
defined 


have very nearly their unperturbed values. Denote by 80, 8A, 8p , 8p, and 8n the 
perturbations at fixed coordinate locations: 

HU r) = 0 o (r) + 8<p (U r), A(t , r) = A 0 (r) 4 8A(t , r), 

0 = Po( r ) + S P(U r), p(t , r) = p 0 (r) 4- 5p(f, /*), (26.3a) 

r ) = "oW 4- 8n(t, r). 

Besides 80, 8A, 8p, 8p, and 8n, one more perturbation function is needed to describe 
the pulsations: the radial displacement £ of the fluid from its equilibrium position: 

A fluid element located at coordinate radius r in the 
unperturbed configuration is displaced to coordinate 
radius r 4 £(r, t) at coordinate time t in the vibrat- ' ' 

ing configuration. 

To make the analysis of the pulsations tractable, all equations will be linearized in 
the perturbation functions £, 80, 8A, 8p , 8p, and 8n. 


d. Equations of Evolution 

The evolution of the perturbation functions with time will be governed by the 
Einstein field equations, the local law of conservation of energy-momentum V * T = 
0, and the laws of thermodynamics—all appropriately linearized. The analysis 
from here on is nothing but a reduction of those equations to “manageable form.” 
Of course, the reduction will proceed most efficiently if one knows in advance what 
form one seeks. The goal in this calculation and in most similar calculations is simple: 

(1) obtain a set of dynamic equations for the true dynamic degrees of freedom (only 
the fluid displacement £ in this case; the fluid displacement and the amplitudes of 
the gravitational waves in a nonspherical case, where waves are possible); and 

(2) obtain a set of initial-value equations expressing the remaining perturbation 
functions (80, 8A , 8p , 8p, and 8n in this case) in terms of the dynamic degrees of 
freedom (£). 


§26.3. EULERIAN VERSUS LAGRANGIAN PERTURBATIONS 

Before deriving the dynamic and initial-value equations, it is useful to introduce 
a new concept: the “Lagrangian perturbation” in a thermodynamic variable. The 
perturbations 8p, 8p, and 8n of equations (26.3) are Eulerian perturbations in p , p, 
and n; i.e., they are changes measured by an observer who sits forever at a fixed 
point ( t , r, 8, <j>) in the coordinate grid. By contrast, the Lagrangian perturbations — 
denoted Ap, Ap , and An —are changes measured by an observer who moves with 
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the fluid; i.e., by an observer who would sit at radius r in the unperturbed configura¬ 
tion, but sits at r + £(t, r) in the perturbed configuration: 


Ap{t, r) = p[t, r + £(/, r)] - P 0 (r) 

(26.4a) 

~&p+Pob 


Ap(t, r) = p[t, r + £(/, r)] - p 0 (r) 

(26.4b) 

~8p + p 0 '£; 


An(t, r ) = n[t, r + i{t, r)] - n 0 (r) 

(26.4c) 


Sn + n 0 % 


§26.4. INITIAL-VALUE EQUATIONS 


a. Baryon Conservation 

The law of baryon conservation, ▼ • (nu) = 0 (§22.2), governs the evolution of 
perturbations An and Sn in baryon number. By applying the chain rule to the 
divergence and using the relation u • Vn = Y u n = dn/dr , one can rewrite the con¬ 
servation law as 

dn/dr = — n(V • i/). 

t 

1 -[derivative of n along fluid world line] 


In terms of An, the perturbation measured by an observer moving with the fluid, 
this equation can be rewritten as 


d An 
dr 


-#!(▼•«#). 


(26.5) 


To reduce this equation further, one needs an expression for the fluid’s 4-velocity. 
It is readily derived from 

u r _ / dr/dr\ _ /dr\ _ _ • 

U* \ dt/dr ) \dt) along world line ^ 

{u'fe 2 * - (u r ) 2 e 2A = 1. 

The result to first order in £, SA, and 5# is 

w* = e -* = e~*o(l - 80), u r = (26.6) 

Using these components in equation (26.5), and using the relations 

J- = u = u“-2-, V ■ u =—L=(V^gu a ), a , 

dr dx a v— g 

together with the vibrating metric (26.2), one reduces equation (26.5) to a relation 
whose time integral is 

An = — n o [r- 2 e~ A °(r 2 e A °0' -f* Syl]. (26.7) 


Relation between Eulerian 
and Lagrangian perturbations 


Derivation of initial value 
equations: 

(1) for baryon perturbations 
An and 8n 
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This is the initial value equation for An in terms of the dynamic variable £. The 
initial-value equation for dn, which will not be needed later, one obtains by combin¬ 
ing with equation (26.4c). 


b. Adiabaticity 


(2) for pressure perturbations 
Ap and 5p 


For adiabatic vibrations (negligible heat transfer between neighboring fluid ele¬ 
ments), the Lagrangian changes in number density and pressure are related by 



_Tdefinition of adia- 

„batic index, T 1 


(26.8) 


Combining this adiabatic relation with equation (26.7) for An, and equation (26.4a) 
for dp in terms of Ap, one obtains the following initial-value equation for dp: 

dp = -f -j> 0 [r~ 2 e- A °(r 2 e A °£)' + 5/1] - (26.9) 


c. Energy Conservation 

(3) for density perturbations The local law of energy conservation [first law of thermodynamics; u * (V • T) — 0; 
Ap and dp see §§22.2 and 22.3] says that 

dp _ (p -f- p) dn 

dr n dr ’ 

Rewritten in terms of Lagrangian perturbations (recall: d/dr is a time derivative 
as measured by an observer moving with the fluid), this reads 

d Ap _ p + p d An 

dr n dr 

which has as its time integral (first-order analysis!) 

Ap = Po+ Po An. (26.10) 

« 0 

(The constant of integration is zero, because, when An — 0, Ap must also vanish.) 
Combining this with equation (26.7) for An and equation (26.4b) for dp in terms 
of Ap, one obtains the following initial-value equation for dp: 

Sp = ~(p 0 + p 0 )[r- 2 e~ A °(r 2 e A o£y + 6/1] - £p 0 '. 


(26.11) 
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d. Einstein Field Equations 


Two of the Einstein field equations, when linearized, reduce to initial-value equations 
for the metric perturbations Syl and 5$. The equations needed, expressed in an 
orthonormal frame 

U ) 1 = e* dt, U) T — e A dr, w* — rd8, cu* — r sin 8 d<j>, (26.12) 


(4) for metric perturbations 
and 80 


are G% — 877 -7#, and G- = 877 7~. The components of the Einstein tensor in this 
orthonormal frame were evaluated in exercise 14.16: 

j—[linearized] 

G h = 2 (A/r)e~ {A + 0) = 2r- 1 e- u * + ** ) 8A ; (26.13a) 

G% = 2(0'/r)e~ 2A + r~ 2 (e~ 2A - 1) (26.13b) 

= (G -) 0 + 2r~ x e~ 2A <> 80' - 2e~ 2A <>(2r- 1 0 o ' + r~ 2 ) 8A. 

^—[linearized] 

The components of the stress-energy tensor, 7^ = (p + p)u^u^ + prf a 0 , as calcu¬ 
lated using the 4-velocity (26.6) [transformed into the form u~ = — 1, u- = 
and using expressions (26.3a) for p and /?, reduce to 

= -(P 0 T~=p 0 + 8p. (26.14) 

Consequently, the field equation G% = 8777 ^—after integration with respect to time 
and choice of the constant of integration, so that 5^1 = 0 when £ = 0 —reduces to 

Syl = -4?r(p 0 + p 0 )re 2A o£ = -(yi; + #;)£. (26.15) 

77hs ij //ze initial-value equation for 5 A The field equation G- = 877 ^, after using 
(26.15) to remove Syl and (26.9) to remove 5/?, and (26.1c) to remove 0 O \ reduces 
to 

80' = -477,T 1 /? 0 r- 1 c 2 ^ + ^(r 2 e-^£)' (26.16) 

+ [4 t 7/? 0 > - 4 t 7( p 0 + p 0 )]e 2 H . 

77zzs zs z/ze initial-value equation for 80 . 
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The dynamic evolution of the fluid displacement £(r, r) is governed by the Euler 
equation (22.13): 

(p + p) X (4-acceleration) = -“(projection of Vp orthogonal to w). (26.17) 


Derivation of equation of 
motion for fluid displacement 
£ 


The 4-acceleration a = Y u u corresponding to the 4-velocity (26.6) in the metric (26.2) 
has as its only non-zero, linearized, covariant component: 

a r = 0 O ' + 80' + 
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Boundary conditions on fluid 
displacement 


Summary of theory of stellar 
pulsations 


[The component a t is trivial in the sense that it leads to an Euler equation that 
duplicates (26.1c).] Combining this with p + p = p o + p 0 q- Sp q- 8p, with the radial 
component p 0 ' q* 8p f for the projection of Vp, and with the zero-order equation of 
hydrostatic equilibrium (26.1c), one obtains for the Euler equation 

(Po + P 0 )e^-^l = Sp' - (Sp + 8p)0 o ' - ( Po + p 0 ) 8<P'. (26.18) 

This equation of motion is put into its most useful form by using the initial-value 
equations (26.9), (26.11), and (26.16) to reexpress 8p, Sp, and 8<P' in terms of £, and 
by then manipulating terms extensively with the aid of the zero-order equations 
of structure (26.1). The result is 

m = (pry + ql ( 26 . 19 ) 

where f is a “renormalized displacement function,” and W, P, Q are functions of 
radius determined by the structure of the equilibrium star: 

f = r 2 e-*°£; 

W=(p 0 + p 0 )r~ 2 e 3A ° + ,p °; 
p = r i p 0 r~ 2 e A ° +3 ' 1 ’ 

Q = e A.+wJA£±)L r -2 _ 4 Po 'r -3 _ 8w(Po + p')pjr-*i*A. 1 

I- Po » Po -* 

Equation (26,19) is the dynamic equation governing the stellar pulsations, [This 
equation could be written in other forms; for instance, it could be multiplied by 
W~ x or any other non-zero factor, and terms could be regrouped. The form given 
in equation (26.19) is preferred because it leads to a self-adjoint eigenvalue problem 
for the oscillation frequencies, as indicated in Box 26.1.] 

Not all solutions of the dynamic equation are acceptable. To be physically accept¬ 
able, the displacement function must produce noninfinite density and pressure 
perturbations (Sp and 8p) at the center of the sphere, which means 

(£/r) finite or zero in limit as r —0 (26.22a) 

[see (26.9) and (26.11)]; also, it must leave the pressure equal to zero at the star’s 
surface, which means 

Ap = -T^r-^o^e^y —>■ 0 as r —>■ R (26.22b) 

[surface radius]—^ 


(26.20) 

(26.21a) 

(26.21b) 

(26.21c) 


[see (26.8), (26.7), and (26.15)]. 


§26.6. SUMMARY OF RESULTS 

If an initial displacement of the fluid, £(/ = 0, r), is specified subject to the boundary 
conditions (26.22), then its subsequent evolution £(*, r ) can be calculated by inte- 
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grating the dynamic equation (26.19); and the form of the pressure, density, and 
metric perturbations can be calculated from £(*, r) using the initial-value equations 
(26.9), (26.11), (26.15), and (26.16). 

Several important consequences of these results are explored in Boxes 26.1 
and 26.2. 

(<continued on page 699) 


- \ 

Box 26.1 EIGENVALUE PROBLEM AND VARIATIONAL PRINCIPLE 
FOR NORMAL-MODE PULSATIONS OF A STAR 

Assume that the renormalized displacement function (26.20) has a sinusoidal time 
dependence: 

$ = S(r)e~ iat . 

Then the dynamic equation (26.19) and boundary conditions (26.22) reduce to 
an eigenvalue problem for the angular frequency w and amplitude f(r): 

(PS')' + QZ + = 0, (1) 

f/r 3 finite or zero as r —0, (2a) 

r&T'e**' —0 as r —>- R. (2b) 

Methods for solving this eigenvalue problem are catalogued and discussed by 
Bardeen, Thorne, and Meltzer (1966). One method (but not the best for numerical 
calculations) is the variational principle: 


co 2 = extremal value of 

f {PC 2 -QP) dr 
J o 

R 


J WC dr 


o 


(3) 


where f is varied over all functions satisfying the boundary conditions (2). [See e.g., 
§12.3 of Mathews and Walker (1965) for discussion of the equivalence between this 
variational principle and the original eigenvalue problem.] 

The absolute minimum value of expression (3) is the squared frequency of the 
fundamental mode of pulsation. If it is negative, the star is unstable ( e~ i(i3t grows 
exponentially in time). If it is positive, the star is stable against adiabatic, radial 
perturbations. Therefore, since the denominator of expression (3) is positive definite, 


stability against” 
adiabatic radial 
perturbations 



J (Pf' 2 — Q£ 2 ) dr > 0 for all functions 
0 f satisfying (2) 


(4) 
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Box 26.1 (continued) 

By numerical solution of the eigenvalue equation (1), the pulsation frequencies 
have been calculated for a wide variety of models of neutron stars and supermassive 
stars. Example: The figure gives a plot of pulsation frequency as a function of central 
density for the lowest four normal modes of the Harrison-Wakano-Wheeler models 
at the endpoint of stellar evolution. (Make a detailed comparison with Figure 24.2.) 
These curves are based on calculations by Meltzer and Thorne (1966), with correc¬ 
tions for the fundamental mode of massive white dwarfs by Faulkner and Gribbin 
(1968). 


V l.o 

u 0 



10” 1 3 


10- 3 f 
2 

10 -4 <2 


- P c , g/cm 3 
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\ Box 26.2 THE CRITICAL ADIABATIC INDEX FOR NEARLY NEWTONIAN STARS 




A. Fully Newtonian Stars 

1. For a Newtonian star that pulsates sinusoidally, £ = £(r)e~ i “ t , the dynamic 
equation (26.19) reduces to 

[r^At/r )']' + 3 WiP'M - 4 p 0 Vr + “ 2 p 0 £ = 0. (1) 

2. If r x = 4/3 throughout the star, the physically acceptable solution [solution 
satisfying boundary conditions (26.22)] for the fundamental mode of vibra¬ 
tion (mode with lowest value of u 2 ) is 

w 2 = 0, £ = er, c = const. (2) 

Thus, for r x = 4/3 the fundamental mode is “neutrally stable” and has 
a “homologous” displacement function—independent of the star’s equation 
of state or structure. 

3. If r i is allowed to differ slightly from 4/3 in an r-dependent way, then £(r) 
will differ slightly from the homologous form: 

£ = tr[l + r-dependent corrections of magnitude (T 1 - 4/3)]. 

Consequently, if one uses the homologous expression £ = tr as a trial 
function in the variational principle of Box 26.1, one will obtain w 2 accurate 
to 0{(r i - 4/3) 2 ]. (Recall: first-order errors in trial function produce 
second-order errors in value of variational expression.) The Newtonian limit 
of the variational expression [equation (3) of Box 26.1] becomes, with the 
homologous choice of trial function, 

r B 

f 3 Po r2 dr 

w 2 = (3/\ - 4)-2-g-+ 0[(3/\ - 4) 2 ], (3) 

/ Po r * dr 

where T x is the pressure-averaged adiabatic index 


f r !p 0 4irr 2 dr 
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Box 26.2 (continued) 

By use of the Newtonian virial theorem for the nonpulsating star [equation 
(39.21b) or exercise 23.7], one can convert equation (3) into the form 

= (3 r x - 4)|0|//, (5) 

where 12 is the star’s self-gravitational energy and I — f(p 0 r 2 )47rr 2 dr is the 
trace of the second moment of its mass distribution (see Box 24.2 and exercise 
39.6). 


B. Nearly Newtonian Stars 


1. When one takes into account first-order relativistic corrections (corrections 
of order M/R ), but ignores higher-order corrections, one can rewrite the 
variational expression [equation (3) of Box 26.1] in the form 



+ (3r, - 4)(rV)'](l + A 0 + 1<P 0 ) dr - f* F 0 r,* dr 
__ 

f Po r \ 1 + 3 A + </>„+ p 0 /p 0 yn 2 dr 

J o 

( 6 ) 


where 


F 0 = %TTr i p 0 p 0 + 8 rm 0 p 0 + p 0 m 0 2 , ij = f/r 3 = (£/r)(l - <P 0 ), (7) 


and m 0 (r) is the equilibrium mass inside radius r. 

2. For a relativistic star with r i — 4/3 of order M/R and with M/R < 1, the 
homologous trial function £ = er will still be highly accurate. Equally accu¬ 
rate, and easier to work with, will be £ — ire* 0 ^ er(l -f- <P 0 ), which corre¬ 
sponds to t] = e = constant. Its fractional errors will be of order M/r; and 
the errors which it produces in to 2 will be of order (M/R) 2 . By inserting this 
trial function into the variational principle (6) and keeping only relativistic 
corrections of order M/R , one obtains 


= 3(7", - r lcrit )|i?|//. (8) 

Here r i is the pressure-averaged adiabatic index, and the critical value of 
the adiabatic index f 1CTii is 


f lerit = J + OiM/R, 


( 9 ) 
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V 


with a a positive constant of order unity given by 


a 


1 R 

3 M\S2\ J 0 \ 3f>0 ~ + 


^Po~ -^4wr 2 dr. 



( 10 ) 


Expressions (8) and (9) for the pulsation frequency and the adiabatic index 
play an important role in the theory of supermassive stars (§24.4). 

3. For alternative derivations of the above result, see Chandrasekhar (1964a,b; 
1965c), Fowler (1964,1965), Wright (1964). 


J 


Exercise 26.1. DRAGGING OF INERTIAL FRAMES BY A SLOWLY EXERCISE 

ROTATING STAR 

A fluid sphere rotates very slowly. Analyze its rotation using perturbation theory; keep only 
effects and terms linear in the angular velocity of rotation. [Hints: (1) Centrifugal forces 
are second-order in angular velocity. Therefore, to first order the star is undeformed; its 
density and pressure distributions remain spherical and unperturbed. (2) Show, by symmetry 
and time-reversal arguments, that one can introduce coordinates in which 

ds 2 = -e 20 dt 2 4- e 2A dr 2 + r 2 [d6 2 + sin 2 0 d$ 2 ) (26.23) 

— 2(r 2 sin 2 0)co d<j> dt , 


where 


<P = 0(r ), A = A(r), and w = w(r, 0). (26.24) 

Show that 0 = 0 o and A = A 0 (no perturbations!) to first-order in angular velocity. 

(3) Adopt the following precise definition of the angular velocity J?(r, 9): 

= (^/^Omoving with the fluid* (26.25) 

Assuming u r = u e — 0 (i.e., rotation in the direction), calculate the 4-velocity of the fluid. 

(4) Use the Einstein field equations to derive a differential equation for the metric perturba¬ 
tion *o in terms of the angular velocity i2. (5) Solve that differential equation outside the 
star in terms of elementary functions, and express the solution for w(r, 6) in terms of the 
star’s total angular momentum 5, as measured using distant gyroscopes (see Chapter 19).] 
For the original analyses of this problem and of related topics, see Gurovich (1965), Dorosh- 
kevich, Zel'dovich, and Novikov (1965), Hartle and Sharp (1965), Brill and Cohen (1966), 
Hartle (1967), Krefetz (1967), Cohen and Brill (1968), Cohen (1968). 






PART 



THE UNIVERSE 


Wherein the reader , flushed with joy at his conquest of the 
stars, seeks to control the entira universe, and is foiled by an 
unfathomed mystery: the Initial Singularity. 



CHAPTER ^ # 

IDEALIZED COSMOLOGIES 


From my point of view one cannot arrive, by way of theory, at any at least 
somewhet relieble results in the field of cosmology, if one makes no use of the 

principle of general relativity. 
ALBERT EINSTEIN (1949b, p. 684) 


y 


§27.1. THE HOMOGENEITY AND ISOTROPY 
OF THE UNIVERSE 


Astronomical observations reveal that the universe is homogeneous and isotropic 
on scales of ~10 8 light years and larger. Taking a “fine-scale” point of view, one 
sees the agglomeration of matter into stars, galaxies, and clusters of galaxies in 
regions of size ~1 light year, ~10 6 light years, and ~3 X 10 7 light years, respec¬ 
tively. But taking instead a “large-scale” viewpoint, one sees little difference between 
an elementary volume of the universe of the order of 10 8 light years on a side 
centered on the Earth and other elementary volumes of the same size located 
elsewhere. 

Cosmology, summarized in its simplest form in Box 27.1, takes the large-scale 
viewpoint as its first approximation; and as its second approximation, it treats the 
fine-scale structure as a perturbation on the smooth, large-scale background. This 
chapter (27) treats in detail the large-scale, homogeneous approximation. Chapter 
28 considers such small-scale phenomena as the primordial formation of the ele¬ 
ments, and the condensation of galaxies out of the primeval plasma during the 
expansion of the universe. Chapter 29 discusses observational cosmology. 

Evidence for the large-scale homogeneity and isotropy of the universe comes from 
several sources. (1) There is evidence in the distribution of galaxies on the sky and 
in the distribution of their apparent magnitudes and redshifts [see, e.g., Hubble 
(1934b, 1936); Sandage (1972a); Sandage, Tamman, and Hardy (1972); but note the 
papers claiming “hierarchic” deviations from homogeneity, which Sandage cites and 
attacks]. (2) There is evidence in the isotropy of the distribution of radio sources 
on the sky [see, e.g., Holden (1966), and Hughes and Longair (1967)]. (3) There 
is evidence in the remarkable isotropy of the cosmic microwave radiation [see, e.g., 
Boughn, Fram, and Partridge (1971)]. For a review of most of the evidence, see 
Chapter 2 of Peebles (1971). 


The universe: fine-scale 
condensations contrasted 
with large-scale homogeneity 


Evidence for large-scale 
homogeneity and isotropy 


(continued on page 711) 
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27. IDEALIZED COSMOLOGIES 


Box 27.1 COSMOLOGY IN BRIEF 

Uniform density . Idealize the stars and atoms as scattered like dust through the 
heavens with an effective average density p of mass-energy everywhere the same. 

Geometry homogeneous and isotropic . Idealize the curvature of space to be every¬ 
where the same. 

Closure. Accept the term, “Einstein’s geometric theory of gravity” as including 
not only his field equation G = 877 T, but also his boundary condition of closure 
imposed on any solution of this equation.* 

A three-sphere satisfies the three requirements of homogeneity, isotropy, and 
closure, and is the natural generalization of the metric on a circle and a 2 -sphere: 


Spheres of selected 
dimensionality 

Visualized as embedded 
in a Euclidean space of 
one higher dimension a 

Transformation from 
Cartesian to polar 
coordinates 

Metric on S n expressed 
in terms of these polar 
coordinates 

S 1 

* 2 + y 2 = a 2 

x = a cos <p 
y — a sin <p 

ds 2 — a 2 d<j> 2 

S 2 

x 2 + y 2 + z 2 = a 2 

x ~ a sin 0 cos <p 
y — a sin 6 sin 
z — a cos 6 

ds 2 = a 2 (d0 2 + sin 2 # d§ 2 ) 

S 3 

x 2 + y 2 + z 2 + w 2 = a 2 

x = a sin x sin 0 cos <f> 
y = a sin x sin $ sin <p 
z — a sin x cos 6 
w = a cos $ 

ds 2 = a 2 [d\ 2 + 
sin 2 x(# 2 + sin 2 0 d<p 2 )} 


6 Excursion off the sphere is physically meaningless and is forbidden. The superfluous dimension is added to help the reason in 
reasoning, not to help the traveler in traveling. Least of all does it have anything whatsoever to do with time. 


The spacetime geometry is described by the metric 

ds 2 = —dt 2 + a 2 (t)[dx 2 + sin 2 x(d8 2 + sin 2 6 d<j> 2 )]. (1) 

The dynamics of the geometry is known in full when one knows the radius a as 
a function of the time t. 


"“‘Thus we may present the following arguments against the conception of a space-infinite, and for 
the conception of a space-bounded, universe: 

“1. From the standpoint of the theory of relativity, the condition for a closed surface is very much 
simpler than the corresponding boundary condition at infinity of the quasi-Euclidean structure of the 
universe. 

“2. The idea that Mach expressed, that inertia depends upon the mutual action of bodies, is contained, 
to a first approximation, in the equations of the theory of relativity; . . . But this idea of Mach’s 
corresponds only to a finite universe, bounded in space, and not to a quasi-Euclidean, infinite universe” 
[Einstein (1950), pp. 107-108J. 

Many workers in cosmology are skeptical of Einstein’s boundary condition of closure of the universe, 
and will remain so until astronomical observations confirm it. 
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Einstein's field equation (doubled, for convenience), 26 = 16777 ", 
force concentrated in its 00 (or tt ) component, 




has its whole 



( 2 ) 


[equation (5 a) of Box 14.5]. This component of Einstein’s equation is as central as 
the component V • E = 4irp of Maxwell’s equations. It is described in the Track-2 
Chapter 21 as the “initial-value equation” of geometrodynamics. There the two terms 
on the left receive separate names: the “second invariant” of the “extrinsic curvature” 
of a “spacelike slice” through the 4-geometry (tells how rapidly all linear dimensions 
are being stretched from instant to instant); and the “intrinsic curvature” or three- 
dimensional scalar curvature invariant (3) R of the “spacelike slice” (here a 3-sphere) 
at the given instant itself. 

The amount of mass-energy in the universe changes from instant to instant in 
accordance with the work done by pressure during the expansion, 

d [ (mass'- energy) X (volume) ] = “(Pressure) ^(volume). (3) 

Today the pressure of radiation is negligible compared to the density of mass-energy, 
and the righthand side of this equation (“work done”) can be neglected. The same 
was true in the past, one estimates, back to a time when linear dimensions were 
about a thousand times smaller than they are today. During this “matter-dominated 
phase” of the expansion of the universe, the product 

/density of \ 
xmass-energy/ 

remained a constant, 



p • 2ir 2 a 3 ~ M. ( 4 ) 

Here the symbol M can look like mass in the form of matter, and can even be called 
mass; but one has to recall again (see §19.4) that the concept of total mass-energy 
of a closed universe has absolutely no well-defined meaning whatsoever, not least 
because there is no “platform” outside the universe on which to stand to measure 
its attraction via periods of Keplerian orbits or in any other way. More convenient 
than M, because more significant in what follows, is the quantity a max (“radius of 
universe at phase of maximum expansion”) defined by 


fl max = 4M/377. 


( 5 ) 
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Box 27.1 (continued) 


The decisive component of the Einstein field equation, in the terms of this notation, 
becomes 


or 


a 2 \ dt ) a 2 a 3 



( 6 ) 


The first term in (6) has the qualitative character of “kinetic energy” in an elementary 
problem in Newtonian mechanics. The second term has the qualitative character 
of a “potential energy,” 

. V(a) = 




(see diagram A), resulting from an inverse-square Newtonian force. Pursuing the 
analogy, one identifies the “—1” on the righthand side with the total energy in the 
Newtonian problem. The qualitative character of the dynamics shows up upon an 
inspection of diagram A. Values of the radius of the universe, a , greater than a m3JL 
are not possible. If a were to become greater than < 2 max , the “potential energy” would 
exceed the total “energy” and the “kinetic energy” of expansion would have to 
become negative, which is impossible. Consequently the geometrodynamic system 
can never be in a state more expanded than a = < 2 max . Starting in a state of small 
< 3 , (i a < < 2 max ) and expanding, the universe has for each a value a perfectly definite 
da/dt value. This velocity of expansion decreases as the expansion proceeds. It falls 
to zero at the turning point a = a max . Thereafter the system recontracts. 
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Lack of option is the striking feature of the dynamics. Granted a specific amount 
of matter [specific M value in (5)], one has at his disposal no free parameter what¬ 
soever. The value of a max is uniquely specified by the amount of matter present, 
and by nothing more. There is no such thing as an “adjustable constant of energy,” 
such as there would have been in a traditional problem of Newtonian dynamics. 
Where such an adjustable parameter might have appeared in equation (6), there 
appears instead the fixed number “—1.” This fixity is the decisive feature of a system 
bound up into closure. Were one dealing with a collection of rocks out in space, 
one would have a choice about the amount of dynamite one placed at their center. 
With a low charge of explosive, one would find the rocks flying out for only a limited 
distance before gravity halted their flight and brought them to collapse together 
again. With more propellant, they would fly out with escape velocity and never 
return. But no such options present themselves here, exactly because Einstein’s 
condition of closure has been imposed; and once closed, always closed. Collapse 
of the universe is universal. This is simple cosmology in brief. 

Einstein's unhappiness at this result was great. At the time he developed general 
relativity, the permanence of the universe was a fixed item of belief in Western 
philosophy: “The heavens endure from everlasting to everlasting.” Yet the reasoning 
that led to the fixed equation left open no natural way to change that equation or 
its fantastic prediction. Therefore Einstein (1917), much against his will, introduced 
the least unnatural change he could imagine, a so-called cosmological term (§27.11), 
the whole purpose of which was to avoid the expansion of the universe. A decade 
later, Hubble (1929) verified the predicted expansion. Thereupon Einstein aban¬ 
doned the cosmological term, calling it “the biggest blunder of my life” [Einstein 
(1970)]. Thus ended the first great cycle of apparent contradiction to general relativity, 
test, and dramatic vindication. Will one ever penetrate the mystery of creation? There 
is no more inspiring evidence that the answer will someday be “yes” than man’s 
power to predict, and predict correctly, and predict against all expectations, so 
fantastic a phenomenon as the expansion of the universe. 

“Newtonian cosmology ” provides an “equation of energy” similar to that of Einstein 
cosmology, but fails to provide any clean or decisive argument for closure or for 
the unique constant “—1.” It considers the mass in any elementary spherical region 
of space of momentary radius r, and the gravitational acceleration of a test particle 
at the boundary of this sphere toward the center of the sphere; thus, 

d 2 r _ (mass) _ (477/3)pr 3 477p 

dt 2 (distance) 2 ~~ r 2 ~~ 3 r ' ^ 


Consider such imaginary spheres of varied radii drawn in the cosmological medium 
with the same center. Note that doubling the radius doubles the acceleration. This 
proportionality between acceleration and distance is compatible with a homogeneous 
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Box 27.1 (continued) 


deceleration of the expansion of the universe. Therefore define an expansion param¬ 
eter < 2 * as the ratio between the radius of any one of these spheres now and the 
radius of the same sphere at some fiducial instant; thus, a* = r/r 0 is to be considered 
as independent of the particular sphere under consideration. Write p = p 0 r 3 /r 3 , 
where p 0 is the density at the fiducial instant. Insert this expression for p into the 
deceleration equation (7), multiply both sides of the equation through by dr/dt , 
integrate, and translate the result from an equation for dr/dt to an equation for 
da*/dt , finding 



(8ttp 0 /3) 

a* 


= constant, 


( 8 ) 


in agreement with equation (6), except for (1) the trivial differences that arise because 
a* is a dimensionless expansion ratio, whereas a is an absolute radius with the 
dimensions of cm, and (2) the all-important difference that here the constant is 
disposable, whereas in standard Einstein geometrodynamics it has the unique ca¬ 
nonical value “ — 1.” For more on Newtonian insights into cosmology, see especially 
Bondi (1961). 

Free fall of a particle towards a Newtonian center of attraction according to 
Newtonian mechanics gives an equation of energy of the same form as (6), except 
that the “radius of the universe,” a , is replaced by distance, r, from the center of 



attraction. The solution of this problem of free fall is described by a cycloid (diagram 
B; see also Figure 25.3 and Box 25.4), generated by rolling a circle of diameter a max 
on a fine through an ever increasing angle tj; thus, 

a = J a ma - COS V), 

( 9 ) 

J 


* = y a max( T ? “ sinT ?)- 
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Immediately observable today is the present rate of expansion of the universe, with 
every distance increasing at a rate directly proportional to the magnitude of that 
distance:* 

( velocity of recession\ 
of a galaxy / 

""(distance to a galaxy) = (Hubble “ constant >” H ») ~ 55 km / sec megaparsec 


— or 


18 x 10 9 yr “ 1.7 x 10 28 cm 

( rate of increase of the \ 
radius of the universe itself/ _ da/dt 
(radius of the universe) a 


( 10 ) 


The Hubble time, Hj 1 ~ 18 x 10 9 yr (linearly extrapolated back to zero separation 
on the basis of the expansion rate observed today, as illustrated in the diagram) 
is predicted to be greater by a factor 1.5 or more (Box 27.3) than the actual time 
back to the start of the expansion as deduced from the rate of the development 
of stars (~ 10 X 10 9 yr). No such satisfactory concord between prediction and obser¬ 
vation on this inequality existed in the 1940’s. The scale of distances between galaxy 
and galaxy in use at that time was short by a factor more than five. The error arose 
from misidentifications of Cepheid variable stars and of HII regions, which are used 
as standards of intensity to judge the distance of remote galaxies. The linearly 
extrapolated time, 


(Hubble time) = 


(distance today) 
(recession velocity today) ’ 


back to the start of the expansion was correspondingly short by a factor more than 
five. The Hubble time came out to be only of the order of 3 X 10 9 yr. This number 
obviously violates the inequality 

3 X 10 9 yr Hubble^ > j 5 ^~10 X 10 9 yr; actual time^ 


(time 


' ' J * ’... 1 

back to start of expansion / 


It implies a curve for dimensions as a function of time not bending down, as in 
diagram B, but bending up. On some sides the proposal was made to regard the 
actual curve as rising exponentially. Thus began an era of “theories of continuous 
creation of matter,” all outside the context of Einstein’s standard geometrodynamics. 


*H 0 is predicted to be independent of the choice of galaxy insofar as local motions are unimportant, 
and insofar as the difference between recession velocity now and recession velocity at the time when 
the light was emitted is unimportant. The latter condition is well fulfilled by galaxies close enough to 
admit of the necessary measurement of distance, for they have redshifts only of the order of z — 0.1 
and less (little lapse of time between emission of light and its reception on earth; therefore little change 
in recession velocity between then and now; see §29.3 and Box 29.4 for a fuller analysis). 
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Box 27,1 (continued) 


This era ended when, for the first time, the distinction between stellar populations 
of classes I and II was recognized and as a result Cepheid variables were correctly 
identified, by Baade (1952, 1956) and when Sandage (1958) discovered that Hubble 
had misidentified as bright stars the HII regions in distant galaxies. Then the scale 
of galactic distances was set straight. Thus ended the second great cycle of an 
apparent contradiction to general relativity, then test, and then dramatic vindication. 

The mystery of the missing matter marks a third cycle of doubt and test with the 
final decision yet to come. It follows from equation (2) that, if Einstein’s closure 
boundary condition is correct, then the density of mass-energy must exceed a certain 
lower limit given by the equation 

P>P„ = ^Hl ( 11 ) 

(“critical amount of mass-energy required to curve up the geometry of the universe 
into closure”). A Hubble expansion rate of H 0 ~ 55 km/sec Megaparsec implies 
a lower limit to the density of 

p H - 8w (1.7 X 10 28 cm) 2 ° r Pw ' conv = 5 X 10 30 g/cm 3 (12) 

as compared to p ^ 2 x 10“ 31 g/cm 3 of “luminous matter” observed in galaxies 
(§29.6) and more being searched for today in the space between the galaxies. 

A fuller treatment of cosmology deals with conditions back in the past corresponding 
to redshifts of 10,000 or more and dimensions 10,000 times less than they are today, 
when radiation could not be neglected, and even dominated (§27.10). It also considers 
even earlier conditions, when anisotropy oscillations of the geometry of the universe 
as a whole (analogous to the transformation from a cigar to a pancake and back 
again) may conceivably have dominated (Chapter 30). More broadly, it takes, up the 
evolution of the universe into its present state (Chapter 28) and the present state 
and future evolution of the universe (Chapter 29). The present chapter examines 
the basic assumptions that underlie the simple standard cosmology thus traced out, 
and §27.11 examines what kinds of qualitative changes would result if one or another 
of these assumptions were to be relaxed. 

V J 
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§27.2. STRESS-ENERGY CONTENT OF THE UNIVERSE— 
THE FLUID IDEALIZATION 


By taking the large-scale viewpoint, one can treat galaxies as “particles” of a “gas” 
that fills the universe. These particles have internal structure (stars, globular clusters, 
etc.); but one ignores it. The “particles” cluster on a small scale (clusters of galaxies 
of size <3 x 10 7 light years); but one ignores the clustering. To simplify calculations, 
one even ignores the particulate nature of the “gas” [though one can take it into 
account, if one wishes, by adopting a kinetic-theory description; see §22.6 for kinetic 
theory, and Ehlers, Geren, and Sachs (1968) for its application to cosmology]. One 
removes the particulate structure of the gas from view by treating it in the perfect- 
fluid approximation. Thus, one characterizes the gas by a 4-velocity, u (the 4-velocity 
of an observer who sees the galaxies in his neighborhood to have no mean motion), 
by a density of mass-energy, p (the smoothed-out density of mass-energy seen in the 
frame with 4-velocity u; this includes the rest mass plus kinetic energy of the galaxies 
in a unit volume, divided by the volume), and by a pressure p (the kinetic pressure 
of the galaxies). The stress-energy tensor for this “fluid of galaxies” is the familiar 
one 


T = (p + p)u ® u + gp, 


(27.1) 


where g is the metric tensor. 

Astronomical observations reveal that the rest-mass density of the galaxies is much 
greater than their density of kinetic energy. The typical ordinary velocities of the 
galaxies—and of stars in them—relative to each other are 

(v) ~ 200 km/sec ~ 10~ 3 . (27.2) 

Consequently, the ratios of kinetic-energy density and of pressure to rest-mass den¬ 
sity are 


WP rl n = Y<f 2 > = 10- 6 , 

P/Prm = J<f 2 > = 10~ 6 . 


(27.3) 


At least, these are the ratios today. Very early in the life of the universe, conditions 
must have been quite different. 

The total density of mass-energy, p, is thus very nearly the rest-mass density of 
the galaxies, p rm . Astronomical observations yield for p rm today 



The rest of this chapter, except 
for Box 27.4, is Track 2. 

No earlier track-2 material is 
needed as preparation for it, 
but it is needed as preparation 
for Chapter 29 (Present state 
and future evolution of the 
universe). 

___ ) 


Idealization of matter in 
universe as a perfect fluid 
("fluid of galaxies") 


Large-scale conditions in 
universe today: 


(1) kinetic energy and 
pressure of stars and 
galaxies 


(2) density of mass in 
galaxies 


(see §29.6). 


Prm > 2 X 10- 31 g/cm 3 


(27.4) 
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(3) cosmic-ray density 


(4) density of intergalactic 
gas 


(5) magnetic fields 


(6) radiation density 


The cosmic microwave 
radiation 


Not all the matter in the universe is tied up in galaxies; there is also matter in 
cosmic rays, with an averaged-out density of mass-energy 

Per < 10-33 g/cm 3 , (27.5) 

and, perhaps, gas in intergalactic space with 

p ig < 10 _2S g/cm 3 . (27.6) 

[Delineating more sharply the value of p ig is one of the most important goals of 
current cosmological research. For a review of this question as of 1971, see “The 
mean mass density of the universe,” pp. 56-120 in Peebles (1971).] These sources 
of mass density, and the associated pressures, one can lump together with the galaxies 
into the “cosmological fluid,” with stress-energy tensor (27.1). 

Not all the stress-energy in the universe is in the form of matter. There are also 
magnetic fields, with mean energy density that almost certainly does not exceed the 
limit 

Pmag < 10-35 g/cm 3 (27.7) 

(corresponding to fi avg < 10 -6 G), and radiation (electromagnetic radiation, neutrino 
radiation, and perhaps gravitational radiation) totaling, one estimates, 

Prad ~ 10-33 g/cm 3 . (27.8) 

The magnetic fields will be ignored in this chapter; they are unimportant for large- 
scale cosmology, except perhaps very near the “big-bang beginning” of the uni¬ 
verse—if they existed then. However, the radiation cannot be ignored, for it plays 
a crucial role. 

Most of the radiation density is in the form of “cosmic microwave radiation,” 
which was discovered by Penzias and Wilson (1965) [see also Dicke, Peebles, Roll, 
and Wilkinson (1965)], and has been studied extensively since then [for a review, 
see Partridge (1969)]. The evidence is very strong that this cosmic microwave radia¬ 
tion is a remnant of the big-bang beginning of the universe. This interpretation will 
be accepted here. 

The cosmic microwave radiation has just the form one would expect if the earth 
were enclosed in a box (“black-body cavity”) with temperature 2.7K. The spectrum 
is a Planck spectrum with this temperature, and the radiation is isotropic [Boughn, 
Fram, and Partridge (1971)]. Consequently, its pressure and density of mass-energy 
are given by the formula, 


microwave 


Vmi 


microwave 


= aT 4 


= 4 x 10" 34 g/cm 3 . 


(27.9) 


Thermodynamic considerations (§27.10) suggest that the universe should also be 
filled with neutrino radiation and perhaps gravitational radiation that have Planck 
spectra at approximately the same temperature (~3K). However, they are not 
detectable with today’s technology. 
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To high accuracy (<300 km/sec) the mean rest frame of the cosmic microwave 
radiation near Earth is the same as the mean rest frame of the galaxies in the 
neighborhood of Earth [Boughn, Fram and Partridge (1971)]. Consequently, the 
radiation can be included, along with the matter, in the idealized cosmological fluid. 

Summary: From the large-scale viewpoint, the stress-energy of the universe can 
be idealized as a perfect fluid with 4-velocity u , density of mass-energy p, pressure 
and stress-energy tensor 

T =Xp ±P)u ® u 4- pg . (27.10) 

The 4-velocity u at a given event 9 in spacetime is the mean 4-velocity of the galaxies 
near 9; it is also the 4-velocity with which one must move in order to measure an 
isotropic intensity for the cosmic microwave radiation. The density p is made up 
of material density (rest mass plus negligible kinetic energy of galaxies; rest mass 
plus kinetic energy of cosmic rays; rest mass plus thermal energy of intergalactic 
gas—all “smeared out” over a unit volume), and also of radiation energy density 
(electromagnetic radiation, neutrino radiation, gravitational radiation). The pressure 
p , like the density p, is due to both matter and radiation. Today the pressure of 
the-matter is much less than its mass-energy density, 

/^matter ^ ^matter t°day, (27.11a) 

but this strong inequality cannot have held long ago. Always the pressure of the 
radiation is £ its mass-energy density: 


/^radiation ^ ^radiation always. (27.11b) 

§27.3. GEOMETRIC IMPLICATIONS OF HOMOGENEITY 
AND ISOTROPY 

This chapter will idealize the universe to be completely homogeneous and isotropic. 
This idealization places tight constraints on the geometry of spacetime and on the 
motion of the cosmological fluid through it. In order to discover these constraints, 
one must first give precise mathematical meaning to the concepts of homogeneity 
and isotropy. 

Homogeneity means, roughly speaking, that the universe is the same everywhere 
at a given moment of time. A given moment of what time? Whose time? This is 
the crucial question that the investigator asks. 

In Newtonian theory there is no ambiguity about the concept “a given moment 
of time.” In special relativity there is some ambiguity because of the nonuniversality 
of simultaneity, but once an inertial reference frame has been specified, the concept 
becomes precise. In general relativity there are no global inertial frames (unless 
spacetime is flat); so the concept of “a given moment of time” is completely ambigu¬ 
ous. However, another, more general concept replaces it: the concept of a three- 
dimensional spacelike hypersurface. This hypersurface may impose itself on one’s 


Summary of fluid idealization 
of matter in universe 


Spacelike hypersurface as 
generalization of "moment of 
time" 
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"Homogeneity of universe" 
defined in terms of spacelike 
hypersurfaces 


"Isotropy of universe" 
defined 


Isotropy implies fluid world 
lines orthogonal to 
homogeneous hypersurfaces 


attention by reason of natural symmetries in the spacetime. Or it may be selected 
at the whim or convenience of the investigator. He may find it more convenient 
to explore spacetime here and there than elsewhere, and to push the hypersurface 
forward accordingly (“many-fingered time”; the dramatically new conception of time 
that is part of general relativity). At each event on a spacelike hypersurface, there 
is a local Lorentz frame whose surface of simultaneity coincides locally with the 
hypersurface. Of course, this Lorentz frame is the one whose 4-velocity is orthogonal 
to the hypersurface. These Lorentz frames at various events on the hypersurface 
do not mesh to form a global inertial frame, but their surfaces of simultaneity do 
mesh to form the spacelike hypersurface itself. 

The intuitive phrase “at a given moment of time” translates, in general relativity, 
into the precise phrase “on a given spacelike hypersurface.” The investigator can 
go further. He can “slice up” the entire spacetime geometry by means of a “one- 
parameter family” of such spacelike surfaces. He can give the parameter that 
distinguishes one such slice from the next the name of “time.” Such a one-parameter 
family of slices through spacetime is not required in the Regge calculus of Chapter 
42. However, such a “slicing” is a necessity in most other practical methods for 
analyzing the dynamics of the geometry of the universe (Chapters 21, 30, and 43V 
The choice of slicing may dissolve away the difficulties of the dynamic analysis or 
may merely recognize those difficulties. The successive slices of “moments of time” 
may shine with simplicity or may only do a tortured legalistic bookkeeping for the 
dynamics. Which is the case depends on whether the typical spacelike hypersurface 
is distinguished by natural symmetries or, instead, is drawn arbitrarily. 

Homogeneity of the universe means , then, that through each event in the universe 
there passes a spacelike “hypersurface of homogeneity ” (physical conditions identical 
at every event on this hypersurface). At each event on such a hypersurface the 
density, p, and pressure, p , must be the same; and the curvature of spacetime must 
be the same. 

The concept of isotropy must also be made precise. Clearly, the universe cannot 
look isotropic to all observers. For example, an observer riding on a 10 20 eV cosmic 
ray will see the matter of the universe rushing toward him from one direction and 
receding in the opposite direction. Only an observer who is moving with the cosmo¬ 
logical fluid can possibly see things as isotropic. One considers such observers in 
defining isotropy: 

Isotropy of the universe means that, at any event , an observer who is “moving with 
the cosmological fluid ” cannot distinguish one of his space directions from the others 
by any local physical measurement . 

Isotropy of the universe actually implies homogeneity; of this one can convince 
oneself by elementary reasoning (exercise 27.1). 

Isotropy guarantees that the world lines of the cosmological fluid are orthogonal 
to each hypersurface of homogeneity. This one sees as follows. An observer “moving 
with the fluid” can discover by physical measurements on which hypersurface 
through a given event conditions are homogeneous. Moreover, he can measure his 
own ordinary velocity relative to that hypersurface. If that ordinary velocity is 
nonzero, it provides the observer with a way to distinguish one space direction in 
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his rest frame from all others—in violation of isotropy. Thus in an isotropic universe, 
where the concept of “observer moving with the fluid” makes sense, each such 
observer must discover that he is at rest relative to the hypersurface of homogeneity. 
His world line is orthogonal to that hypersurface. 


Exercise 27.1. ISOTROPY IMPLIES HOMOGENEITY 

Use elementary thought experiments to show that isotropy of the universe implies homoge¬ 
neity. 


§27.4. COMOVING, SYNCHRONOUS COORDINATE 
SYSTEMS FOR THE UNIVERSE 


The results of the last section enable one to set up special coordinate systems in 
the spacetime manifold of an isotropic model universe (Figure 27.1). Choose a 
hypersurface of homogeneity S 7 . To all the events on it assign coordinate time, t v 
Lay out, in any manner desired, a grid of space coordinates (a: 1 , x 2 , x 3 ) on S,. 
“Propagate” these coordinates off S 7 and throughout all spacetime by means of the 
world lines of the cosmological fluid. In particular, assign to every event on a given 
world line the space coordinates (a: 1 , x 2 , x 3 ) at which that world line intersects S 7 . 
This assignment has a simple consequence. The fluid is always at rest relative to 
the space coordinates. In other words, the space coordinates are “comoving”; they 
are merely labels for the world lines of the fluid. For the time coordinate t of a 
given event 9, use the lapse of proper time, f dr, of 9 from S,, as measured along 
the fluid world line that passes through 9, plus t, (“standard of time” on the initial 
hypersurface S 7 ); thus, 


t(&) = t,+ 



along world line of fluid 


(27.12) 


The surfaces t — constant of such a coordinate system will coincide with the hyper¬ 
surfaces of homogeneity of the universe. This one sees by focusing attention on 
observations made by two different observers, A and B , who move with the fluid 
along different world lines. At coordinate time t t (on Sj) the universe looks the same 
to B as to A. Let A and B make observations again after their clocks have ticked 
away the same time interval At. Homogeneity of the initial hypersurface Sj, plus 
the deterministic nature of Einstein’s field equations, guarantees that A and B will 
again see identical physics. (Identical initial conditions on 5 /? plus identical lapses 
of proper time during which Einstein’s equations govern the evolution of the universe 
near A and B , guarantee identical final conditions.) Therefore, after time lapse At, 
A and B are again on the same hypersurface of homogeneity—albeit a different 


EXERCISE 


Construction of a "comoving, 
synchronous" coordinate 
system for the universe 
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This fluid 
element carries 
the label 
(*\ * 2 , x 3 ) = 
(9, 2 , 136) 



A piece of the hyper- 
surface / as t { + 5 


World lines of particles of 
the fluid with spatial grid 
attached to them (“comoving”) 


A piece of the initial hyper¬ 
surface S h with arbitrary 
coordinate grid on it 


Figure 27.1. 

Comoving, synchronous coordinate system for the universe, as constructed in §27.4 of the text. Key 
features of such a coordinate system are as follows (see §§27.4 and 27.5). (1) The spatial coordinates 
move with the fluid, and the time coordinate is proper time along the fluid world lines; i.e., the coordinate 
description of a particular fluid world line is 


(jc 1 , x 2 , x 3 ) = constant, x° = r = r 4 - constant. 


t_n 


proper time measured”! 
j_along world line J 


(2) Any surface of constant coordinate time is a hypersurface of homogeneity of the universe. Every 
such hypersurface is orthogonal to the world lines of all particles of the fluid. (3) The spatial grid on 
some initial hypersurface S 7 is completely arbitrary. (4) If y {j dx { dx* is the metric on the initial 
hypersurface in terms of its arbitrary coordinates (with y w a function of x 1 , x 2 , x 3 ), then the metric of 
spacetime in terms of the comoving, synchronous coordinate system is 

ds 2 = — dt 2 + a 2 {t) y {j dx { dxK 

Thus, the entire dynamics of the geometry of the universe is embodied in a single function of time, 
a(f) s “expansion factor”; while the shape (but not size) of the hypersurfaces of homogeneity is embodied 
in the spatial 3-metric dx 1 dxK 


one from S If where they began. By virtue of definition (27.12) of coordinate time, 
the time coordinate at the intersection of B's world line with this new hypersurface 
is t = tj + At; and similarly for A. Moreover, observers A and B were arbitrary. 
Consequently the new hypersurface of homogeneity, like S I , is a hypersurface of 
constant coordinate time, q.e.d. 

Because the hypersurfaces of homogeneity are given by t =* constant, the basis 
vectors d/dx l at any given event 9 are tangent to the hypersurface of homogeneity 
that goes through that event. On the other hand, the time basis vector, 3/3 1, is tangent 
to the world line of the fluid through 9, since that world line has x 1 = constant 
along it. Consequently, orthogonality of the world line to the hypersurface guarantees 
orthogonality of d/dt to d/dx im . 


(0/30 * (0/0*0 = 0 for / = 1,2, 3. 


(27.13a) 
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The time coordinate has another special property: it measures lapse of proper 
time along the world lines of the fluid. Because of this, and because d/d t is tangent 
to the world lines, one can write 


d/dt — (d/dj) 

= u. 


along fluid’s world lines 


(27.13b) 


where u is the 4-velocity of the ‘‘cosmological fluid.” The 4-velocity always has unit 
length, 


(d/dt)-(d/dt) = - 1 . (27.13c) 

Conditions (27.13a,c) reveal that, in the comoving coordinate frame [where 
ga/j — (d/dx a ) * {d/dx&)\ the line element for spacetime reads 

ds 2 = -dt 2 + gy dx l dxK . (27.14) 

Any coordinate system in which the line element has this form is said to be 
“synchronous” (1) because the coordinate time t measures proper time along the 
lines of constant x l (i.e., g tt = — 1), and (2) because the surfaces t = constant are 
(locally) surfaces of simultaneity for the observers who move with x { = constant 
Sti = (3/3^) * (d/3* 1 ) = 0]; it is also called a “Gaussian normal coordinate 
system” (cf Figure 21.6). 

A hypersurface of homogeneity, t — constant, has a spatial, three-dimensional 
geometry described by equation (27.14) with dt = 0: 


)on hypersurface of homogeneity ^0 


[gijlf — const dx dxK 


(27.15) 


To know everything about the 3-geometry on each of these hypersurfaces is to know 
everything about the geometry of spacetime. 


Exercise 27.2. SYNCHRONOUS COORDINATES IN GENERAL 

In an arbitrary spacetime manifold (not necessarily homogeneous or isotropic), pick an initial 
spacelike hypersurface <S /} place an arbitrary coordinate grid on it, eject geodesic world lines 
orthogonal to it, and give these world lines the coordinates 

(x 1 , x 2 , x 3 ) = constant, x° = t = tj + r, 

where t is proper time along the world line, beginning with t = 0 on S T . Show that in this 
coordinate system the metric takes on the synchronous (Gaussian normal) form (27.14). 


Form of the line element in 
this coordinate system 


EXERCISE 
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Proof that, aside from an 
over-all "expansion factor," 
all homogeneous 
hypersurfaces in the universe 
have the same 3-geometry 


§27.5. THE EXPANSION FACTOR 

To determine the 3-geometry, do 2 = g { ft, x k ) dx' dx\ of each of the hypersurfaces 
of homogeneity, split the problem into two parts: (1) the nature of the 3-geometry 
on an arbitrary initial hypersurface (dealt with in next section); and (2) the evolution 
of the 3-geometry as time passes, i.e., as attention moves from the initial hypersur¬ 
face to a subsequent hypersurface, and another, and another,... (dealt with in this 
section). 

Assume that one knows the initial 3-geometry—i.e., the coefficients in the space 
part of the metric, 

Yy(* fc ) = gifti, x k ), (27.16) 

on the initial hypersurface Sj —in its arbitrary but explicitly chosen coordinate system. 
What form will the metric coefficients g ik (t, x k ) have on the other hypersurfaces of 
homogeneity? This question is easily answered by the following argument: Consider 
two adjacent world lines, d and of the cosmological fluid, with coordinates 
(x l , x 2 , x 3 ) and (x l -f- Ax 1 , x 2 + Ax 2 , x 3 4* Ax 3 ). At time t I (on surface S,) they are 
separated by the proper distance 

Ao(tj) = (y i; - Ax' Ax*) l/2 . (27.17) 

At some later time t (on surface S), they will be separated by some other proper 
distance Ao(t). Isotropy of spacetime guarantees that the ratio of separations 
Ao(t)/Ao(tj) will be independent of the direction from d to $ (no shearing motion 
of the fluid). For any given direction, the additivity of small separations guarantees 
that Ao(t)/Ao(tj) will be independent of Ao(t I ). Thus Ao(t)/Ao(t 7 ) must be the same 
for all pairs of world lines near a given world line. Finally, homogeneity guarantees 
that this scalar ratio will be independent of position on the initial surface S 7 —i.e., 
independent of x\ x 2 , x 3 . Define a(t) to be this spatially constant ratio, 

a(t) = Ao(t)/Ao(tj). (27.18) 

Thus, a(t) is the factor by which the separations of world lines expand between time 
tj and time t. In other words, the function a(t) is a universal “expansion factor 
or “scale factor ” 

By combining equations (27.17) and (27.18), one obtains for the separation of 
adjacent world lines at time t 

Ao(t) = a(r)[Y^(x fc ) Ax' Ax ; ] 1/2 . 

This corresponds to the spatial metric at time t, 

do 2 = a 2 (t)y {j (x k ) dx' dx’, (27.19) 


and to the spacetime metric, 

ds 2 = -dt 2 -b a 2 (t)y ij (x k ) dx' dxK 


( 2120 ) 
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Figure 27.2. 

Inflation of a balloon covered with pennies as a model for the expansion of the universe. 
Each penny A may well consider itself to be the center of the expansion because the distance 
from A to any neighbor B or C increases the more the more remote that neighbor was 
to begin with (“the Hubble relation”). The pennies themselves do not expand (constancy 
of sun-Earth distance, no expansion of a meter stick, no increase of atomic dimensions). 
The spacing today between galaxy and galaxy (— 10 6 lyr) is roughly ten times the typical 
dimension of a galaxy (~~ 10 5 lyr). 


Notice that the coefficients describe the shape not only of the initial 

hypersurface, but also of all other hypersurfaces of homogeneity. All that changes 
in the geometry from one hypersurface to the next is the scale of distances. All 
distances between spatial grid points (fluid world lines) expand by the same factor 
a(t ), leaving the shape of the hypersurface unchanged. This is a consequence of 
homogeneity and isotropy; and it is precisely true only if the model universe is 
precisely homogeneous and isotropic. 

Of all the disturbing implications of “the expansion of the universe,” none is more 
upsetting to many a student on first encounter than the nonsense of this idea. The 
universe expands, the distance between one cluster of galaxies and another cluster 
expands, the distance between sun and earth expands, the length of a meter stick 
expands, the atom expands? Then how can it make any sense to speak of any 
expansion at all? Expansion relative to what? Expansion relative to nonsense! Only 
later does he realize that the atom does not expand, the meter stick does not expand, 
the distance between sun and earth does not expand. Only distances between clusters 
of galaxies and greater distances are subject to the expansion. Only at this gigantic 
scale of averaging does the notion of homogeneity make sense. Not so at smaller 
distances. No model more quickly illustrates the actual situation than a rubber 
balloon with pennies affixed to it, each by a drop of glue. As the balloon is inflated 
(Figure 27.2) the pennies increase their separation one from another but not a single 
one of them expands! [For mathematical detail see, e.g., Noerdlinger and Petrosian 
(1971).] 


What expands in the 
universe, and what does 
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EXERCISE 


Riemann tensor for 
homogeneous, isotropic 
hypersurfaces 


Exercise 27.3. ARBITRARINESS IN THE EXPANSION FACTOR 

How much arbitrariness is there in the definition of the expansion factor a(t)l Civilization 
A started long ago at time t A . For it, the expansion factor is 

( proper distance between \ 
two particles of the “cos- j 
moiogical fluid'* at time / / _ 
proper distance between 
same two particles 
at time t A 

Subsequently men planted civilization B at time t B on a planet in a nearby galaxy, [At this 
time, the expansion factor a A had the value a A (t B )]. Civilization B defines the expansion factor 
relative to the time of its own beginning: 

( proper distance between \ 
two particles of the “cos- ] 
moiogical fluid” at time // 
proper distance between 
the same two particles 
at time t B 

At two subsequent events, C and D , of which both civilizations are aware, they assign to 
the universe in their bookkeeping by no means identical expansion factors, 




a A^c) 7 " 

G a(^d) 7 ^ (*£>)■ 

Show that the relative expansion of the model universe in passing from stage C to stage 
D in its evolution is nevertheless the same in the two systems of bookkeeping: 

a A {t D ) _ /relative expansionX _ a B (t D ) 
a A (t c ) ~~ worn CloD / a B (t c ) ' 


§27.6. POSSIBLE 3-GEOMETRIES FOR 

A HYPERSURFACE OF HOMOGENEITY 

Turn now to the 3-geometry y i} dx l dx’ for the arbitrary initial hypersurface Sj. This 
3-geometry must be homogeneous and isotropic. A close scrutiny of its three-dimen¬ 
sional Riemann curvature must yield no “handles” to distinguish one point on S, 
from any other, or to distinguish one direction at a given point from any other. 
“No handles” means that (3) Riemann must be constructed algebraically from pure 
numbers and from the only “handle-free” tensors that exist: the 3-metric and 
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the three-dimensional Levi-Civita tensor e (jk . (All other tensors pick out preferred 
directions or locations.) One possible expression for {3) Riemann is 

i3>R Uki = K(y ik y jt — y u y ik )', K = “curvature parameter” = constant. (27.21) 

Trial and error soon convince one that this is the only expression that both has the 
correct symmetries for a curvature tensor and can be constructed solely from con¬ 
stants, y ijt and e ijk . Hence, this must be the 3-curvature of <S / . [One says that any 
manifold with a curvature tensor of this form is a manifold of “constant curvature. ”] 
As one might expect, the metric for 5 ; is completely determined, up to coordinate 
transformations, by the form (27.21) of its curvature tensor. (See exercise 27.4 below). 
With an appropriate choice of coordinates, the metric reads (see exercise 27.5 below), 

da 2 = y i; . dx i dx> = K~\dx 2 + sin 2 x(d6 2 + sin 2 # d$ 2 )) if K > 0, 

do 2 = y.. dx { dx’ = dx 2 + x\d9 2 + sin 2 # d<j> 2 ) if K = 0, (27.22) 

da 2 = y w dx i dx’ = (— K)~\dx? + sinh 2 x(<f# 2 + sin 2 # d§ 2 )} if K < 0. 


Absorb the factor K 1/2 or (—AT) 1/2 into the expansion factor a{t) [see exercise 
27.3], and define the function 


2 & sin x, 

^ = X, 

I = sinh x. 


if A: = K/ K = +1 (“positive spatial curvature”), 

if k = K = 0 (“zero spatial curvature”), 

if A: = K/\K\ = — 1 (“negative spatial curvature”). 


(27.23) 


Thus write the full spacetime geometry in the form 

ds 2 = —dt 2 + a\t) Yy dx' dx’, 

Yy dx' dx 1 = dx 2 + 2 2 {d0 2 + sin 2 # d<~> 2 ), 

and the 3-curvatures of the homogeneous hypersurfaces in the form 

i3)R mi = [fc/a 2 (0][YiiY ji ~ YiiYjfc ]- 


(27.24) 


(27.25a) 


The curvature parameter K after this renormalization, is evidently 

K = k/a\i). (27.25b) 

Why is the word “renormalization” appropriate? Previously a(t) was a scale factor 
describing expansion of linear dimensions relative to the linear dimensions as they 
stood at some arbitrarily chosen epoch; but the choice of that fiducial epoch was 
a matter of indifference. Now a(f) has lost that arbitrariness. It has been normalized 
so that its value here and now gives the curvature of a spacelike hypersurface of 
homogeneity here and now. Previously the factor a(t) was conceived as dimensionless. 
Now it has the dimensions of a length. This length is called the “radius of the model 
universe” when the curvature is positive. Even when the curvature is negative one 
sometimes speaks of a(t) as a “radius.” Only for zero curvature does the normaliza- 


Metric for homogeneous, 
isotropic hypersurfaces: three 
possibilities—positive, zero, 
or negative spatial curvature 


Significance of normalization 
of the expansion factor 
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tion of a(t) still retain its former arbitrariness. Thus, for zero-curvature, consider 
two choices for a(t), one of them a{t ), the other a{t) — 2a{t). Then with x = one 
can write proper distances in the three directions of interest with perfect indifference 
in either of two ways: 

a{t)d X = a(t)d x , 


a(t)xdO = a{t)x dO, 


a{t)x sin 9 d<j> = a(t)x sin 9 df>. 

No such freedom of choice is possible when the model universe is curved, because 
then the x’s in the last two lines are replaced by a function, sin x or sinhx, that 
is not linear in its argument. 

Despite the feasibility in principle of determining the absolute value of the 
“radius” a(t ) of a curved universe, in practice today’s accuracy falls short of what 
is required to do so. Therefore it is appropriate in many contexts to continue to 
regard a(t) as a factor of relative expansion, the absolute value of which one tries 
to keep from entering into any equation exactly because it is difficult to determine. 
This motivation will account for the way much of the analysis of expansion is carried 
out in what follows, with calculations arranged to deal with ratios of a values rather 
than with absolute a values. 

Box 27.2 explores and elucidates the geometry of a hypersurface of homogeneity. 


( proper distance\ 
in the direction j 
of increasing x / 

( proper distance \ 
in the direction 
of increasing 9 j 

( proper distance \ 
in the direction j 
of increasing <> / 


EXERCISES 


Exercise 27.4. UNIQUENESS OF METRIC FOR 3-SURFACE 
OF CONSTANT CURVATURE 

Let y {j and y V y be two sets of metric coefficients, in coordinate systems {**} and {**'}, that 
have Riemann curvature tensors [derived by equations (8.22) and (8.42)] of the constant- 
curvature type (27.21). Let it be given in addition that the curvature parameters K and K' 
are equal. Show that y {j and y vr are related by a coordinate transformation. [For a solution, 
see §8.10 of Robertson and Noonan (1968), or §§10 and 27 of Eisenhart (1926).] 

Exercise 27.5. METRIC FOR 3-SURFACE OF CONSTANT CURVATURE 

(a) Show that the following metric has expression (27.21) as its curvature tensor 

y„ = (l (27.26)* 


*With this choice of spatial coordinates, the spacetime metric reads 


ds 2 _ + 


{dx 2 + dy 2 + dz 2 ) 

ir+1 K(x 2 +y+ z 2 )] 2 ' 


This is often called the “Robertson-Walker line element because Robertson (1935, 1936) and Walker 
(1936) gave the first proofs that it describes the most general homogeneous and isotropic spacetime 
geometry. 
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(b) By transforming to spherical coordinates (R, 6, <t>) and then changing to a Schwarzs- 
child radial coordinate (2 irr = “proper circumference”), transform this metric into the form 

da 2 = YZYr 2 + r2(d92 + Si " 20 d<i>2) ’ (¥121) 


(c) Find a further change of radial coordinate that brings the metric into the form (27.22). 

Exercise 27.6. PROPERTIES OF THE 3-SURFACES 

Verify all statements made in Box 27.2. 

Exercise 27.7. ISOTROPY IMPLIES HOMOGENEITY 

Use the contracted Bianchi identity (3) G ifc [fc = 0 (where the stroke indicates a covariant 
derivative based on the 3-geometry alone) to show (1) that (3) VAT = 0 in equation (27.21), 
and therefore to show (2) that direction-independence of the curvature [isotropy; curvature 
of form (27.21)] implies and demands homogeneity (K constant in space). 

(continued on page 726) 



--- \ 

Box 27.2 THE 3-GEOMETRY OF HYPERSURFACES OF HOMOGENEITY 

A. Universe with Positive Spatial 
Curvature "'Spatially Closed 
Universe") 


Metric of each hypersurface is 

da 2 = a 2 [dx 2 4 sin 2 x(d# 2 4- sin 2 # d<f> 2 )]. (1) 

To visualize this 3-geometry, imagine embed¬ 
ding it in a four-dimensional Euclidean space 
(such embedding possible here; not possible for 
general three-dimensional manifold; only four 
freely disposable functions [w, x,y, z] of three vari¬ 
ables [a, /?, y] are at one’s disposal to try to repro¬ 
duce six prescribed functions [g mn (ct, /?, y)] of those 
same three variables). 

The embedding is achieved by 

w = a cos x, z — a sin x cos #, 

x = a sin x sin 9 cos <j>, (2) 

y — a sin x sin 9 sin <f>, 
since it follows that 

da 2 ~ dw 2 + dx 2 4 dy 2 4 dz 2 

— a 2 [dx 2 4 sin 2 x(d9 2 4 sin 2 9 d<p 2 )]. 



w 



A 3-surface of positive curvature embedded in four-dimen¬ 
sional Euclidean space. One rotational degree of freedom is 
suppressed by setting <t> = 0 and tt (“slice through pole,” 3 - 
(3) sphere in 4-space looks like a 2 -sphere in 3 -space). 






Box 27.2 (continued) 


Equations (2) for the embedded surface imply 
that 

w 2 4 x 2 4 y 2 4* z 2 = a 2 \ (4) 

i.e., the surface is a 3-dimensional sphere in 4- 
dimensional Euclidean space. 

To verify homogeneity and isotropy, one need 
only notice that rotations in the four-dimensional 
embedding space can move any given point [any 
given (w, x,y, z) on the 3-sphere] and any given 
direction at that point into any other point and 
direction—while leaving unchanged the line ele¬ 
ment 

do 2 — dw 2 + dx 2 + dy 2 4 dz 2 . 

The above equations and the picture show that 

(1) The 2-surfaces of fixed x (which look like cir¬ 
cles in the picture, because one rotational de¬ 
gree of freedom is suppressed) are actually 
2-spheres of surface area Ana 2 sin 2 x; and (0, <(>) 
are standard spherical coordinates on these 
2-spheres. 

(2) As x ranges from 0 to t r, one moves outward 
from the “north pole” of the hypersurface, 
through successive 2-spheres (“shells”) of area 
47 ia 2 sin 2 x (2-spheres look like circles in pic¬ 
ture). The area of these shells increases rapidly 
at first and then more slowly as one ap¬ 
proaches the “equator” of the hypersurface, 
X = ?r/2. Beyond the equator the area de¬ 
creases slowly at first, and then more rapidly 
as one approaches the “south pole”, (y = n; 
area = 0). 

(3) The entire hypersurface is swept out by 

0 < X < 

0 < 0 < n, 

0 < <f> < 2n 

(<f> is cyclic; <£ = 0 is same as <f> = 2?r); 
its 3-volume is 

T = J(adx)(a s>mxd8){a sinxsin0^) 
c v 

= I Ana 2 sin 2 x(^ dx) = 2<n 2 a z . (5) 

J o 


B. Universe with Zero Spatial 
Curvature (“Spatially Flat Universe") 

Metric of each hypersurface is 

do 2 = a 2 [d X 2 + x\dd 2 4 sin 2 0 d<j> 2 )]. (6) 

This is a perfectly flat, three-dimensional, Eucli¬ 
dean space described in spherical coordinates. In 
Cartesian coordinates 

x = ax sin 6 cos <f>, 
y — ax sin 6 sin <J>, (7) 

z = ax cos 0, 

the metric is 

do 2 — dx 2 4 dy 2 4 dz 2 . (8) 

The entire hypersurface is swept out by 
0 < x < oo, 

0 < 0 < 7T, (9) 

0 < 4 > < 2tt; 

and its volume is infinite. 

C. Universe with Negative Spatial 
Curvature (“Spatially open 
Universe") 

Metric of each hypersurface is 

do 2 = a 2 [d x 2 4 sinh 2 x (rf0 2 4 sin 2 0 d<p 2 )\. (10) 

This 3-geometry cannot be embedded in a four¬ 
dimensional Euclidean space; but it can be em¬ 
bedded in a flat Minkowski space 

do 2 = — dw 2 4 dx 2 4 dy 2 4 dz 2 . (11) 

To achieve the embedding, set 

w = a cosh x, z = a sinh x cos 0, 

x — a sinh x sin 0 cos <J>, (12) 

y = a sinh x sin 0 sin <j >; 

insert this into equation (11), and thereby obtain 

( 10 ). 

Equations (12) for the embedded surface imply 
that 

w 2 — x 2 — y 2 — z 2 = a 2 ; (13) 

i.e., the surface is a three-dimensional hyperboloid 
in four-dimensional Minkowski space. (It has the 



Typical point 

With <f> = 7T 



A 3-surface of negative curvature embedded in four-dimen¬ 
sional Minkowski space. One rotational degree of freedom is 
suppressed by setting <f> = 0 and n (“slice through pole”; 3- 
hyperboloid in 4-space looks like 2-hyperboloid in 3-space). 


same form as a mass hyperboloid in momentum 
space; see Box 22.5.) 

To verify homogeneity and isotropy, one need 
only notice that “Lorentz transformations” in the 
embedding space can move any given point on the 
3-hyperboloid and any direction through that 
point into any other point and direction—while 
leaving unchanged the line element 

do 2 = — dw 2 '+ dx 2 + dy 2 + dz 2 . 

The above equations and the picture show that 

(1) The 2-surfaces of fixed x (which look like cir¬ 
cles in the picture because one rotational de¬ 
gree of freedom is suppressed) are actually 
2-spheres of surface area Ana 2 sinh 2 x; and (0, 
<f>) are standard spherical coordinates on these 
2-spheres. 

(2) As x ranges from 0 to oo, one moves outward 
from the (arbitrarily chosen) “pole” of the 
hypersurface, through successive 2-spheres 
(“shells”) of ever increasing area Ama 2 sinh 2 x. 
For large x, surface area increases far more 
rapid ly than it would if the hypersurface were 
flat 


(proper surface area) __ A 
Am (proper distance) 2 Am l 2 


__ Am a 2 sinh 2 x 
Ama 2 x 2 

\2 l/a) 


(14) 


The entire hypersurface is swept out by 
0 < X < oo, 

0 < 9 < 77, 

0 < cj> < 2m 

(<j> is cyclic; <f> = 0 is same as <j> = 2m). 
The volume of the hypersurface is infinite. 


(15) 


D. Nonuniqueness of Topology 

Warning: Although the demand for homogeneity 
and isotropy determines completely the local geo¬ 
metric properties of a hypersurface of homogene¬ 
ity up to the single disposable factor K , it leaves 
the global topology of the hypersurface undeter¬ 
mined. The above choices of topology are the most 
straightforward. But other choices are possible. 

This arbitrariness shows most simply when the 
hypersurface is flat (k = 0). Write the full space- 
time metric in Cartesian coordinates as 


ds 2 = -dt 2 + a 2 (t)[dx 2 + dy 2 + dz 2 ]. (16) 

Then take a cube of coordinate edge L 

0 < x < L, 0 <y< L, 0 < z< L, 

and identify opposite faces (process similar to roll¬ 
ing up a sheet of paper into a tube and gluing 
its edges together; see last three paragraphs of 
§ 11.5 for detailed discussion). The resulting geom¬ 
etry is still described by the line element (16), but 
now all three spatial coordinates are “cyclic,” like 
the (j> coordinate of a spherical coordinate system: 

(t,x,y,z) is the same event as 
(t, x 4- L, y + L, z + L ). 

The homogeneous hypersurfaces are now “3-tor- 
uses” of finite volume 


V = a 3 L 3 , 

analogous to the 3-toruses which one meets under 
the name “periodic boundary conditions” when 
analyzing electron waves and acoustic waves in 
solids and electromagnetic waves in space. 

Another example: The 3-sphere described in 
part A above (case of “positive curvature”) has the 
same geometry, but not the same topology, as the 
manifold of the rotation group, 50(3) [see exer¬ 
cises 9.12, 9.13, 10.16, and 11.12]. For detailed 
discussion, see for example Weyl (1946), Coxeter 
(1963), and Auslander and Markus (1959).. 


726 


27. IDEALIZED COSMOLOGIES 


Euler equation is vacuous for 
a homogeneous universe 


"Equations of state" for 
matter and radiation 


Energy exchange between 
matter and radiation is 
negligible 


§27.7. EQUATIONS OF MOTION FOR THE FLUID 

After the above analysis of any one hypersurface of homogeneity, return to the 
dynamics of the universe. Examine, first, the evolution of the fluid, as governed by 
the law V • T = 0. 

Recall (§22.3 and 23.5) that for a perfect fluid the equations of motion split into 
two parts. The component along the 4-velocity, «/•(▼• T) = 0, reproduces the first 
law of thermodynamics 


(d/dr)(p V) = —p(dV/dr), (27.28a) 

where V is the volume of any fluid element. The part orthogonal to the 4-velocity, 
(g + i# <8> u) • (▼ • T) = 0, gives the force equation (“Euler equation”) 

(p + p) X (4-acceleration) = —(component of Vp orthogonal to i/). (27.28b) 

For a static star (§23.5) the first law of thermodynamics was vacuous, but the force 
equation was crucial. For a homogeneous universe, the converse is true; the force 
equation is vacuous (no accelerations), but the first law of thermodynamics is crucial. 

To see that the force equation is vacuous, notice that isotropy guarantees the 
vanishing of both sides of equation (27.28b). If either side were nonzero at any event 
9 , it would distinguish a direction in the homogeneous hypersurface at 9 . 

In applying the first law of thermodynamics (27.28a) to cosmology, divide the 
density and pressure into contributions due to matter and contributions due to 
radiation: 


P = Pm + Pr’ P = Pm + Pr (27.29) 

First discuss the density of mass-energy. Today p m (>10“ 31 g/cm 3 ) dominates over 
p r (~~ 10 -33 g/cm 3 ). Matter did not always dominate. Therefore, one cannot set p r = 0. 
Now discuss the pressure. During that epoch of the universe when pressure was 
significant cosmologically, p r dominated over p m . Consequently, one can neglect p m 
at all times, and one can use the “equation of state” for radiation, p r = ^p r , to write 

P = Pm + PP P =jPr (27.30) 

When (27.30) is inserted into the first law of thermodynamics (27.28a), it yields the 
result 

(d/dr)(p n V) + (d/dr)(p r V) = - jp r dv/dr. (27.31) 

One cannot integrate this equation until one knows how mass-energy is fed back 
and forth between matter and radiation—i.e., until one knows another relationship 
between p m V and p r V. All estimates indicate that, except in the first few seconds 
of the life of the universe, the energy exchanged between radiation and matter was 
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negligible compared to p m Fand p r V individually (see §28.1). Under these conditions, 
equation (27.31) can be split into two parts: 

(d/dr)(p m V) = 0, (27.32a) 


and 


First law of thermodynamics 
used to express densities of 
radiation and matter in terms 
of expansion factor 


(d/dT)( Pr V)+jp r dV/dr = 0. 


(27.32b) 


The solutions are simple: 


P m V = constant (conservation of matter) (27.33a) 

and 

p r V 4/3 = const = -j~y 3 V / constancy of number \ (27.33b) 

| V of photons / 

energy hc/X of 1 
one photon, up 
to a factor of 
| proportionality 1 

Now what is V? It is the volume of any fluid element. It has the value 

V = a 3 2 2 sin 6 Ax AO A<f> 

for a fluid element with edges Ax , AO, A<f>. Here x, 4> are constant along each 
world line of the fluid (comoving coordinates). Therefore the element of hyperspher- 
ical solid angle Z 2 sin 6 Ax AS A<f> (or pseudohyperspherical solid angle for the model 
of an open universe) is constant throughout all time for any fluid element. Therefore 
the volume of the fluid element grows in direct proportion to the cube of the 
expansion parameter a ; thus, 


V/a 3 = constant. 

Combining this result with the constancy of p m V and p r V 4/3 , one sees that 

p m a 3 = constant, p r a 4 = constant. (27.34) 

Let Pm be the density of matter today, p w be the density of radiation today, and 
a Q be the expansion factor for the universe today. Then, at any time in the past, 


p(0 Pmo "L Pn 


™ a\t) 1 rro a\t) 


(27.35 a) 


and 


Pit) = -p 


3 a\t)' 


(27.35b) 
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These results were based on two key claims, which will be justified in detail later 
(Chapter 28): the claim that in the epoch when pressure was important p m was much 
smaller than p r \ and the claim that exchange of mass-energy between radiation and 
matter was always negligible (except in the first few seconds after the “creation”). 


§27.8. THE EINSTEIN FIELD EQUATION 

Once the time evolution of the expansion factor, a(t ), is known, one can read off 
the time evolution of the density and pressure directly from equations (27.35). The 
density and pressure, in turn, determine how the expansion proceeds in time, via 
Einstein’s field equations. Thus the field equations “close the logic loop” and give 
one a closed mathematical system from which to determine all three quantities, a(t), 
Pit) and p(/). 

One can readily calculate the components of the Einstein tensor for the model 
universe using the orthonormal basis one-forms, 

cv* = dt , a(t) d\, w^ = a(t)2d0, <v* = a (t)Z sin 6 d<$>. (27.36) 


Evaluation of the Einstein 

The result [see equations (5) of Box 14.5] is 


field equation for a 
homogeneous universe: 


(27.37a) 


Gi; = G»» = G« = -2^f-(^) -Jr. 

(27.37b) 


G-- = 0 if /x ^ v. 

(27.37c) 


(With foresight, one will notice ahead of time that isotropy guarantees the equality 
G-j = = Gfo, and similar equalities for the Riemann tensor; and one will 

calculate only G—, the component that is most easily calculated.) 

The basis one-forms, w*, <v\ u)\ <u* 9 are the orthonormal basis carried along 
by an observer who moves with the “cosmological fluid.” Consequently, 7# is the 
mass-energy density, p, that he measures; 7]j is the pressure,/?; 7g vanishes, because 
he sees no energy flux (no momentum density); and vanishes for i ^ j because 
he sees no shear stresses: 

Tp t = Pf (27.38 a) 

% = =P> (27.38b) 

Tfi = 0 when /x ^ v. (27.38c) 

Equate the Einstein (“moment of rotation”) tensor of equations (27.37) to the 
stress-energy tensor of equations (27.38). And if one insists, include the so-called 
“yl-term” or “cosmological term” in the field equations [Einstein (1970): “the biggest 
blunder of my life”). Thus obtain two nonvacuous field equations. The first is an 
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“initial value equation,” which relates a t to a and p at any initial moment of time: 



omit 


(27.39a) 


The second is a “dynamic equation,” which gives the second time-derivative of the 
expansion factor, and thereby governs the dynamic evolution away from the ini tial 
moment of time, 

~ ~ 7 + JL ~ S ’P- < 27J9b > 

omit 


If (27.39b) is to be compared with anything in Newtonian mechanics, it is to be 
compared with an equation for acceleration (equation of motion), and in the same 
spirit (27.39a) is to be compared with a first integral of the equation of motion; 
that is, an equation of energy. In accordance with this comparison, note that one 
only has to differentiate (27.39a) and combine it with the relation satisfied by the 
pressure, 

(pa 3 ) ft = -p{aP) t 

(“law of conservation of energy”) to get the acceleration equation (27.39b). Without 
any loss of information, one can therefore ignore the “acceleration equation” or 
“dynamic equation” (27.39b) henceforth, and work with the analog of an energy 
equation or what is more properly known as an “initial-value equation” (details of 
initial-value problem for Track-2 readers in Chapter 21). 

What shows up here in the limited context of Friedmann cosmology is appropri¬ 
ately viewed in the wider context of general geometrodynamics. Conservation of 
energy plus one field equation have just been seen to reproduce the other field 
equations. Conversely, by accepting both field equations, one can derive the law 
of conservation of energy in the form just stated. Thus, the very act of writing the 
field equation G = 8 ttT (or, if one insists upon the “cosmological term,” 

G + Ag = 8t tT) 
omit 

was encouraged by and founded on the automatic vanishing of the divergence V • G 
(or the vanishing of the divergences of G and g), because one knew to begin with 
that energy and momentum are conserved, V * T = 0. It is not surprising, then, that 
there should be a redundancy between the conservation law, V • T = 0, and the 
field equations. Neither is it surprising in the dynamics of the Friedmann universe 
that one can use what is here the one and only interesting component of the conser¬ 
vation law, plus the one and only interesting initial value component ( G% component) 
of the field equations, to obtain the one and only interesting dynamic component 
(G- component) of the field equations. 


(1) initial value equation 


(2) dynamic equation 


Why the dynamic equation is 
superfluous 


Side remarks about initial 
value equations, dynamic 
equations, and Bianchi 
identities in more general 
contexts 
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In a similar way, in more general problems that lack symmetry, one can always 
eliminate some of the dynamic field equations, but when gravitational radiation is 
present, one cannot eliminate them all. The dynamic field equations that cannot 
. be eliminated, even in principle, govern the propagation of the gravitational waves. 
No gravitational waves are present in a perfectly homogeneous and isotropic cos¬ 
mological model; its high degree of symmetry—in particular, its spherical (2-sphere!) 
symmetry about x = 0—is incompatible with gravitational waves. 

Now turn back from general dynamics to Friedmann cosmology. To determine 
the time evolution of the expansion factor, a , insert into the initial-value equation 
(27.39a) the expression for the density of mass-energy given in (27.35a), and arrive 
at an equation ready for integration. 

Differential equation for 
expansion factor 


When one has completed the integration of this equation for a = a(t ), one turns 
back to equation (27.35a,b) to get p(/) and /?(/), and to expression (27.24) to get 
the geometry, 

ds 2 = - dt 2 4- a\t)[d X 2 + 2 2 (dQ 2 + sin 2 0 d<t> 2 )], (27.41) 

thus completing the solution of the problem. 


m=- 


A 

+ x + 

omit 


(^PmX/3) , (8wp ro a’/3) 


+ 


(27.40) 


(8w/3)p(a) 


Three choices of time 
parameter for universe: 

(1) proper time, t 


(2) expansion factor, a 


§27.9. TIME PARAMETERS AND THE 
HUBBLE CONSTANT 

To the analysis of this dynamic problem, many investigators have contributed over 
the years, beginning with Friedmann himself in 1922. They discovered, among other 
results, that there are three natural choices of time variable, the one of greatest utility 
depending on the application that one has at hand. 

First is /, the original time variable. This quantity gives directly proper time elapsed 
since the start of the expansion. This is the time available for the formation of 
galaxies. It is also the time during which radioactive decay and other physical 
processes have been taking place. 

Second is a(r), the expansion factor, which grows with time, which therefore serves 
to distinguish one phase of the expansion from another, and which consequently 
can be regarded as a parametric measure of time in its own right. The ratio of a(t) 
at two times gives the ratio of the dimensions of the universe (cube root of volume) 
at those two times. It also gives the ratio (1 + z) of wavelengths at those two times 
(see §29.2). A knowledge of the red shift, z, experienced in time past by radiation 
received today is equivalent to a knowledge of a(t)/a 0 , where a 0 is the expansion 
factor today. Specifically, radiation coming in with z = 999 is radiation coming in 
from a time in the history of the universe when it had 10 -3 of its present dimensions 
and 10~ 9 of its present volume. During the interval of time while the expansion 
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parameter is increasing from a to a + da, the lapse of proper time, according to 
(27.40), is 


dt = 


_ da _ 

[~k + (&7r/3)a 2 p(a) + (A/3)a 2 ] 1/2 ' 


(27.42) 


omit 


In terms of a as a new time parameter, it follows from this formula that the metric 
takes the form [Hughston (1969)] 

* = -k + (87T/3)a 2 p(a) + (A/3)a 2 + + + sin 2 0 d<$> 2 )}. (27.43) 

omit 


Third is -q(t), the “arc-parameter measure of time.” During the interval of time 
dt, a photon traveling on a hypersphere of radius a(t ) covers an arc measured in 
radians equal to 



(27.44) 


(3) arc parameter, 77 


When the model universe is open instead of closed, the same parameter lets itself 
be defined. Only the words “hypersphere” and “arc” have to be replaced by the 
corresponding words for a flat hypersurface of homogeneity (k = 0 ) or a hyperbol- 
oidal hypersurface (k = — 1). In all three cases, the “arc parameter” is defined by 
the integral of this expression from the start of the expansion: 


V = 


r * dt 

l a(0 ; 


(27.45) 


thus small values of the “arc parameter time,” 77 , mean early times; and larger values 
mean later times. In terms of this “arc-parameter measure of time,” the metric takes 
the form 

ds 2 = a 2 (T])[~dq 2 + dx 2 + Z 2 (d6 2 + sin 2 9 d<t> 2 )\. (27.46) 


Let a photon start at the “North Pole” of the 3-sphere (x = 0; any 0 and <f>) at 
the “arc parameter time” r j = tj v Then, by the “arc parameter time” 77 = tj 2 , the 
photon has traveled to a new point on the hypersphere and encountered a new set 
of particles of the “cosmological fluid.” They lie at the hyperpolar angle 


X-V2- Vv 


When one makes a spacetime diagram on a piece of paper to show what is happening 
when an effect propagates from one point to another in the universe, one finds it 
most convenient to take ( 1 ) the space coordinate to be x (the life histories of distinct 
particles of the “cosmological fluid” thus being represented by distinct vertical lines), 
and (2) the time coordinate to be 77 (so that photons are described by lines inclined 
at ±45°). No time parameter is more natural to use than tj when one is tracing 
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Hubble constant and 
Hubble time 


out the course of null geodesics. For an example, see the treatment of the cosmologi¬ 
cal redshift in §29.2. It also turns out that it is simpler analytically (when A is taken 
to be zero) to give a = a(rj) and t = i(rf) than to give a directly as a function of 
time. Thus one gets the connection between the dimension a and the “arc-parameter 
time” tj from the formula 

7 1 = = ————— — (27.47) 

omit 

From a knowledge of the dimension a as a function of this time parameter, one 
immediately gets proper time itself in terms of this time parameter, from the formula 

dt — a(rf) <dq. (27.48) 

An equation (27.40) for the expansion factor and a choice of parameters for 
marking out time have now set the stage for a detailed analysis of idealized cosmol¬ 
ogy, and some of the relevant questions have even been asked: How does the 
characteristic dimension, a , of the geometry (radius of 3-sphere, in the case of closure) 
change with time? What is the spacetime geometry? How do geodesics, especially 
null geodesics, travel in this geometry? However, additional questions are equally 
important: Is the expansion of the universe decelerating and, if so, how fast? How 
do density and pressure of matter and radiation vary with time? And finally, for 
the simplest and most immediate tie between theory and observation, what is the 
expansion rate? 

In speaking of expansion rate, one refers to the “Hubble constant,” the fractional 
rate of increase of distances, 

H=^r, (27.49) 

a(t) 

which is normally evaluated today i/(today) = H 0 , but is in principle defined as 
a function of time for every phase of the history of the universe. The reciprocal 
of H is the “Hubble time,” H" 1 . This quantity represents the time it would have 
taken for the galaxies to attain their present separations, starting from a condition 
of infinite compaction, if they had maintained for all time their present velocities 
(“time for expansion with dimensions linearly extrapolated back to the start”). For 
the conversion from astrophysical to geometric units and to years, take the currently 
accepted value, H 0 = 55 km/sec megaparsec (Box 29.4), as an illustration: 

tj _ _ 55 km/sec _ 

0 “ (299,793 km/sec)(3.0856 X 10 24 cm or 3.2615 X 10 6 yr of time) 

= 0.59 X 10~ 28 per cm of light-travel time 

or 5.6 X 10" 11 fractional expansion per yr, (27.50) 

77" 1 = 1.7 X 10 28 cm of light-travel time or 18 X 10 9 yr. 
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§27.10. THE ELEMENTARY FRIEDMANN COSMOLOGY 
OF A CLOSED UNIVERSE 


Take the simplest cosmological model, an isotropic homogeneous closed universe 
with A = 0, and trace out its features in all detail in the two limiting cases where 
matter dommates and where radiation dominates. The term “Friedmann universe” 
is used here for both cases, although the matter-dominated model is sometimes 
referred to as the Friedmann universe and the radiation-dominated one as the 
Tolman universe. In this analysis, it will be appropriate to let the variable a(t) 
represent the radius of the universe, as measured in cm, because only by reference 
to this radius does one have the tool in hand to discuss all the interesting geometric 
effects that in principle lend themselves to observation. After this discussion, it will 
be enough, in dealing with other models, to summarize their principal parts and 
comment on their differences from this simple model, without repeating the full 
investigation. Any reference to an open universe or any so-called “cosmological 
constant” or its effects will therefore be deferred to a brief final section, §27.11. There 
the variable a{t) will sometimes be taken to represent only a parameter of relative 
expansion, as is appropriate for discussions reaching out only to, say, z — 0.1, where 
global geometric issues are not taken up. 

Rewrite the controlling component (27.40) of Einstein’s field equation in the form 


Features of a closed 
Friedmann universe with 
A = 0: 


(1) radius as function of time 


( da \ 2 %”Pmo a o/l *”Pro a i/ 3 

(IF/ - a -— =~ 1 - ( 27 - 51 ) 


According as one neglects the radiation term or the matter term in this equation, 
the equation idealizes to 


or 


( da \ 2 , 

\dt) a “ *’ 

/27.52; matter 
\dominates 

(da\ 2 a* 2 , 

\dt) a 2 

/27.52; radiation 
Xdominates 


In both cases, the problem lends itself to comparison to the problem of particle 
motion in Newtonian mechanics with “total energy” - 1 and with an “effective 
potential energy” of the qualitative form shown in diagram A of Box 27.1—apart 
from minor differences in shape according as the potential goes as — 1 /a or as — 1 /a 2 . 

The principal features of the solution are collected in Box 27.3. 

It is a striking feature of the radiation-dominated era of the early Friedmann (2) early era, when radiation 
universe that the density of the radiation depends on time according to a simple dominates: types of 
universal law, radiation 

P r = 3/32 irt 2 (27.53) 

(final line and final column of Box 27.3). This circumstance may someday provide 

{continued on page 736) 
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Box 27.3 SOLUTIONS FOR THE ELEMENTARY FRIEDMANN 

COSMOLOGY OF A CLOSED UNIVERSE IN THE TWO LIMITING CASES IN WHICH 

(1) MATTER DOMINATES AND RADIATION IS NEGLIGIBLE, AND 

(2) RADIATION DOMINATES AND MATTER IS NEGLIGIBLE 


Idealization for 
dynamics of 3-sphere 


Matter dominated 


Radiation dominated 


Model relevant when? 


Effective “potential” in 


back into past to redshift z — 10,000; 
through today and through phase 
of maximum expansion, and re¬ 
contraction down to dimensions 
— 10,000-fold smaller than today 


very early phase of expansion, for 
redshifts z — 1,000 and greater; 
and corresponding phase in late 
stages of recontraction; not di¬ 
rectly relevant today. 


4- V{d) = 


V(a) = -- 


V(a)= -- 


Value of constant in this “potential” 
in terms of conditions at some 
standard epoch 


, 2 8w , 

3 °^ n 


Solution of dynamic equation ex¬ 
pressed parametrically in terms of 
“arc parameter” 17 (radians of arc 
distance on 3 -sphere covered by a 
photon travelling ever since start 
of expansion) 


a - - cosrj) 


. max / . 

t = •——(77 - simj) 


a — a sin 17 


t — a* (1 — cos 77) 


Range of 17 from start of expansion 
to end of recontraction 


2?7 (one trip around the universe) 


77 (gets only as far as antipodal point 
of universe) 


Nature of curve relating radius a to 
time t 


cycloid 


semicircle 


Hubble time 


(da/dt) (da/dr}) 



§27.10. FRIEDMANN COSMOLOGY OF A CLOSED UNIVERSE 


Idealization for 

dynamics of 3-sphere Matter dominated 


Radiation dominated 


Inequality between Hubble or “ex¬ 


trapolated” time and actual time 
back to start of expansion 

H - 1 > 1.5 1 

IV 

to 

Density of mass-energy 

3 

/) — ——- 

3 

«*mJ\ ~ cos „) 3 

PT 8ua* 2 sin 4 7 j 

This density expressed in terms of 

_ 3 H 2 2 

_ 3 tf 2 1 

Hubble expansion rate 

P m 877 1 -f- COS TJ 

P T 8w cos 2 !) 

Inequality satisfied by density 

. 3 H 2 

Prr, > -- 

m ~ 877 

. 3 H 2 

r 

Analysis of magnification of distant 
galaxy by curvature of intervening 

§ 29.5 and Figure 29.2 

§ 29.5 


space 




fl max 



3 



Limiting form of law of expansion 
for early times 



a = a* 7] 
a = (2a*t) 1/2 


H-' = -assy = 1.5* 

O 


H~ l = a\ 2 = It 


Other features of expansion at early 
times 


( 87 T£ 7 3 / 3 ) 

1 = 3 H 2 

6 i nt 2 8tt 


Pr = 3 p r = 

3 


a * 2 

(87rtf 4 /3) 
3 H 2 


32tt/ 2 8*77 
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a tool to tell how many kinds of radiation contributed to p r in the early universe; 
or, in other words, to learn about field physics from observational cosmology. Express 
the density of radiation in the form 

P r ( cm_2 ) = Pr, con V (erg/cm 3 ) = (27 ' 54) 


It would be surprising if electromagnetism made the sole contribution to the radiation 
density, since the following additional mechanisms are available to sop up thermal 
energy from a violently radiating source: 


f„ = 7; 

ftxv — 7 * 


electromagnetic radiation (already considered), f em — 8; 

gravitational black body radiation, f g — 8; 

neutrino plus antineutrino radiation of the electron- 
neutrino type [its contribution depends on the chemical 
potential of the neutrinos, on which see Brill and 
Wheeler (1957); a zero value is assumed here for that 
potential], 

neutrino plus antineutrino radiation of the muon- 
neutrino type [with the same assumptions as for y e ’s], 
pairs of positive and negative electrons produced out 
of the vacuum when temperatures are of the order of 
T = mc 2 /k — 0.59 x 10 10 K and higher, evaluated in 
the approximation in which these particles are treated 
as overwhelmingly more numerous than the unpaired 
electrons that one sees today, f e +e~ = 14; 

other particles such as mesons created out of the vac¬ 
uum when temperatures are two orders of magnitude 
higher (—10 12 K), and baryon-antibaryon pairs created 
out of the vacuum when temperatures are of the order 
of —10 13 K and higher, / 

sum of these /-values, f 


/r 


r ,U 


(27.55) 


As the expansion proceeds and temperatures drop below 10 13 K, then 10 12 , then 
10 10 , the various particle pairs presumably annihilate and disappear [see, however, 
Alfv6n and Klein (1962), Alfv6n (1971), Klein (1971), and Omnes (1969)]. One is 
left with the radiations of zero rest mass, and only these radiations, contributing 
to the specific heat of the vacuum. At the phases of baryon-antibaryon and electron- 
positron annihilation, the thermal gravitational radiation present has already effec¬ 
tively decoupled itself from the matter, according to all current estimates. Therefore 
the energy set free by annihilation of matter and antimatter is expected to pour 
at first into the other two carriers of energy: neutrinos and electromagnetic radiation. 
However, the neutrinos also decouple early (after baryon-antibaryon annihilation; 
before full electron-positron annihilation), because the mean free path for neutrinos 
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rises rapidly with expansion. The energy of the subsequent annihilations goes almost 
exclusively mto electromagnetic radiation. Thus the temperatures of the three radia¬ 
tions at the present time are expected to stand in the order 


, > T, > T r 


(27.56) 

T em has been measured to be 2.7 K; T p is calculated to be (4/11) 1/3 T = 1 9 K 
and T g has been calculated to be 1.5 K [Matzner (1968)] in a model where gravitom 
decouple durrng an early, quark-dominated era 

J?X U| Tn"? “ i0n : T ” * P1 “ ck specm,m ' reraaiK » "•”<* 

(see Box 2M)^Expans,on leaves eonstam the product or the product 

r,decoupled Compare the temperature of this particular radiation now to the 

StTCpta ““ radiati °"*' a ” y ch “ e " time,fter i,s CTa ° f 


r *^ sj ^r,now (1 “1“ z)T f d 
a fid ’ 


(27.57) 


2 ? r r ems , the : ed shift of any “ tracer ” spectrai une > ° s ** fiducial 

time and observed today, relative to the standard wavelength of the same transition 
as observed m the laboratory. 

If the three radiations could be catalyzed into thermodynamic equilibrium then 
all radiations could be treated on the same footing during the radiation-domi- 
m ed era of cosmology. Their individual/values could be added directly to give 
/-» +8 + 7 + 7 = 30. Temperature and time would then be connected by the 


(T/10 10 K)\t/\ sec) = 1.19. 
This formula together with (27.57) implies the relation 




(27.58a) 


(27.58b) 


This relation concerns two radiations: (1) the actual electromagnetic radiation with 
Planck spectrum (a continuum); and (2) the redshift and time of emission of a “tracer 
radiation (a hne spectrum). A measured departure from this relation could serve 
as one potential (mdirect) indication that, in accordance with standard theory 
neutrinos and gravitational radiation today are cooler than electromagnetic radiation’ 
Turn now from the radiation-dominated era of cosmology to the matter-dominated 
era. Numbers sometimes elicit more response from the imagination than formulas 
Therefore ideahze to a matter-dominated cosmology, and for the moment arbitrarily 
adopt 20 x 10^ yr and 10 x 10* yr as Hubble time and actual time, respectively 
back to the start of the expansion. It is certain that future work will show both 
numbers to require revision, but probably not by more than a factor 2, in the opinion 
of observational cosmologists. Since any judgment on the best numbers is subject 


(3) later era, when mat 
dominates 
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to uncertainty, one can pick the numbers to be simple as well as reasonable. From 
Box 27.3, one then deduces the present value of the arc parameter time 77 , 


20 X 10 9 lyr 
10 x 10 9 lyr 


H - 1 

t 


a max 0 ~ CQST ?) 2 

2 sin 7 ] 
^0?-Sin T,) 


(27.59) 


or 

t? = 1.975 (or 113.2°) (27.60) 

(arc traveled by a photon on the 3-sphere from the start of the expansion to today.) 
This fixed, all other numbers emerge as shown in Box 27.4. 



Box 27.4 A TYPICAL COSMOLOGICAL MODEL COMPATIBLE WITH 
ASTRONOMICAL OBSERVATIONS AND WITH EINSTEIN'S 
CONCEPTION OF COSMOLOGY (A = 0; Universe Closed) 


Radius at phase of maximum expansion, 
Time from start to maximum, 

Time from start to final recontraction, 
Time from start to today (adopted value), 
Radius today, 

Hubble time today (adopted value), 
Hubble expansion rate today, 

Deceleration parameter today, q 0 
[equation (29.1b)] 

Density today (3/8?r a 0 2 ) + (3// 2 /87t), 


Volume today, 2-77 2 a 0 3 , 

Density at maximum (3/877a 2 ) + ( 3 H 2 /%tt), 

Volume at maximum, 

Rate of increase of radius today, 

Rate of increase of volume today, 

Amount of matter, 

Equivalent number of solar masses, 
Equivalent number of baryons, 

Fraction visible today 


18.94 x 10 9 lyr; 

29.76 x 10 9 yr; 

59.52 x 10 9 yr; 

10 X 10 9 yr; 

13.19 x 10 9 lyr; 

20 X 10 9 yr; 

49.0 km/sec Megaparsec; 

1.7 

(7.67 + 3.33) x 10“ 58 cm -2 
= 11.00 x 10 -58 cm" 2 
or 14.8 x 10 _30 g/cm 3 ; 

38.3 x 10 84 cm 3 ; 

(3.70 + 0.00) x 10" 58 cm~ 2 
= 5.0 x 10 -30 g/cm 3 ; 

114 X 10 84 cm 3 ; 

13.19 X 10 9 lyr/20 X 10 9 yr 
= 0.66 lyr/yr; 

1.82 x 10 68 cm 3 /sec; 

5.68 x 10 56 g; 

2.86 X 10 23 ; 

3.39 x 10 80 . 

0.74 


It must be emphasized that these numbers do not deserve the title of “canonical,” 
however convenient that adjective may be for describing them; they can at most 
be called illustrative. 



Figure 27.3. 

Many Schwarzschild zones are fitted together to make a closed uni¬ 
verse. This universe is dynamic because a test particle at the interface 
between two zones rises up against the gravitational attraction of each 
and falls back under the gravitational attraction of each. Therefore 
the two centers themselves have to move apart and move back to¬ 
gether again. The same being true for all other pairs of centers, it 
follows that the lattice universe itself expands and recontracts, even 
though each Schwarzschild geometry individually is viewed as static. 
This diagram is taken from Lindquist and Wheeler (1957). 


If every five seconds a volume of space is added to the universe, a volume 
equivalent to a cube 10 5 lyr (= 0.95 x 10 23 cm) on an edge, about equal to the 
volume occupied by the Milky Way, where does that volume make its entry? Rather 
than look for an answer, one had better reexamine the question. Space is not like 
water. The outpouring of fresh water beneath the ocean at the Jesuit Spring off 
Mount Desert Island can be detected and measured by surrounding the site with 
flowmeters. There is no such thing as a flowmeter to tell “how fast space is streaming 
past.” The very idea that “space flows” is mistaken. There is no way to define a 
flow of space, not least because there is no way to measure a flow of space. Water, 
yes, space, no. Life is very different for the flowmeter, according as it is stationary 
or moving with respect to the water. For a particle in empty space, however, physics 
is indistinguishable regardless of whether the particle is at rest or moves at high 
velocity relative to some chosen inertial frame. To try to pinpoint where those cubic 
kilometers of space get bom is a mistaken idea, because it is a meaningless idea. 

One can get a fresh perspective on what is going on in expansion and recontraction 
by turning from a homogeneous isotropic closed universe to a Schwarzschild lattice 
closed universe. [Lindquist and Wheeler (1957)]. In the former case, the mass is 
idealized as distributed uniformly. In the latter, the mass is concentrated into 120 
identical Schwarzschild black holes. Each is located at the center of its own cell, 
of dodecahedral shape, bounded by 12 faces, each approximately a pentagon; and 
space is empty. The dynamics is easy to analyze in the approximation in which each 
lattice cell is idealized as spherical, a type of treatment long familiar in solid-state 
physics as the Wigner-Seitz approximation” (references in Lindquist and Wheeler). 
In this approximation, the geometry inside each lattice cell is treated as having 
exactly the Schwarzschild character (Figure 27.3); a test particle placed midway 
between black hole A and black hole B rises against the attraction of each, and 
ultimately falls back toward each, according to the law developed in Chapter 25 
[equation (25.28) with a shift of it in the starting point for defining tj]. 


(4) "Where is the new space 
created during 
expansion?"—a 
meaningless question 


r = y (1 - cos?}), 


T 



(?} - sin 7}). 


(27.61) 


Accordingly, the two masses in question must fall toward each other; and so it is 
with all the masses. One comes out in this way with the conclusion that the lattice 
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universe follows the same law of expansion and recontraction as the Friedmann 
universe to an accuracy of better than 4 per cent [Lindquist and Wheeler; Wheeler 
(1964a), pp. 370-381]. Now ask again the same meaningless question about where 
the cubic kilometers of space pour into the universe while it is expanding, and where 
they pour out while it is recontracting. Receive a fuller picture why the question 
is meaningless. Surrounding each center of mass, the geometry is and remains the 
Schwarzschild geometry (until eventually the black holes come so close together that 
they coalesce). The situation inside each cell is therefore static. Moreover, the 
interface between cell and cell is defined in imagination by a sprinkling of test 
particles so light that they have no influence on the geometry or its dynamics. The 
matchup between the geometry in one cell and the next is smooth (“tangency 
between the two geometries”). There is nothing abnormal whatsoever in the space- 
time on and near the interface. One has as little right to say those cubic kilometers 
are “created”.here as anywhere else. To speak of the “creation” of space is a bad 
way of speaking, and the original question is a bad question. The right way of 
speaking is to speak of a dynamic geometry. So much for one question! 

In charting the dynamics of the geometry of a Friedmann universe, one often 
finds that it simplifies things to take as space coordinate the hyperpolar angle x> 
measured from some chosen world line (moving with the “cosmological fluid”) as 
standard of reference; and to take as time coordinate the arc-parameter measure 
of time, rj , as illustrated in Figure 27.4. 

(5) causal isolation of various Inspection of the (x, 7?)-diagram makes it clear that photons emitted from matter 
regions of universe from at one p 0 j n t cannot reach, in a limited time, any matter except that which is located 

in a limited fraction of the 3-sphere. In a short time /, according to Box 27.3, a 
photon can cover an arc distance on the 3-sphere equal only to rj = (2 t/a*) 1/2 . 
Moreover, what is true of photons is true of other fields, forces, pressures, energies 
and influences; they cannot reach beyond this limit. Evidently the 3-sphere at time 
t is divided into a number of “zones,” 


/number of\ 
\“zones” / 


/hyperspherical solid 
\ angle of entire 3-sphere 
/hyperspherical solid\ 

\ angle of one zone / 


2tt 2 3tt /tf*\ 3/2 

4ttx 3 /3 = 2^2\T/ ’ 

(27.62) 


effectively decoupled one from the other. As time goes on, there are fewer separate 
zones, and ultimately every particle has been subjected to influences from every other 
particle in the model universe. 


EXERCISES Exercise 27.8. MATTER-DOMINATED AND RADIATION-DOMINATED REGIMES 

OF FRIEDMANN COSMOLOGY 

Derive the results listed in the last two columns of Box 27.3, except for the focusing properties 
of the curved space. 




A O 

Figure 27.4. 


Use of “arc parameter” 77 as a time coordinate and hyperpolar angle x as a space coordinate to describe 
travel of a photon (±45° line) in a Friedmann universe that is matter-dominated (center) or radiation- 
dominated (right). The burst of photons is emitted from the “TV-pole” of the 3-sphere at a time very 
little after the big bang, and the locus of the cloud of photons at subsequent stages of the expansion 
and recontraction is indicated by sections of the 3-sphere in the diagrams at the left. The matter-dominated 
Friedmann universe appears to be a reasonable model for the physical universe, except when its 
dimensions have fallen to the order of one ten-thousandth of those at maximum expansion or less 
(“radiation regime”). 


Exercise 27.9. TRANSITION FROM RADIATION-DOMINATED REGIME TO 
MATTER-DOMINATED REGIME 

Including both the radiation and the matter terms in equation (27.51), restate the equation 
in terms of the arc parameter 77 (with dr\ = dt/a) as independent variable, and integrate to 
find 

fl = («max/2) ~ [(fl ma x/2) 2 + a* 2 ] 1/2 COS (77 + 5), (27.63) 

1 = (^max/ 2 ) 1 ? - [( a max/2) 2 + a* 2 ] 1/2 [sin (tj + 8) - sin 5], (27.64) 

where 

6 = arc tan [a*/(a max /2)]. (27.65) 

(a) Verify that under appropriate conditions these expressions reduce at early times to 
a “circle” relation between radius and time and to a “cycloid” relation later. 
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(b) Assign to a* 2 the value a 0 a max / 10,000 (why?) and construct curves for the dimensionless 
measures of density, 




(8™ m ax73) 




as a function of the dimensionless measure of time. 


Io Sio (t/a m a*). 

What conclusions emerge from inspecting the logarithmic slope of these curves? 

Exercise 27.10. THE EXPANDING AND RECONTRACTING 
SPHERICAL WAVE FRONT 

An explosion takes place at the “A-pole” of the matter-dominated Friedmann model universe 
at the value of the “arc parameter time” ij = sr/3, when the radius of the universe has reached 
half its peak value. The photons from the explosion race out on a spherical wave front. 
Through what fraction of the “cosmological fluid” has this wave front penetrated at that 
instant when the wave front has its largest proper surface area? 


§27.11. HOMOGENEOUS ISOTROPIC MODEL UNIVERSES 
THAT VIOLATE EINSTEIN'S CONCEPTION 
OF COSMOLOGY 


Open Friedmann universe 
with 4 = 0: 


(1) expansion factor as 
function of time 


(2) early stage—same as for 
closed universe 


It violates Einstein’s conception of cosmology (Box 27.1)—though not the equations 
of his theory—to replace the closed 3-sphere of radius a by the open hyperboloidal 
geometry of equation (27.22) with the same scale length a . Even so, the results of 
Box 27.3 continue to apply in the two limiting regimes of matter-dominated and 
radiation-dominated dynamics when the following changes are made. (1) Change 
the constant — 1 on the righthand side of the analog of a “Newtonian energy 
equation” to +1, thus going over from a bound system (maximum expansion) to 
an open system (forever expanding). (2) Replace (1 - costj) by (coshrj - 1), sin rj 
by sinh rj, cos r] by cosh tj , and (tj — sin rj) by (sinh rj — rj). (3) The range of the “arc 
parameter” tj now extends from 0 to oo, and the curve relating “radius” a to time 
t changes from cycloid or circle to an ever-rising curve. (4) The listed inequalities 
on the Hubble time (as related to the actual time of expansion) and on the density 
(as related to 3H*/&it) no longer hold. (5) The formulas given in Box 27.3 for 
conditions at early times continue to hold, for a simple reason: at early times the 
curvature of spacetime “in the direction of increasing time” [the extrinsic curvature 
(i 5/a 2 )(da/dt) 2 as it appears in Box 27.1, equation (2)] is overwhelmingly more 
important than the curvature within any hypersurface of homogeneity, ±6 /a 2 (the 
intrinsic curvature); therefore it makes no detectable difference at early times whether 
the sign is plus or minus, whether the space is closed or open, or whether the 
geometry of space is spherical or hyperboloidal. 

Why doesn’t it make a difference? Not why mathematically, but why physically, 
doesn’t it make a difference in early days whether the space is open or closed? 
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Because photons, signals, pressures, forces, and energies cannot get far enough to 
“smell out” the difference between closure and openness. The “zones of influence” 
of (27.62) are too small for any one by itself to sense or to respond significantly 
to any difference between a negative space curvature —6/a 2 and a positive space 
curvature +6/a 2 . Therefore the simple power-law time-dependence of the density 
of the mass-energy of radiation given in Box 27.3 for a closed universe holds equally 
well in the earliest days of a radiation-dominated, open, isotropic model universe; 
thus, 

P r = 3/32 771 2 . (27.66) 

Only at a later stage of the expansion, when the “extrinsic curvature” term [equation 
(2), Box 27.1], (6 /a 2 )(da/dt) 2 (initially varying as 1.5r 2 , according to Box 27.3) has 
fallen to a value of the same order of magnitude as the “intrinsic curvature” term 
±6/a 2 (initially varying as ±3a*~ 1 r 1 ), does the sign of the intrinsic curvature begin 
to matter. Only then do the differences in rate of expansion begin to show up that 
distinguish the open model universe from the closed one. 

The open model goes on expanding forever. Therefore the density of mass-energy, (3) late stage—expansion 
whether matter-dominated and proportional to a max /a 3 , or radiation-dominated and forever 
proportional to a* 2 /a 4 , or some combination of the two, (1) ultimately falls to a level 
that is negligible in comparison with the intrinsic curvature, —6/a 2 , and (2) thereafter 
can be neglected. Under these circumstances, the only term left to balance the 
intrinsic curvature is the extrinsic curvature. The important component of the field 
equation (after removal from all terms of a common factor 3) now reads 



For a closed universe, the two terms (one sixth the extrinsic curvature and one sixth 
the intrinsic curvature) have the same sign, and any equation like (27.67) leads to 
an impossibility. Here, however, rather than impossibility, one has the remarkably 
simple solution 

a — t, (27.68) 

and the corresponding metric 


Write 


ds 2 = -dt 2 + i 2 [d x 2 + sinh 2 x (d8 2 + sin 2 0 d<t> 2 )). (27.69) 


r = t sinh x, 
'new = * COsh X. 


(27.70) 


and find that (27.69), solution as it is of Einstein’s empty-space field equation, is 
identical with the Lorentz-Minkowski metric of flat spacetime, 


ds 2 = —dt 2 evi + dr 2 + r 2 (d6 2 -f sin 2 6 d<\> 2 ) (27.71) 


(see Box 27.2C). This geometry had acquired the flavor of an expanding universe 
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Homogeneous cosmologies 
with A ^ 0: 

(1) equation for evolution of 
expansion factor 


(2) qualitative features of 
evolution 


because the cosmological fluid, too thinly spread to influence the dynamics of the 
geometry, and serving only to provide marker points, was flying out in all directions 
[for a fuller discussion of this “expanding Minkowski universe,” see, for example, 
Chapter 16 of Robertson and Noonan (1968)]. The typical spacelike hypersurface 
of homogeneity looks to have a curved 3-space geometry, and does have a curved 
geometry (instrinsic curvature), because the slice (27.70) through flat spacetime is 
itself curved (extrinsic curvature). 

Turn now to a second violation of Einstein’s conception of cosmology: a cosmolog¬ 
ical term in the field equation (27.39), 

/ da/dtV k A 8w ^P^aj/3 8wp ro a^/3 

+ 7 - T - T p< '= ^ + ' ' (27 72) 

In analyzing the implications of this broadened equation, turn attention from the 
“radius” a(t) itself, which was the focus of interest in the previous section, §27.10, 
on Friedmann cosmology. Recognize that present measurements have not yet pro¬ 
vided a good, direct handle on the absolute dimension a(t) of the universe. However, 
they do give good figures for the redshift z and therefore for the ratio between a 
at the time of emission and a ~ a 0 now, 


a 0 /a = 1 + z (27.73) 

For any comparison with observations designed to fix limits on k (Einstein value: 
& = +1) and on A (expected to be zero), it is therefore appropriate to rewrite the 
foregoing equations so that they refer as much as possible only to ratios. Thus one 
rephrases (27.72) as the “generalized Friedmann equation,” 


(27.74) 

Here the quantity 

VWa.-, = - *L (i) + ft. (£f ] - i 4 (j-J (27.75) 


acts as an “effective potential” for the dynamics of the expansion. The constant term 
K 0 represents one sixth of the intrinsic curvature of the model universe today. Its 
negative, — K 0 , plays the role of an “effective energy” in the generalized Friedmann 
equation (Box 27.5). All the qualitative features of the cosmology can be read off 
from the curve for the effective potential as a function of (a/a 0 ) and from the value 

of A,. 

For a quantitative analysis, the log-log diagram of Figure 27.5 is often more useful 
than the straight linear plot of V against (a/a 0 ) of Box 27.5. 

AO the limiting features shown in the varied types of cosmology have been 
encountered before in the analysis of the elementary Friedmann cosmology (big bang 
out of a configuration of infinite compaction; reaching a maximum expansion at 
a turning point, or continued expansion to a Minkowski universe; recoUapse to 
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infinite density) or lend themselves to simple visualization (static but unstable 
Einstein universe; “hesitation” model; “turnaround” model), except for the even 
more rapid expansion that occurs when A is positive and the dimension a has 
surpassed a certain critical value. In this expansion, a eventually increases as 
exp [(A/3) 1/2 t] irrespective of the openness or closure of the universe (k = 0, ±1). 
This expansion dominates every other feature of the cosmology. Therefore, in dis¬ 
cussing it, it is appropriate to suppress every other feature of the cosmology, take the 
density of matter to be negligible, and take k = 0 (hypersurfaces of homogeneity 
endowed with flat 3-space geometry). In this limit, one has the following empty-space 
solution of Einstein’s field equation with cosmological constant: 

ds 2 = —dt 2 + a 2 e 2(A/3) ' /2 '[dx 2 + x 2 (d0 2 + sin 2 # <7<f> 2 )]. (27.76) 

This “de Sitter universe” [de Sitter (1917a,b)] may be regarded as a four-dimensional 
surface, 

-(z 0 ) 2 + (z 1 ) 2 + (z 2 ) 2 + (z 3 ) 2 + (z 4 ) 2 = 3/A, (27.77) 

in a five-dimensional space endowed with the metric 

(ds) 2 = -(dz 0 ) 2 + (dz 1 ) 2 + (dz 2 ) 2 + (dz 3 ) 2 + (dz 4 ) 2 . (27.78) 

The correctness of this description may be checked directly by making the substitu¬ 
tions 


z° = (3/A) 1/2 sinh [(A/3) 1/2 t\ + j-(A/3) 1/2 e <A/3), ' 2l a 2 X 2 , 

z 4 = (3/A) 1/2 cosh [(A/3) 1/2 t] - l(A/3) 1/2 e iA/3)1/H a 2 x 2 , 

2 (27.79) 

z 1 = a 0 e (A/3)1/2t x sin 8 cos <£, 
z 2 = a 0 e iA/3)1/2t x sin 8 sin <f>, 
z 3 = a 0 e iA/3)1/2t x cos 8. 

Because of its beautiful group-theoretical properties and invariance with respect to 
5 X 4/2 = 10 independent rotations, the de Sitter geometry has been the subject 
of scores of mathematical investigations. The physical implications of a cosmology 
following the de Sitter model are described for example by Robertson and Noonan 
(1968, especially their §16.2). The de Sitter model is the only model obeying Ein¬ 
stein’s equations (with A ^ 0) which (1) continually expands and (2) looks the same 
to any observer who moves with the cosmological fluid, regardless of his position 
or his time. Any model of the universe satisfying condition (2) is said to obey the 
so-called “perfect cosmological principle.” This phrase arose in the past in studying 
models that lie outside the framework of general relativity, models in which matter 
is envisaged as continuously being created, and to which the name of “steady-state 
universe” has been given. Any such model has been abandoned by most investigators 
today, not least because it gives no satisfactory account of the 2.7 K background 
radiation. 


(3) de Sitter universe 


Other non-Einsteinian 
cosmologies: 

(1) steady-state model 


(continued on page 748) 
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— 

Box 27.5 EFFECT OF VALUE OF COSMOLOGICAL CONSTANT AND 

OF INTRINSIC CURVATURE OF MODEL UNIVERSE "TODAY" 

ON THE PREDICTED COURSE OF COSMOLOGY 

The “effective potential” V in the generalized Friedmann equation (27.74) is repre¬ 
sented schematically here as a function of the expansion ratio a/a 0 . The diagram 
illustrates the influence on the cosmology of (1) the cosmological constant A (deter¬ 
mines the behavior of the effective potential at large values of a/a Q \ see dashed 
curves) and (2) the value adopted for K 0 — (one sixth of the intrinsic curvature of 
3-space at the present epoch). The value of the quantity — K 0 determines the “effec¬ 
tive energy level” and is shown in the diagram as a horizontal line. The difference 
between this horizontal line and the effective potential determines (a~ l da/dt) 2 . 
Regions where this difference is negative are inaccessible. From the diagram one 
can read off the histories of 3-space on the facing page. 



The diagram is schematic, not quantitative. Representative values might be A conv — 0 
or ±3 X 10~ 28 g/cm 3 ; = 10- 30 g/cm 3 or p^^ = 10“ 28 g/cm 3 ; and 

(a' 1 da/dt) 2 = H 2 = (1/20 X 10 9 yr) 2 or 3.8 X 10 -29 g/cm 3 . At small values of 
a/a 0 the cosmological term — {A/?>)(a/a Q ) 2 is negligible. Not negligible at small values 
of ( a/a 0 ) is the difference between a model universe curved only by the density of 
matter (the dashed curve in the diagram) and one curved also by a density of 
radiation (the full curve). The different dependence of “radius” and density on time 
at early times in these two cases of a matter-dominated cosmology and a radiation- 
dominated cosmology is spelled out in the last part of Box 27.3, giving in the one 
case p = 1 / 6777 2 and in the other p = 3/32?rr 2 . 
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Intrinsic curvature 
of space today 

A 

Cosmology 

Hyperbolic; 

K 0 negative 

negative 

Universe starts in a condition of infinite density. It ex¬ 
pands to a maximum extent (or minimum density) governed 
by the value of A. It then recontracts at an ever increasing 
rate to a condition of infinite density. 

Hyperbolic; 

K 0 negative 

zero 

Universe starts in a condition of infinite density. It ex¬ 
pands. Ultimately the rate of expansion reaches a steady 
rate, da/dt = 1. The 4-geometry is Minkowski flat spacetime. 
Only the curvature of the spacelike slices taken through this 
flat 4-geometry gives the 3-geometry its hyperbolic character 
[see equation (27.70)]. 

Closed; 

K 0 positive 

zero 

Standard Friedmann cosmology: expansion from infinite 
compaction to a finite radius and recontraction and collapse. 

Closed; 

K 0 positive 

negative 

Qualitatively same as foregoing. Quantitatively a slightly 
smaller radius at the phase of maximum expansion and a 
slightly shorter time from start to end. 

Closed; 

K 0 positive 

A more positive than a certain 
critical value: A > A cril 

“Summit” of “effective potential” is reduced to a value 
slightly less than — K a . The closed universe once again starts 
its expansion from a condition of infinite compaction. This 
expansion once again slows down as the expansion proceeds 
and then looks almost as if it is going to stop (“moment of 
hesitation”). However, the representative point slowly passes 
over the summit of the potential. Thereafter the expansion 
gathers more and more speed. It eventually follows the ex¬ 
ponential law 



a = constant X exp [(4/3) 1/2 /J. 

Closed; 

K 0 positive 

A positive and exactly equal 
to the critical value, A = yl crit , 
that puts the “summit of the 
potential” into coincidence 
with ~K q 

Situation similar to that of a pencil with its tip dug into 
the table and provided with just enough energy to rise 
asymptotically in infinite time to the vertical position. Uni¬ 
verse starts from a compact configuration and expanding 
approaches a certain radius (“Einstein radius”, a P ) according 
to a law of the form & 



a = a E - constant X exp (—«/). 

Or (Einstein’s original proposal, when he thought that the 
universe is static, and added the “cosmological term,” 
against his will, to the field equation to permit a static uni¬ 
verse) the representative point sits forever at the “summit of 
the effective potential” (Einstein universe). Aside from con¬ 
tradicting present-day evidence on expansion, this configura¬ 
tion has the same instability as does a pencil trying to stand 
on its tip. The least disturbance will cause it to~“faH” either 
way, toward collapse or toward accelerating expansion, in 
the expansion case ultimately approaching the law 



a = constant x exp[(4/3) 1/2 /]. 

Closed; 

K 0 positive 

A less positive than the 
critical value: 

0 < 4 < A crit {K 0 ) 

Motion on the large a side of the “potential barrier.” Far 
back in the past the model universe has enormous dimen¬ 
sions, but is also contracting with enormous rapidity, in 
approximate accord with the formula 


a ~ constant X exp [-(yl/3) 1/2 /]. 

The radius a reaches a minimum value and thereafter the 
universe reexpands (“turn-around solution”), ultimately ap¬ 
proaching the asymptotic law 

a - constant X exp [(7l/3) 1/2 /]. 

___— - J 
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Figure 27.5. (facing page) 

Log-log plot of the effective potential V(a) of equation (27.75) and Box 27.5 as it enters the generalized 
Friedmann equation 




Horizontally is given the expansion ratio referred to (a, ^ O ) today = 1 as standard of reference. Vertically 
is given the value of V(a/a 0 ) in the geometric units of cm* 2 . The supplementary scale at the right translates 
to — (c 2 /G)(3/877) V(a/a Q ) as an equivalent density, expressed in g/cm 3 . The contribution of radiation 
density to the effective potential is indicated by the wavy line in the diagram. It is normalized to a value 
of the radiation densiiy today of p w = 10~ 33 g/cm 3 and has a logarithmic slope of two. The contribution 
of matter density to the “effective potential” has a logarithmic slope of unity. Two choices are made 
for it, corresponding to a density of matter today of p mo = 10* 30 g/cm 3 and p mo — 10~ 28 g/cm 3 (dashed 
lines in the diagram). The total effective potential in the two cases is also indicated in the diagram: 
a heavy line for the case p ^ = 10 -30 g/cm 3 (no cosmological term included) and a light line for the 
case = 10 -28 g/cm 3 . In this second case, a cosmological term is included, with the cosmological 
constant given by (3/8?r)(yl/3) = 10~ 29 g/cm 3 . The line describing the contribution of this term has a 
negative slope of magnitude two (dashed line). The horizontal or “level line” is drawn for a value of 
the Hubble expansion rate today. H 0 , equal to 1/(20 X 10 9 years). The vertical separation on the log 
plot between the potential curve and the level line gives the ratio — V/H 2 . This ratio as evaluated at 
any time t has the value a 2 (t)/dl + K 0 H~ 2 , where a = da/dt. As evaluated “today” (a/a 0 — 1) this ratio 
has the value 1 + K 0 H~ 2 . Knowing the Hubble expansion rate H 2 today, and knowing (or trying 
a certain set of parameters for) the potential curve, one can therefore deduce from the spread between 
the two the value of 1 + A',,/// 2 , hence the value of K 0 H~ 2 , hence the present value, K 0 , of the curvature 
factor. As an example, consider the case of the low-density universe (heavy line) and read off “today’s” 
value, 1 + K 0 H~ 2 = 0.223. From this follows K 0 = —0.777 H 2 (open or hyperbolic universe), hence 
k = -1 and a 0 = ( k/K 0 ) l/2 = (1/0.777) 1 ' 2 20 X 10 9 yr = 22.7 X 10 9 yr. For the high-density model 
universe, with p^ = 10“ 28 g/cm 3 , one similarly finds 1 + A^/// 2 = 24.5, hence K 0 = +23.5 H 2 , hence 
k = +1 (closed universe) and a 0 = (k/K 0 ) 1/2 = (1/23.5) 1/2 20 x 10 9 yr = 4.12 X 10 9 yr. Expansion 
stops, if and when it stops, at that stage when the ratio — V/H 2 between the potential and the level 
line, or a 2 (t)/a 2 + K 0 Hr 2 , falls from its “present value” of 1 + K 0 H~ 2 to 0 + K 0 H~ 2 ; that is, from 0.223 
to —0.777 in the one case, and from 24.5 to 23.5 in the other case. This log-log plot should be replaced 
by the linear plot of Box 27.5 when A < 0. 


However great a departure it is from Einstein’s concept of cosmology to give any 
heed to a cosmological constant or an open universe, it is a still greater departure 
hierarchic model to contemplate a “hierarchic model” of the universe, in which clusters of galaxies, 

and clusters of clusters of galaxies, in this part of the universe are envisaged to grade 
off in density with distance, with space at great distances becoming asymptotically 
flat [Alfven and Klein (1962), Alfv6n (1971), Klein (1971), Moritz (1969), de Vau- 
couleurs (1971), Steigman (1971)]. The viewpoint adopted here is expressed by Oskar 
Klein in these words, “Einstein’s cosmology was adapted to the discovery by Hubble 
that the observed part is expanding; the so-called cosmological postulate has been 
used as a kind of an axiomatic background which, when analyzed, makes it appear 
that this expansion is shared by a very big, but still bounded system. This implies 
that our expanding metagalaxy is probably just one of a type of stellar objects in 
different phases of evolution, some expanding and some contracting ” 
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, a /a 0 -^ 



The contrast between the hierarchic cosmology and Einstein’s cosmology [Einstein 
(1931) advocates a closed Friedmann cosmology] appears nowhere more strongly 
than here, that the one regards asymptotically flat spacetime as a requirement; the 
other, as an absurdity. “Only the genius of Riemann, solitary and uncomprehended,” 
Einstein (1934) puts it, “had already won its way by the middle of the last century 
to a new conception of space, in which space was deprived of its rigidity, and in 
which its power to take part in physical events was recognized as possible.” That 
statement epitomizes cosmology today. 
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EXERCISES 


But today’s view of cosmology, as dominated by Einstein’s boundary condition 
of closure (k = -f 1) and his belief in A = 0, need not be accepted on faith forever. 
Einstein’s predictions are clear and definite. They expose themselves to destruction. 
Observational cosmology will ultimately confirm or destroy them, as decisively as 
it has already destroyed the 1920 belief in a static universe and the 1948 steady-state 
models (see Box 27.7 on the history of cosmology). 


Exercise 27.11. ON SEEING THE BACK OF ONE'S HEAD 

Can a being at rest relative to the “cosmological fluid” ever see the back of his head by 
means of photons that travel all the way around a closed model universe that obeys the 
Friedmann cosmology and has a non-zero cosmological constant (see the entries in Box 27.3 
for the case of a zero cosmological constant)? 

Exercise 27.12. DO THE CONSERVATION LAWS FORBID THE PRODUCTION OF 
PARTICLE-ANTI PARTICLE PAIRS OUT OF EMPTY SPACE BY 
TIDAL GRAVITATION FORCES? 

Find out what is wrong with the following argument: “The classical equations 

G «n = Swr aj8 

are not compatible with the production of pairs, since they lead to the identity TJ*.* = 0. 
Let the initial state be vacuum, and let u and its derivative be equal to zero on the 
hypersurface t = const or / = — oo. It then follows from = 0, that the vacuum is always 
conserved.” [Answer: See Zel’dovich (1970, 1971, 1972). Also see §30.8.] 

Exercise 27.13. TURN-AROUND UNIVERSE MODEL NEGLECTING 
MATTER DENSITY 

If turn-around (minimum radius) occurs far to the right (large a) of the maximum of the 
potential V(a) in equation (27.75), the matter terms will be negligible. Let = p ro = 0. 
Then (what signs of k, A are needed for turn-around?), solve to show that A = 3(a min )“ 2 , 
H — (a^)" 1 tanh (t/a min ) near turnaround {t = 0) and that the deceleration parameter 
<7= —(1 /H 2 a)(d 2 a/dt 2 ) has the value 

q= >aV"4)‘l< “1. 

Exercise 27.14. ''HESITATION" UNIVERSE 

Neglect radiation in equation (27.75) but assume K 0 and A to be chosen so that the universe 
spent a very long time with a{t) near a h ( a h measures location of highest point of the barrier, 
or the size of the universe at which the universe is most sluggish). Choose a h = a 0 /2> to 
produce an abnormally great number of quasar redshifts near z — 2 [as Burbidge and 
Burbidge (1969a,b) believe to be the case, though their opinion is not shared by all observers]. 
Show (a) that the density of matter now would account for only 10 per cent of the value 
of Hi — (a/fl) 2 0W in equation (27.75) [“missing matter”, i.e., K 0 and A terms, account for 
90 per cent], (b) that a h ~ 20and (c) that the deceleration parameter defined in the 
previous exercise, as evaluated “today”, has the value q 0 = —13/10. 
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Exercise 27.15. UNIVERSE OPAQUE TO BLACK-BODY RADIATION AT 
A NONSINGULAR PAST TURN-AROUND REQUIRES 
IMPOSSIBLE PARAMETERS 

From a plot like that in Box 27.5, construct a model of the universe that contains 2.7 K 
black-body radiation at the present, but, with k = +1 and A > 0, had a past turn-around 
(minimum radius) at which the blackbody temperature reached 3,000 K where hydrogen 
would be ionized. Try to use values of H~ x and that are as little as possible smaller 
than presently accepted values. 




Box 27.6 ALEXANDER ALEXANDROVITCH FRIEDMANN 

St. Petersburg, June 17, 1888—Leningrad, September 15, 1925 




Graduated from St. Petersburg University, 1909; 
doctorate, 1922; 1910, mathematical assistant in 
the Institute of Bridges and Roads; 1912, lecturer 
on differential calculus in the Institute of Mines; 
1913, physicist in the Aerological Institute of 
Pavlov; dirigible ascent in preparation for observ¬ 
ing eclipse of the sun of August 1914; volunteer 
in air corps on war front near Osovets, 1914; head 
of military air navigation service, 1916-1917; pro¬ 
fessor of mechanics at Perm University, 1918; St. 
Petersburg University, 1920; lectures in hydrody¬ 
namics, tensor analysis; author of books, Experi¬ 
ments in the Hydromechanics of Compressible Liq¬ 
uids and The World as Space and Time, and the 
path-breaking paper, On the Curvature of Space, 
1922; a director of researches in the department 
of theoretical meteorology of the Main Geophysi¬ 
cal Laboratory, Leningrad, and, from February 
1925 until his death of typhoid fever seven months 
later, director of that Laboratory; with L. V. Keller 
“introduced the concept of coupling moments, i.e., 
mathematical expectation values for the products 
of pulsations of hydrodynamic elements at differ¬ 
ent points and at different instants ... to elucidate 
the physical structure of turbulence” [condensed 
from Polubarinova-Kochina (1963), which also 
contains a bibliography of items by and about 
Friedmann], 
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Box 27.7 SOME STEPS IN COSMOLOGY ON THE WAY TO WIDER 

PERSPECTIVES AND FIRMER FOUNDATIONS [For general reference on 
the history of cosmology, see among others Munitz (1957), Nesr (1964), 
North (1965), Peebles (1971), Rindier (1969), and Sciema (1971); and 
especially see Peebles and Sciama for bibliographical references to 
modern developments listed below in abbrevieted form.] 

A. Before the Twentieth Century 

Concepts of very early Indian cosmology [summarized by Zimmer (1946)]: “One 
thousand mahayugas—4,320,000,000 years of human reckoning—constitute a single 
day of Brahma, a single kalpa. ... I have known the dreadful dissolution of the 
universe. I have seen all perish, again and again, at every cycle. At that terrible 
time, every single atom dissolves into the primal, pure waters of eternity, whence 
all originally arose.” 

Plato, ca. 428 to ca. 348 b.c. [from the Timaeus, written late in his life, as translated 
by Comford (1937)]: “The world [universe] has been fashioned on the model of 
that which is comprehensible by rational discourse and understanding and is always 
in the same state. . . . this world came to be ... a living creature with soul and 
reason. ... its maker did not make two worlds nor yet an indefinite number; but 
this Heaven has come to be and is and shall be hereafter one and unique. ... he 
fashioned it complete and free from age and sickness. ... he turned its shape 
rounded and spherical... It had no need of eyes, for nothing visible was left outside; 
nor of hearing, for there was nothing outside to be heard. ... in order that Time 
might be brought into being, Sun and Moon and five other stars—‘wanderers/ as 
they are called—were made to define and preserve the numbers of Time. . . . the 
generation of this universe was a mixed result of the combination of Necessity and 
Reason ... we must also bring in the Errant Cause. . . . that which is to receive 
in itself all kinds [all forms] must be free from all characters [all form] .... For 
this reason, then, the mother and Receptacle of what has come to be visible and 
otherwise sensible must not be called earth or air or fire or water . . . but a nature 
invisible and characterless, all-receiving, partaking in some very puzzling way of 
the intelligible, and very hard to apprehend.” 

Aristotle, 384-322 b.c. [from On the Heavens , as translated by Guthrie (1939)]: 
“Throughout all past time, according to the records handed down from generation 
to generation, we find no trace of change either in the whole of the outermost heaven 
or in any one of its proper parts. . . . the shape of the heaven must be spherical. . . . 
From these considerations [motion invariably in a straight line toward the center; 
regularity of rising and setting of stars; natural motion of earth toward the center 
of the universe] it is clear that the earth does not move, neither does it lie anywhere 
but at the center. . . . the earth . . . must have grown in the form of a sphere. This 
[shape of segments cut out of moon at time of eclipse of moon; and ability to see 



§27.11. BRIEF HISTORY OF COSMOLOGY 


753 



in Egypt stars not visible in more northerly lands] proves both that the earth is 
spherical and that its periphery is not large . . . Mathematicians who try to calculate 
the circumference put it at 400,000 stades [1 stade = 600 Greek feet = 606 English 
feet; thus 24.24 x 10 7 ft/(6080 2 ft/nautical mile) = 39,900 nautical miles—the 
oldest recorded calculation of the earth’s circumference, and reportedly known to 
Columbus—85 per cent more than the true circumference, 60 x 360 = 21,600 
nautical miles].” 

Aristotle [from the Metaphysics , as translated by Warrington (1956)]: “Euxodus 
[of Cnidos, 408-355 b.c.] supposed that the motion of the sun and moon involves, 
in each case, three spheres. ... He further assumed that the motion of the planets 
involves, in each case, four spheres. . . . Calippus [of Cyzicus, flourished 330 b.c.] 
. . . considered that, in the light of observation, two more spheres should be added 
to the sun, two to the moon, and one more to each of the other planets.” 

Eratosthenes, 276-194 b.c. [a calculation attributed to him by Claudius Ptolemy, 
who observed at Alexandria from 127 to 141 or 151 a.d., in his Almagest, I, §12; 
see the translation by Taliaferro (1952)]: 

(Maximum distance of moon from earth) = (64J) (radius of earth); 

(Minimum distance of sun from earth) = (1,160) (radius of earth). 

Abu ’All al-Husain ibn ’Abdallah ibn SIna, otherwise known as Avicenna, 980-1037; 
physician, philosopher, codifier of Aristotle, and one of the most influential of those 
who preserved Greek learning and thereby made possible its transmission to me¬ 
diaeval Europe [quoted in Nasr (1964), p. 225]: “Time is the measure of motion.” 

From the Rasa y il y F51-treatise encyclopedia, sometimes known as the Koran after 
the Koran , of the Ikhwan al-Safa’ or Brothers of Sincerity, whose main center was 
at Basra, Iraq, roughly a.d. 950-1000 [as quoted by Nasr (1964), p. 64; see p. 78 
for a list of distances to the planets (in terms of Earth radii) taken from the Rasa'il, 
as well as sizes of planets and the motions of rotation of the various Ptolemaic 
carrier-spheres]: Space is “a form abstracted from matter and existing only in the 
consciousness.” 

Abu Raihan al-BirunJ, 973-1030, a scholar, but concerned also with experiment, 
observation, and measurement, who calculated the circumference of the Earth from 
measurements he made in India as 80,780,039 cubits (about 4 per cent larger than 
the value accepted today), and gave a table of distances to the planets [as quoted 
in Nasr (1964), pp. 120 and 130]: “Both [kinds of eclipses] do not happen together 
except at the time of the total collapse of the universe.” 

fitiene Tempier, Bishop of Paris, in 1277, to settle a controversy then dividing much 
of the French theological community, ruled that one could not deny the power of 
God to create as many universes as He pleases. 

Roger Bacon, 1214-1294, in his Opus Majus { 1268), gave the diameter of the sphere 
that carries the stars, on the authority of Alfargani, as 130,715,000 Roman miles 
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Box 27.7 {continued) 

[mile equal to 1,000 settings down of the right foot]; the volume of the sun, 170 
times that of the Earth; first-magnitude star, 107 times; sixth-magnitude, 18 times 
Earth. 

Nicolas Cusanus, 1401-1464 [from Of Learned Ignorance (1440), as translated by 
Heron (1954)]: “Necessarily all parts of the heavens are in movement. ... It is 
evident from the foregoing that the Earth is in movement . . . the world [universe], 
its movement and form . . . will appear as a wheel in a wheel, a sphere in a sphere 
without a center or circumference anywhere. ... It is now evident that this Earth 
really moves, though to us it seems stationary. In fact, it is only by reference to 
something fixed that we detect the movement of anything. How would a person 
know that a ship was in movement, if. . . the banks were invisible to him and he 
was ignorant of the fact that water flows?” 

Nicolaus Copernicus, February 19, 1473, to May 24, 1543 [from De Revolutionibus 
Orbium Coelestrum (1543), as translated by Dobson and Brodetsky (1947)]: “I was 
induced to think of a method of computing the motions of the spheres by nothing 
less than the knowledge that the mathematicians are inconsistent in these investiga¬ 
tions. . . . they cannot explain or observe the constant length of the seasonal year. 

. . . some use only concentric circles, while others eccentrics and epicycles. . . . Nor 
have they been able thereby to discern or deduce the principal thing—namely the 
shape of the universe and the unchangeable symmetry of its parts. . . . 

“I found first in Cicero that Nicetas had realized that the Earth moved. Afterwards 
I found in Plutarch [—a.d. 46-120] . . . The rest hold the Earth to be stationary, 
but Philolaus the Pythagorean [bom —480 b.c.] says that she moves around the 
(central) fire on an oblique circle like the Sun and Moon. Heraclides of Pontus 
[flourished in 4th century b.c.] and Ecphantus the Pythagorean also make the Earth 
to move, not indeed through space but by rotating round her own center as a wheel 
on an axle from West to East.’ Taking advantage of this I too began to think of 
the mobility of the Earth. . . . 

“Should we not be more surprised if the vast Universe revolved in twenty-four 
hours, than that little Earth should do so? . . . Idle therefore is the fear of Ptolemy 
that Earth and all thereon would be disintegrated by a natural rotation. . . . That 
the Earth is not the center of all revolutions is proved by the apparently irregular 
motions of the planets and the variations in their distances from the Earth. . . . We 
therefore assert that the center of the Earth, carrying the Moon’s path, passes in 
a great orbit among the other planets in an annual revolution round the Sun; that 
near the Sun is the center of the Universe; and that whereas the Sun is at rest, any 
apparent motion of the Sun can be better explained by motion of the Earth. . . . 
Particularly Mars, when he shines all night, appears to rival Jupiter in magnitude, 
being distinguishable only by his ruddy color; otherwise he is scarce equal to a star 
of the second magnitude, and can be recognized only when his movements are 
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carefully followed. All these phenomena proceed from the same cause, namely 
Earth s motion. . . . That there are no such phenomena for the fixed stars proves 
their immeasurable distance, compared to which even the size of the Earth’s orbit 
is negligible and the parallactic effect unnoticeable ” 

Thomas Digges, 1546-1595 [in a Perfit Description of the Caelestiall Orbes accord¬ 
ing to the most aunciente doctrine of the Pythagoreans, latelye reuiued by Copernicus 
and by Geometricall Demonstrations approued (1576), the principal vehicle by which 
Copernicus reached England, as quoted in Johnson (1937)]: “Of whiche lightes 
Celestiall it is to bee thoughte that we onely behoulde sutch as are in the inferioure 
partes of the same Orbe, and as they are hygher, so seeme they of lesse and lesser 
quantity, euen tyll our sighte beinge not able farder to reach or conceyue, the greatest 
part rest by reason of their wonderfull distance inuisible vnto vs.” 

Giordano Bruno, bom ca. 1548, burned at the stake in the Campo dei Fiori 
in Rome, February 17, 1600 [from On the Infinite Universe and Worlds, written on 
a visit to England in 1583-1585, as translated by Singer (1950)]: “Thus let this surface 
be what it will, I must always put the question, what is beyond? If the reply is 
NOTHING, then I call that the VOID or emptiness. And such a Void or Emptiness 
hath no measure and no outer limit, though it hath an inner; and this is harder 
to imagine than is an infinite or immense universe. . . . There are then innumerable 
suns, and an infinite number of earths revolve around those suns, just as the seven 
we can observe revolve around this sun which is close to us.” 

Johann Kepler established the laws of elliptic orbits and of equal areas (1609), 
and established the connection between planetary periods and semimajor axes (1619). 

Galileo Galilei observed the satellites of Jupiter and realized they provided support 
for Copernicantheory, and interpreted the Milky Way as a collection of stars (1610). 

In 1638 he wrote: 

“Salvati . Now what shall we do, Simplicio, with the fixed stars? Do we want to 
sprinkle them through the immense abyss of the universe, at various distances from 
any predetermined point, or place them on a spherical surface extending around 
a center of their own so that each of them will be at the same distance from that 
center? 

“Simplicio . I had rather take a middle course, and assign to them an orb described 
around a definite center and included between two spherical surfaces . . ” 

Isaac Newton (1687): “Gravitation toward the sun is made up out of the gravita¬ 
tions toward the several particles of which the body of the sun is composed, and 
in receding from the sun decreases accurately as the inverse square of the distances 
as far as the orbit of Saturn, as evidently appears from the quiescence of the aphelion 
of the planets.” 

Isaac Newton [in a letter of Dec. 10, 1692, to Richard Bentley, quoted in Munitz 
(1957)]: If the matter of our sun and planets and all the matter of the universe 
were evenly scattered throughout all the heavens, and every particle had an innate 
gravity toward all the rest, and the whole space throughout which this matter was 
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scattered was but finite, the matter on the outside of this space would, by its gravity, 
tend toward all the matter on the inside and, by consequence, fall down into the 
middle of the whole space and there compose one great spherical mass. But if the 
matter was evenly disposed throughout an infinite space, it could never convene 
into one mass; but some of it would convene into one mass and some into another, 
so as to make an infinite number of great masses scattered at great distances from 
one to another throughout all that infinite space. And thus might the sun and fixed 
stars be formed.” 

Christiaan Huygens, 1629-1695 [in his posthumously published Cosmotheoros 
(1698)]: “Seeing then that the stars . . . are so many suns, if we do but suppose 
one of them [Sirius, the Dog-star] equal to ours, it will follow [details, including 
telescope directed at sun; thin plate; hole in it; comparison with Sirius] . . . that 
his distance to the distance of the sun from us is as 27,664 to 1. . . . Indeed it seems 
to me certain that the universe is infinitely extended.” 

Edmund Halley (1720): “If the number of the Fixt Stars were more than finite, 
the whole superficies of their apparent Sphere [i.e., the sky] would be luminous” 
[by today’s reasoning the same temperature as the surface of the average star; this 
is known today as Olber’s paradox, or the paradox of P. L. de Cheseaux (1744) and 
Heinrich Wilhelm Matthias Olbers (1826)]. 

Thomas Wright of Durham (1750): “To . . . solve the Phaenomena of the Via 
Lactea . . . granted . . . that the Milky Way is formed of an infinite number of small 
Stars . . . imagine a vast infinite gulph, or medium, every way extended like a plane, 
and inclosed between two surfaces, nearly even on both sides. . . . Now in this space 
let us imagine all the Stars scattered promiscuously, but at such an adjusted distance 
from one another, as to fill up the whole medium with a kind of regular irregularity 
of objects. [Considering its appearance] “to an eye situated . . . anywhere about the 
middle plane” ... all the irregularity we observe in it at the Earth, I judge to be 
entirely owing to our Sun’s position . . . and the diversity of motion . . . amongst 
the stars themselves, which may here and there . . . occasion a cloudy knot of stars.” 

Immanuel Kant, 1724-1804 (1755): “It was reserved for an Englishman, Mr. Wright 
of Durham, to make a happy step ... we will try to discover the cause that has 
made the positions of the fixed stars come to be in relation to a common plane. . . . 
granted . . . that the whole host of [the fixed stars] are striving to approach each 
other through their mutual attraction . . . ruin is prevented by the action of the 
centrifugal forces . . . the same cause [centrifugal force] . . . has also so directed 
their orbits that they are all related to one plane. . . . [The needed motion is calcu¬ 
lated to be] one degree [or less] in four thousand years; . . . careful observers . . . 
will be required for it. . . . Mr. Bradley has observed almost imperceptible displace¬ 
ments of the stars” [known from later work to be caused by aberration (effect of 
observer velocity) rather than real parallax (effect of position of observer)]. 
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Asks for the first time how a very remote galaxy would appear: “circular if its 
plane is presented directly to the eye, and elliptical if it is seen from the side or 
obliquely. The feebleness of its light, its figure, and the apparent size of its diameter 
will clearly distinguish such a phenomenon when it is presented, from all the stars 
that are seen single. . . . this phenomenon . . . has been distinctly perceived by 
different observers [who] . . . have been astonished at its strangeness. . . . Analogy 
thus does not leave us to doubt that these systems [planets, stars, galaxies] have 
been formed and produced . . . out of the smallest particles of the elementary matter 
that filled empty space.” 

Goes on to consider seriously “the successive expansion of the creation [of planets, 
stars, galaxies] through the infinite regions of space that have the matter for it. . . . 
attraction is just that universal relation which unites all the parts of nature in one 
space. It reaches, therefore, to . . . all the distance of nature’s infinitude.” 

Johann Heinrich Lambert, 1728-1777 (1761): “The fixed stars obeying central 
forces move in orbits. The Milky Way comprehends several systems of fixed stars. 
Each system has its center, and several systems taken together have a common center. 
Assemblages of their assemblages likewise have theirs. In fine, there is a universal 
center for the whole world round which all things revolve.” [First spelling out of 
a “hierarchical model” for the universe, later taken up by C. V. I. Charlier and by 
H. Alfven and O. Klein (1962); see also O. Klein (1966 and 1971)]. 

Auguste Comte (1835) concluded that it is meaningless to speak of the chemical 
composition of distant stars because man will never be able to explore them; “the 
field of positive philosophy lies wholly within the limits of our solar system, the 
study of the universe being inaccessible in any positive sense.” 

The first successful determination of the parallax [1 second of parallax: 1 pc = 
3.08 x 10 18 cm = 3.26 lyr] of any star was made in 1838 (for a Centauri by Hen¬ 
derson, for a Lyrae by Struve, and for 61 Cygni by Bessel). 


B. The Twentieth Century . 

Derivation by James Jeans in 1902 of the critical wavelength that separates short- 
wavelength acoustical modes of vibration of a hot primordial gas and longer wave¬ 
length modes of commencement of gravitational condensation of this gas. Applica¬ 
tion of these considerations by P. J. E. Peebles and R. H. Dicke in 1968 to explain 
why globular star clusters have masses of the order of 10 5 A/ o . 

Investigations of cosmic rays from first observation by V. F. Hess and 
W. Kolhorster in 1911-1913 to date; determination that the energy density in 
interstellar space (in this galaxy) is about 1 eV/cm 3 or 10^ 12 erg/cm 3 , comparable 
to the density of energy of starlight, to the kinetic energy of clouds of ionized 
interstellar gas, averaged over the galaxy, and to the energy density of the interstellar 
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magnetic field (-10 -5 gauss). In connection with this equality, see especially 
E. Fermi (1949). 

Discovery by Henrietta Leavitt in 1912 that there is a well-defined relation between 
the period of a Cepheid variable and its absolute luminosity. 

First determination of the radial velocity of a galaxy by V. M. Slipher in 1912: 
Andromeda approaching at 200 km/sec. Thirteen galaxies investigated by him by 
1915; all but two receding at roughly 300 km/sec. 

Albert Einstein (1915d): Interpreted gravitation as a manifestation of geometry; 
gave final formulation of the law that governs the dynamic development of the 
geometry of space with the passage of time. 

Albert Einstein (1917): Idealized the universe as a 3-sphere filled with matter at 
effectively uniform density; the radius of this 3-sphere could not be envisaged as 
static without altering his standard 1915 geometrodynamic law; for this reason 
Einstein introduced a so-called “cosmological term,” which he later dropped as “the 
biggest blunder” in his life [Gamow (1970)]. 

Formulation by W. de Sitter in 1917 of a cosmological model in which (1) the 
universe is everywhere isotropic (and therefore homogeneous) and (2) the universe 
does not change with time, so that the mean density of mass-energy and the mean 
curvature of space are constant, but in which perforce (3) a cosmological term 
(“repulsion”) of the Einstein type has to be added to balance the attraction of the 
matter. Observation by de Sitter that he could obtain another static model by 
removing all the matter from the original model, but that the A-term would cause 
test particles to accelerate away from one another. 

From 1917 to 1920, debate about whether spiral nebulae are mere nebulous objects 
(Harlow Shapley) or are “island universes” or galaxies similar to but external to 
the Milky Way (H. D. Curtis). 

Discovery by Harlow Shapley in 1918, by mapping distribution of about 100 
globular clusters of this Galaxy (10 4 to 10 6 stars each) in space that center is in 
direction of Sagittarius (present value of distance from sun —10 kpc). 

Independent derivation of evolving homogeneous and isotropic cosmological 
models [also leading to the relation v — H* (distance)] by A. Friedmann in 1922 
and G. Lemaitre in 1927, with Lemaitre tieing in his theoretical analysis with the 
then-ongoing Mt. Wilson work, to become the “father of the big-bang cosmology”. 
(Universe, however, taken to expand smoothly away from Einstein’s static A > 0 
solution in Lemaitre’s original paper). 

Remark by H. Weyl in 1923 that test particles in de Sitter model will separate 
at a rate given by a formula of the form v = H* (distance). 

In 1924, resolution of debate about nature of spiral nebulae by Edwin P. Hubble 
with Mount Wilson 100-inch telescope; discovery of Cepheid variables in Andromeda 
and other spiral nebulae, and consequent determination of distances to these nebulae. 
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Determination by Jan Oort in 1927 of characteristic 
of stars near sun, 


pattern of radial velocities 


Sv T = Ar cos 2(0 — 5), 


showing that. (1) axis of rotation of stars in Milky Way is perpendicular to disc; 
(2) sun makes a complete revolution in ~10 8 yr; and (3) the effective mass pulling 
on the sun required to produce a revolution with this period is of the order ~10 44 g 
or ~10 u M o . & 

Age of a uranium ore as established from lead-uranium ratio: greatest value found 
up to 1927, 1.3 X 10 9 yr (A. Holmes and R. W. Lawson). Age of the lead in the 
“average” surface rocks of the earth as calculated from time required to produce 
this lead from the uranium in the same surface rocks, 2 X 10 9 yr to 6 x 10 9 yr. Age 
of elemental uranium as estimated by Rutherford from time required for U 235 and 
U to decay from assumed roughly equal ratio in early days to known very unequal 
ratio today, ~3 X 10 9 yr. 

Establishment by Hubble in 1929 that out to 6 x 10 6 lyr the velocity of recession 
of a galaxy is proportional to its distance. 

Note by A. S. Eddington in 1930 that Einstein A > 0 static universe is unstable 
against any small increase or decrease in the radius of curvature. 

Recommendation from Einstein in 1931 hereafter to drop the so-called cosmolog¬ 
ical term. 

Proposal by Einstein and de Sitter in 1932 that one tentatively adopt the simplest 
assumption that A = 0, that pressure is negligible, and that the reciprocal of the 
square of the radius of curvature of the universe is neither positive nor negative 
(spherical or hyperbolic universe) but zero (“cosmologically flat”), thus leading to 
the relation p = 3H 2 /Sir (in geometric units). 

Evidence uncovered by Grote Reber in 1934 for the existence of a discrete radio 
source in Cygnus; evidence for this source, Cygnus A, firmed up by J. S. Hey, S. J. 
Parsons, and J. W. Phillips in 1946; six other discrete radio sources, including 
Taurus A and Centaurus A, discovered by J. G. Bolton in 1948. 

Discovery by E. A. Milne and W. H. McCrea in 1934 of dose correspondence 
between Newtonian dynamics of a large gas cloud and Einstein theory of a dynamic 
universe, with the scale factor of the expansion satisfying the same equation in both 
theories, so long as pressure is negligible. 

Demonstration by H. P. Robertson and by A. G. Walker, independently, in 1935 
that the Lemaitre type of line element provides the most general Riemannian 
geometry compatible with homogeneity and isotropy. 

Classification of nebulae as spiral, barred spiral, elliptical, and irregular bv Hubble 
in 1936. J 

First detailed theory of thermonuclear energy generation in the sun, H. A Bethe 
1939. 
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Reasoning by George Gamow in 1946 that matter in the early universe was dense 
enough and hot enough to undergo rapid thermonuclear reaction, and that energy 
densities were radiation-dominated. 

Proposal of so-called “steady-state cosmology” by H. Bondi, T. Gold, and 
F. Hoyle in 1948, lying outside the framework of Einstein’s standard general rela¬ 
tivity, with “continuous creation of matter” taking place throughout the universe, 
and the mean age of the matter present being equal to one third of the Hubble 
time. 

Prediction by R. A. Alpher, H. A. Bethe, and G. Gamow in 1948 that the black- 
body radiation that originally filled the universe should today have a Planck spec¬ 
trum corresponding to a temperature of 25 K. Independent conception of same idea 
by R. H. Dicke in 1964 and start of an experimental search for this primordial 
cosmic-fireball radiation. Discovery of unwanted and unexpected 7 cm background 
radiation in 1965 by A. A. Penzias and R. W. Wilson with a temperature of about 
3.5 K; immediate identification of this radiation by Dicke, P. J. E. Peebles, P. G. 
Roll, and D. T. Wilkinson as the expected relict radiation. 

Radio sources Taurus A, Virgo A, and Centaurus A tentatively and, as it later 
proved, correctly identified with the Crab Nebula and with the galaxies NGC 4486 
and NGC 5128 by J. G. Bolton, G. J. Stanley, and O. B. Slee in 1949. 

Analysis by Lemaitre in 1950 of big-bang expansion approaching very closely the 
Einstein static universe (A > 0) and then, at first slowly, subsequently more and 
more rapidly, going into exponential expansion. 

Discovery by Walter Baade in 1952 that there are two types of Cepheid variables 
with different period-luminosity relations; consequent increase in Hubble distance 
scale by factor of about 2.6, and a corresponding increase in the original value 
(roughly 2 x 10 9 yr) of the Hubble time, H~ l . 

Identification of radio source Cygnus A by W. Baade and R. Minkowski in 1954 
with the brightest member of a faint cluster of galaxies, contrary to the then widely 
held view that the majority of radio sources lie within the Milky Way. Determination 
of redshift in the optical spectrum of 8\/X = z = 0.057 by Minkowski, implying for 
Cygnus A a distance of 170 Mpc and a radio luminosity of 10 45 erg/sec, 10 7 times 
the radio power and ten times the optical power of a normal galaxy. 

Resolution of radio source Cygnus A in 1956 into two components symmetrically 
located on either side of the optical galaxy, the first indication that most radio sources 
are double. Still unsolved is the mystery of the explosion or other mechanism that 
caused this and other double sources. 

Calculation by G. R. Burbidge in 1956 of the kinetic energy in the electrons giving 
off synchrotron radiation in a radio galaxy and the energy of the magnetic field 
that holds these electrons in orbit; minimization of the sum of these two energies; 
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determination that this minimum is of the order of 10 60 ergs (energy of annihilation 
of half a million suns) for Hercules A, for example. 

Solar system determined to have an age of 4.55 x 10 9 yr or more from relative 
abundances of pb 204 > 206 - 207 and U 235 ’ 238 in meteorites and oceanic sediments by 
C. Patterson in 1956; and by others in 1965 and 1969 from evidence on the processes 
Rb 87 —Sr 87 and K 40 —»- A 40 in meteorites. 

Discovery by Allen Sandage in 1958 that what Hubble had identified in distant 
galaxies as bright stars were HII regions, clumps of hot stars surrounded by a plasma 
ionized by stars, and consequent upping of Hubble’s distance scale by a further factor 
of about 2.2. 

Estimation by Jan Oort in 1958, from luminosity of other galaxies, that matter 
in galaxies contributes to the density of mass-energy in the universe roughly 
3 x 10 -31 g/cm 3 [see Peebles (1971) for updated analysis], this being one or two 
orders of magnitude less than that called for by Einstein’s concept that the universe 
is curved up into closure, and thereby giving rise to “the mystery of the missing 
matter,” the focus of much present-day research. 

Discovery of celestial (nonsolar) X-rays in 1962 by Giacconi, Gursky, Paolini, and 
Rossi. Majority of sources in plane of the Milky Way, presumably local to this galaxy, 
as is the Crab nebula. Extragalactic sources include the radio galaxy Virgo A and 
the quasar 3C273. 

Revised “3C-catalog” of radio sources published in 1962 by A. S. Bennett, con¬ 
taining 328 sources, nearly complete in coverage between declinations —5° and 
+ 90° for sources br ighter than 9 flux units (9 x 10 -26 watt/m 2 Hz) at 178 MHz. 

Identification of the first quasistellar object (QSO) by Maarten Schmidt at Mt. 
Palomar in 1963: radio-position determination of 3C273 to better than 1 second 
of arc by C. Hazard, M. B. Mackey, and A. J. Shimmins in 1962, followed by 
Schmidt’s taking an optical spectrum of the star-like source and, despite all pre¬ 
sumptions that it was a star in this galaxy, trying to fit it, and succeeding, with a 

redshift of the magnitude (unprecedented for a “star”) of SA A = z = 0.158. Distance 

implied by Hubble relation, 1.5 x 10 9 lyr; optical brightness, 100 times brightest 
known galaxy. Largest redshift of any QSO known in 1972, z = 2.88 (4C05.34; 
C. R. Lynds). Such a source detectable even if it had a redshift of 3; but no QSO’s 
known in 1972 with such redshifts. See Box 28.1. 

Reasoning by Dennis Sciama in 1964 [see also Sciama (1971)] that intergalactic 
hydrogen can best escape observation if at a temperature between 3 x 10 5 K and 
10 6 K. With as many as 10 5 protons and 10 -5 electrons per cm 3 and a temperature 
lower than 3 x 10 5 K, the number density of neutral atoms would be great enough 
and the resulting absorption of Lyman a from a distant galaxy (z = 2) would be 
strong enough to show up, contrary to observation. 

In 1964 J. E. Gunn and B. A. Peterson, E. J. Wampler, and others determined 
that, at a temperature greater than 10 6 K, the intensity of 0.25 keV or 50 A x-rays 
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from intergalactic space would be too high to be compatible with the observations. 

Emphasis by Wheeler (1964a) that the dynamic object in Einstein’s general 
relativity is 3-geometry, not 4-geometry, and that this dynamics, both classical and 
quantum, unrolls in the arena of superspace. 

Discovery by Sandage in 1965 of quasistellar galaxies (radio-quiet QSO’s). 

Discovery by E. M. Burbidge, G. R. Burbidge, C. R. Lynds, and A. N. Stockton 
in 1965 of a QSO, 3091, with numerous absorption lines, implying the coexistence 
of several redshifts in one spectrum. 

Fraction (by mass) of matter converted to helium in early few minutes of universe 
nearly independent of the relative numbers of photons and baryons, over a 10 6 range 
in values of this number ratio, so long as the universe at 10 10 K is still radiation- 
dominated. Value of this plateau helium abundance (following earlier work of 
others) first accurately calculated as 27 per cent by P. J. E. Peebles in 1966 and 
by R. V. Wagoner, W. A. Fowler, and F. Hoyle in 1967. 

Proposal by C. W. Misner in 1968 to consider as an important part of early 
cosmology the anisotropy vibrations of the geometry of space previously brought 
to attention by E. Kasner and by I. M. Khalatnikov and E. Lifshitz. [Misner’s hope 
to account naturally in this way for the otherwise so puzzling homogeneity of the 
universe was later dashed.] 

Proof on the basis of standard general relativity by S. W. Hawking, G. F. R. 

Ellis, and R. Penrose in 1968 and 1969 [see also related work of earlier investigators 
cited in Chapter 44] that a model universe presently expanding and filled with matter 
and radiation obeying a physically acceptable equation of state must have been 
singular in the past, however wanting in symmetry it is today. 

Discovery of pulsars in 1968 by Hewish, Bell, Pilkington, Scott, and Collins, and 
their interpretation as spinning neutron stars (see Chapter 24). 


"No poet, nor artist of any art , has his complete meaning alone. His 
significance, his appreciation, is the appreciation of his relation to the dead poets 
and artists. You cannot value him alone; you must set him, for contrast and 
comparison, among the dead . . . when a new work of art is created . . . 
something . . . happens simultaneously to all the works of art which preceded it. 
The existing monuments form an ideal order among themselves, which is modified 
by the introduction of the new (the really new) work of art among them ." 




j 


T. S. ELIOT (1920). 


CHAPTER 


EVOLUTION OF THE UNIVERSE 
INTO ITS PRESENT STATE 

Cosmology. . . restrains the aberrations of the mere 
undisciplined imagination. 
ALFRED NORTH WHITEHEAD {1929, p. 21) 




§28.1. THE "STANDARD MODEL" OF THE UNIVERSE 


Since the discovery of the cosmic microwave radiation in 1965, extensive theoretical 
research has produced a fairly detailed picture of how the universe probably evolved 
into its present state. This picture, called the “standard hot big-bang model” of the 
universe, is sketched in the present chapter, and its main features appear in Figure 
28.1. Gravitation dominates the over-all expansion; but otherwise most details of 
the evolution are governed much less by gravitation than by the laws of thermody¬ 
namics, hydrodynamics, atomic physics, nuclear physics, and high-energy physics. 
This fact, and the existence of three excellent recent books on the subject [Sciama 
(1971); Peebles (1972); Zel’dovich and Novikov (1974)], encourage brevity here. 

The past evolution of the universe is qualitatively independent of the nature of 
the homogeneous hypersurfaces (k = - 1 , 0, or + 1) and qualitatively independent 
of the cosmological constant, since the contributions of k and A to the evolution 
are not important in early stages of the history (small a/a 0 ) [see equation (27.40) 
and Figure 27.5], One crucial assumption underlies the standard hot big-bang model: 
that the universe “began” in a state of rapid expansion from a very nearly homoge¬ 
neous, isotropic condition of infinite (or near infinite) density and temperature. 

During the first second after the beginning, according to this analysis, the temper¬ 
ature of the universe was so high that there was complete thermodynamic equilib- 


Evolution of universe 
according to "standard hot 
big-bang model": 


(1) initial state 
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Time /. see Now (12 x I0 9 yrs.) 



Pairs recombine 1_ 

neutrinos decouple 

Primordial (■ 
elements form 


Transition from M t Galaxies 

| radiation dominance F | ^ orm 

to matter dominance] Plasma 


1 

Now 


recombines 


Matter and radiation 
in thermal equilibrium 


Most distant 
quasar observed 

(*=s3) 


Figure 28.1. 

Evolution of the universe into its present state, according to the standard hot big-bang 
model. The curves are drawn assuming 

P nw = 5 X 10 -30 g/cm 3 , p ro = 10" 33 g/cm 3 , k = 0; 

but for other values of p TO , p T0 , and k within the limits of observation, the curves are virtually 
the same (see exercise 28.1). See text and Box 28.1 for detailed discussion of the processes 
described at the bottom of the figure. [This figure is adapted from Dicke, Peebles, Roll, 
and Wilkinson (1965).] 


(2) thermal equilibrium, 
decay of particles, 
recombination of pairs 
(0 < t < 10 sec.) 


(3) decoupling and free 
propagation of gravitons 
and neutrinos (t < 1 sec.) 


rium between photons, neutrinos, electrons, positrons, neutrons, protons, various 
hyperons and mesons, and perhaps even gravitons (gravitational waves) [see, e.g., 
Kundt (1971) and references cited therein]. However, by the time the universe was 
a few seconds old, its temperature had dropped to about 10 10 K and its density was 
down to —10 5 g/cm 3 ; so all nucleon-antinucleon pairs had recombined, all hyperons 
and mesons had decayed, and all neutrinos and gravitons had decoupled from 
matter. The universe then consisted of freely propagating neutrinos, and perhaps 
gravitons, with black-body spectra at temperatures T ~~ 10 10 K, plus electron-posi¬ 
tron pairs in the process of recombining, plus electrons, neutrons, protons, and 
photons all in thermal equilibrium at T ~~ 10 10 K. 

Since that early state, the gravitons (if present) and neutrinos have continued 
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to propagate freely, maintaining black-body spectra; but their temperatures have 
been redshifted by the expansion of the universe in accordance with the law 

T<x\/a (28.1) 

(Box 29.2). Consequently, today their temperatures should be roughly 3 K, and they 
should still fill the universe. Unfortunately, today’s technology is far from being able 
to detect such a “sea” of neutrinos or gravitons. However, if and when they can 
be detected, they will provide direct observational information about the first one 
second of the life of the universe! 

As the universe continued to expand after the first few seconds, it entered a period 
lasting from / ~ 2 seconds to t ~ 1,000 seconds ( T~~ 10 10 to ~10 9 K, p ~ 10 +5 to 
10 -1 g/cm 3 ), during which primordial element formation occurred. Before this 
period, there were so many high-energy protons around that they could blast apart 
any atomic nucleus (e.g., deuterium or tritium or He 3 or He 4 ) the moment it formed; 
after this period, the protons were too cold (had kinetic energies too low) to penetrate 
each others’ coulomb barriers, and all the freely penetrating neutrons from the 
earlier, hotter stage had decayed into electrons plus protons. Only during the short, 
crucial period from t ~~ 2 seconds to t • 1,000 seconds were conditions right for 
making elements. Calculations by Gamow (1948), by Alpher and Hermann (1948a,b; 
1950), by Fermi and Turkevitch (1950), by Peebles (1966), and by Wagoner, Fowler, 
and Hoyle (1967) reveal that about 25 per cent of the baryons in the universe should 
have been converted into He 4 (alpha particles) during this period, and about 75 
per cent should have been left as H 1 (protons). Traces of deuterium, He 3 , and Li 
should have also been created, but essentially no heavy elements. All the heavy 
elements observed today must have been made later, in stars [see, e.g., Fowler (1967) 
or Clayton (1968)]. Current astronomical studies of the abundances of the elements 
give some support for these predictions; but the observational data are not yet very 
conclusive [see, e.g., Danziger (1968) and pp. 268-275 of Peebles (1971)]. 

After primordial element formation, the matter and radiation continued to interact 
thermally through frequent ionization and recombination of atoms, keeping each 
other at the same temperature. Were the temperatures of radiation and matter not 
locked together, the radiation would cool more slowly than the matter (for adiabatic 
expansion, T r cc 1/a, but T m oc 1/a 2 ). Thus thermal equilibrium was maintained only 
by a constant transfer of energy from radiation to matter. But the heat capacity 
of the radiation was far greater than that of the matter. Therefore the energy transfer 
had a negligible effect on p r , p r , and T r . It held up the temperature of the matter 
( T m = T t) without significantly lowering the temperature of the radiation. On the 
other hand, the total mass-energy of matter was and is dominated by rest mass. 
Therefore the energy transfer had negligible influence on p m . [This circumstance 
justifies the approximation of ignoring energy transfer when passing from equation 
(27.31)to(27.32).] 

When the falling temperature reached a few thousand degrees (a/a 0 ~ 10 -3 , 
p ~ 10“ 20 g/cm 3 , t ~~ 10 5 years), two things of interest happened: the universe ceased 
to be radiation-dominated and became matter-dominated [p m = p mo (a 0 /a) 3 came 
to exceed p T = p T0 {aJ a) 4 ]; and the photons ceased to be energetic enough to keep 


(4) primordial element 
formation 

(2 sec. < t < 1,000 sec 


(5) thermal interaction of 
matter and radiation 
(1,000 sec. < t < 10 5 
years) 


(6) plasma recombination 
and transition to matter 
dominance (t — 10 5 yrs.) 
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(7) subsequent propagation 
of photons (t > Id 5 yrs.) 


(8) condensation of stars, 
galaxies and clusters (10 s 
yrs. < if < 10 9 yrs.) 


EXERCISE 


hydrogen atoms ionized, so the electrons and protons quickly recombined. That these 
two events were roughly coincident is a result of the specific, nearly conserved value 
that the entropy per baryon has in our universe: 

, (number of photons in universe) 

5 = entropy per baryon —--- : -- — 10 s . 

(number of baryons in universe) 

Why the universe began with this value of 5, rather than some other value (e.g. 
unity), nobody has been able to explain. 

Recombination of the plasma at / — 10 5 years was crucial, because it brought an 
end to the interaction and thermal equilibrium between radiation and matter (“de¬ 
coupling”). Thereafter, with very few free electrons off which to scatter, and with 
Rayleigh scattering off atoms and molecules unimportant, the photons propagated 
almost freely through space. Unless energy-releasing processes reionized the inter- 
galactic medium sometime between a/a Q — 10“ 3 and a/a 0 — 0.1, the photons have 
been propagating freely ever since the plasma recombined. Even if reionization 
occurred, the photons have been propagating freely at least since a/a 0 —0.1. 

The expansion of the universe has redshifted the temperature of the freely propa¬ 
gating photons in accordance with the equation T oz 1/a (see Box 29.2). As a 
consequence, today they have a black-body spectrum with a temperature of 2.7 K. 
They are identified with the cosmic microwave radiation that was discovered in 1965, 
and they give one direct information about the nature of the universe at the time 
they last interacted with matter (a/a 0 — 10~ 3 , t — 10 5 years if reionization did not 
occur; a/a 0 — 0.1, t — 5 X 10 s years if reionization did occur.) 

Return to the history of matter. Before plasma recombination, the photon pressure 
(“elasticity of the cosmological fluid”) prevented the uniform matter (25 per cent 
He 4 , 75 per cent H) from condensing into stars, galaxies, or clusters of galaxies. 
However, after recombination, the photon pressure was gone, and condensation could 
begin. Small perturbations in the matter density, perhaps dating back to the begin¬ 
ning of expansion, then began to grow larger and larger. Somewhere between 
a/a 0 — 1/30 and a/a 0 — 1/10 (10 s years < t < 10 9 years) these perturbations began 
developing into stars, galaxies, and clusters of galaxies. Slightly later, at a/a 0 — 1/4, 
quasars probably “turned on,” emitting light which astronomers now receive at Earth 
(see Box 28.1). 


Exercise 28.1. UNCERTAINTY IN EVOLUTION 

Current observations, plus the assumption of complete homogeneity and isotropy at the 
beginning of expansion, plus the assumption that the excess of leptons over antileptons is 
less than or of the order of the excess of baryons over antibaryons, place the following limits 
on the cosmological parameters today: 

Matter density today = between 10” 28 and 2 X 10~ 31 g/cm 3 ; 
k — 0 or -h 1 or — 1; 

temperature of electromagnetic radiation today = 2.7 ±0.1 K. 

Total radiation density [observed photons, plus neutrinos and gravitons that presumably 
originated in big bang in thermal equilibrium with photons] = p TO , between 0.7 x 10 -33 
and 1.2 x 10“ 33 g/cm 3 . 


(i continued on page 769) 
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Box 28.1 EVOLUTION OF THE QUASAR POPULATION 

That the large-scale, average properties of the universe are changing markedly with 
time one can infer from quasar data. In brief, there appear to have been about 50 
times more quasars in the universe at a redshift z^2 than at z zz 0.5; and there 
may well have been fewer, or none, at redshifts z > 3. (On the use of redshift to 
characterize time since the big bang, see Box 29.3.) In greater detail, Schmidt (1972) 
gives the following analysis of the data:* 

1. Schmidt assumes from the outset that quasar redshifts are cosmological in origin 
[redshift = (Hubble constant) x (distance); §29.2]. The evidence for this is 

a. Observational: Some quasars are located in clusters of galaxies [as evidenced 
both by position on sky and by quasar having same redshift as galaxies in 
cluster; see Gunn (1971)]. Since the evidence for the cosmological distance- 
redshift relation for galaxies is overwhelming (Boxes 29.4 and 29.5), the 
redshifts of these particular quasars must be cosmological. 

b. Theoretical: Observed quasar redshifts of z 1 to 3 cannot be gravitational 
in origin; objects with gravitational redshifts larger than z ^ 0.5 are unstable 
against collapse (see Chapters 24 and 26 and Box 25.9). Nor are the quasar 
redshifts likely to be Doppler; how could so massive an object be accelerated 
t° v ~ 1 without complete disruption? The only remaining possibility is a 
cosmological redshift. For this reason, opponents of the cosmological hypothe¬ 
sis usually feel pressed to invoke in the quasars a breakdown of the laws of 
physics as one understands them today. [See, e.g., Arp (1971) and references 
cited therein. These references also describe evidence against the cosmological 
assumption, evidence that a few prominent investigators find compelling, but 
that most do not as of 1972.] 

2. Schmidt then asks how many quasars, N, there were in the universe at a time 
corresponding to the redshift z, and with absolute luminosity per unit frequency, 

L,(2,500 A) at the wavelength 2500 A as measured in the quasar’s local Lorentz 
frame. 

3. The data on quasars available in 1972 are not at all sufficient to determine 

L v ( 2,500 A)] uniquely. But they are sufficient to show unequivocally that: 

a. There must have been evolution; N(z, L v ) cannot be independent of z. 

b. The evolution cannot have resided primarily in the luminosities: the total 
number of quasars, 

N tot (z)= 2 A(z,L„) 

L,(2,500A) 


must have changed markedly with time (with z). 


Our version of Schmidt s (1972) argument is oversimplified. The reader interested in greater precision 
should consult his original paper. 



Box 28.1 (continued) 

c. If the evolution was primarily in the total number, A tot (z), i.e., if the changes 
in the relative luminosity distribution at 2,500 A 

/(r, L,) = [l/NJft]N(z, L v ) 

were negligible, and if the universe today is characterized by o 0 = q 0 = 1 (see 
Chapter 29 for notation), then the data show 

NJiz = 2)=s 50iV tot (z = 0.5). 

This steep increase in number as one goes backward in time—and all other 
basic features of the observed quasar redshift and magnitude distributions for 
z < 2—can be fit in a universe with <j 0 = q 0 = 1 by either of the evolution 
laws 

N tot (z) oc (1 + z) 6 , 

N ioi [z{t)\ oc lO 5 ^- 0 ^. 

Here t 0 is the current age of the universe and t was the age at redshift z. 

d. These evolution laws, when extrapolated beyond a redshift z ^2 and when 
combined with the observed relative luminosity function f(z, L v ) for quasars 
near apparent magnitude 18, predict that an observer on Earth should see the 
following fractions of nineteenth and twentieth-magnitude quasars to have 
redshifts greater than 2.5: 


evolution law 

fraction with z > 2.5 


m as 19 

m = 20 

(1 + zf 

29% 

49% 

j05(t o -«/t o 

12% 

14% 


In 1972 about 30 quasars fainter than m = 18.5 are known, and of these only 
1 (3%) has z > 2.5. This shows, in Schmidt’s words, “that the density law 
(1 + z) 6 cannot persist beyond a redshift of around 2.5.” Schmidt regards the 
]Q5tt 0 -o/to law (which becomes nearly constant at z > 2.5) to be also in 
apparent conflict with the observations, but he says that “further spectroscopic 
work on faint quasars is needed to confirm this suspicion.” 

One reason for caution is the difficult problem of removing “observational selection 
effects” from the data. Schmidt, Sandage, and others have independently searched 
for selection effects that might produce an artificial apparent decrease in the number 
of quasars atz > 2.5. None have been found. In the words of Sandage (1972d) “The 
apparent cutoff in quasar redshifts near z = 2.8 [has been] examined for selection 
effects that could produce it artificially. If the cutoff is real, it may be the time of 
the birth of the first quasars, although the suggested redshift is unexpectedly small. 
Atz = 3ina^ 0 = 1 universe, the look-back time is 89 per cent of the Friedmann 
age. Assessment of the observational selection effects shows that none are positively 
established that could produce the cutoff artificially.” 
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(The uncertainties taken into account in p ro are uncertainty about whether quadrupole 
moments at early times were sufficient to create gravitons at the full level corresponding 
to thermal equilibrium, and uncertainty about the number and statistical weights of particle 
species in equilibrium at the time gravitons decoupled.) Use the equations in §27.10 to 
calculate the uncertainties in the evolutionary history (Figure 28.1) caused by these uncer¬ 
tainties in the present state of the universe. 


§28.2. STANDARD MODEL MODIFIED FOR 
PRIMORDIAL CHAOS 

The standard hot big-bang model is remarkably powerful and accords well with 
observations (primordial helium abundances; existence, temperature, and isotropy 
of cosmic microwave radiation; homogeneity and isotropy of universe in the large; 
close accord between age of universe as measured by expansion and ages of oldest 
stars;...). However, in 1972 it encounters apparent difficulty with one item: the 
origin of galaxies. In a universe that is initially homogeneous and isotropic it is not 
clear that random fluctuations will give rise (after plasma recombination) to pertur¬ 
bations in the density of matter of sufficient amplitude to condense into galaxies. 

The perturbations that eventually form galaxies might have to reside in the initial, 
exploding state of the universe. [See ZeFdovich and Novikov (1974) for detailed 
review and discussion; see also references cited in §30.1.] 

Is it reasonable to assume a small amount of initial inhomogeneity? Is it not much 
more reasonable to assume either perfect homogeneity (one extreme) or perfect chaos 
(the other extreme)? 

Thus, if perfect initial homogeneity turns out to be incompatible with the origin What if the universe began 
of galaxies, it is attractive to try “perfect initial chaos”—i.e., completely random chaotic? 
initial conditions, with a full spectrum of fluctuations in density, entropy, and local 
expansion rate [Misner (1968, 1969b)]. It is conceivable, but far from proved, that 
during its subsequent evolution such a model universe will quickly smooth itself 
out by natural processes (Chapter 30) such as “Mixmaster oscillations,” neutrino- 
induced viscosity [see, e.g., Matzner and Misner (1971)], and gravitational curva¬ 
ture-induced creation of particle pairs [ZePdovich (1972)]. Will one be left, after 
a few seconds or less, with a nearly homogeneous and isotropic, Friedmann universe, 
containing just enough remaining perturbations to condense eventually into galaxies? 

Theoretical calculations have not yet been carried far enough to give a clear answer. 

Of course, after the initial chaos subsides, if it subsides, such a model universe will 
evolve in accord with the standard big-bang model of the last section. 


§28.3. WHAT "PRECEDED" THE INITIAL SINGULARITY? 

No problem of cosmology digs more deeply into the foundations of physics than 
the question of what “preceded” the “initial state” of infinite (or near infinite) density, 
pressure, and temperature. And, unfortunately, no problem is farther from solution 
in 1973. 
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The initial singularity and 
quantum gravitational effects 


Cosmologies that violate 
general relativity 


General relativity predicts, inexorably, that even if the “initial state” was chaotic 
rather than smooth, it must have involved a spacetime “singularity” of some sort 
[see Hawking and Ellis (1968); also §34.6 of this book]. And general relativity is 
incapable of projecting backward through the singularity to say what “preceded” 
it. Perhaps only by coming to grip with quantum gravitational effects (marriage of 
quantum theory with classical geometrodynamics) will one ever reach a clear under¬ 
standing of the initial state and of what, if anything, “preceded” it [see Misner 
(1969c), Wheeler (1971c)]. For further discussion of these deep issues, see §§34.6, 
43.4, the final section of Box 30.1, and Chapter 44. 


§28.4. OTHER COSMOLOGICAL THEORIES 

This book confines attention to the cosmology of general relativity. If one were to 
abandon general relativity, one would have a much wider set of possibilities, includ¬ 
ing (1) the steady-state theory [Hoyle (1948); Bondi and Gold (1948)], which has 
not succeeded in accounting for the cosmic microwave radiation or in explaining 
observed evolutionary effects in radio sources and quasars [Box 28.1]; (2) the Klein- 
Alfven “hierarchic cosmology” of matter in an asymptotically flat spacetime [Alfven 
and Klein (1962), Alfvdn (1971), Klein (1971), Moritz (1969), de Vaucouleurs (1971)], 
which disagrees with cosmic-ray and gamma-ray observations [Steigman (1971)]; 
and the Brans-Dicke cosmologies [Dicke (1968), Greenstein (1968a,b), Morganstern 
(1973)], which are qualitatively the same and quantitatively almost the same as the 
standard hot big-bang model. However, no motivation or justification is evident for 
abandoning general relativity. The experimental basis of general relativity has been 
strengthened substantially in the past decade (Chapters 38-40); and the standard 
big-bang model of the universe predicted by general relativity accords remarkably 
well with observations—far better than any other model ever proposed! 



CHAPTER 


■\ 


PRESENT STATE AND FUTURE 
EVOLUTION OF THE UNIVERSE 

--- J 


§29.1. PARAMETERS THAT DETERMINE THE FATE 
OF THE UNIVERSE 

Will the universe continue to expand forever; or will it slow to a halt, reverse into 
contraction, and implode back to a state of infinite (or near infinite) density, pressure, 
temperature, and curvature? The answer is not yet known for certain. To discover 
the answer is one of the central problems of cosmology today. 

The only known way to discover the answer is to measure, observationally, the 
present state of the universe; and then to calculate the future evolution using 
Einstein’s field equations. The field equations have already been solved in §§27.10 
and 27.11. From those solutions one reads off the following correlation between the 
present state of the universe and its future. 

If A = 0 [in accord with Einstein’s firmly held principle of simplicity]: 

Expansion forever -*=>- negative or zero spatial curvature for hypersurfaces of 
homogeneity, i.e., k/a 0 2 < 0 (“open” or “flat”); 

Recontraction c » positive spatial curvature for homogeneous hypersurfaces, 
i.e„ k/a 2 > 0 (“closed”). 

If A 5 * 0: 

Expansion forever -«=>- A > A = f ® if k < 0, 

- crlt l (4wp m 0 a 0 3 ) -2 if A: > 0; 

Recontraction -<=*- A < yl crU . 

Evidently three parameters are required to predict the future: the cosmological 
constant, A; the curvature parameter today for the hypersurface of homogeneity, 
k/a 0 2 ; and the density of matter today, p^. (To extrapolate into the past, as was 
done in the last chapter, one needs, besides these quantities, the radiation density 
today, p ro . But p ro is so small now and is getting smaller so fast (p r oc a -4 ; p m oc a -3) 
that it can have no influence on the decision between the possibilities just listed. 


- 

This chapter is entirely 
Track 2. Chapter 27 (idealized 
cosmological models) is needed 
as preparation for it, but this 
chapter is not needed as 
preparation for any later 
chapter. 

- / 


Expansion forever vs. 
recontraction of universe 


Parameters required to 
predict future of universe: 

(1) "relativity parameters" 
A, k/a n 2 , p^ 

' ‘O' rmo 
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The task of predicting the future, then, reduces to the task of measuring the “rela¬ 
tivity parameters” A n k/a 0 2 , and p m0 . 

In tackling this task, observational cosmologists prefer to replace the three “rela¬ 
tivity parameters,” which have immediate significance for relativity theory, by 
(2) "observational parameters that are more directly observable. One parameter close to the observa- 

parameters H 0 . q Q , o 0 tions is the Hubble expansion rate today , i.e., the “ Hubble constant” 

H 0 = ^ t /a) 0 . (29.1a) 

Another is the dimensionless “ deceleration parameter” today, q 0 , defined by 


q<> = 


\tt 


a H‘< 



And a third is the dimensionless “ density parameter” today, 


— 4ff Pmo 

o - 3H 2 ■ 


(29.1b) 


(29.1c) 


(3) relationship between 
relativity parameters and 
observational parameters 


The relationships between these three “observational parameters” and the three 
“relativity parameters” A , k/a 2 , and p^ (together making six “cosmological param¬ 
eters”) can be calculated by combining definitions (29.1) with the Einstein field 
equations (27.39), which, evaluated today, say 


tj 2 A: , A ,87T 

H 0 ~ - - + — + —Pmo > 

a 0 j j 

- 2 q 0 H* = -H* - + A 

a o 


(29.2) 


By combining these equations, one finds the relationships shown in Box 29.1, where 
the implications of several values of o 0 and q 0 are also shown. 


EXERCISE Exercise 29.1. IMPLICATIONS OF PARAMETER VALUES 

Derive the results quoted in Box 29.1. 


§29.2. COSMOLOGICAL REDSHIFT 

One of the key pieces of observational data used in measurements of H 0 , q 0 , and 
Observed features of o 0 is the cosmological redshift: spectral lines emitted by galaxies far from Earth and 

cosmological redshift received at Earth are found to be shifted in wavelength toward the red. For example, 

the [O II]A3727 line, when both emitted and observed in an Earth-bound laboratory, 
has a wavelength of 3727 A. However, when it is emitted by a star in the galaxy 
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Box 29.1 OBSERVATIONAL PARAMETERS COMPARED TO RELATIVITY PARAMETERS 




A. Relativity Parameters 

1. Matter density today, 

Pmo 

2 . Curvature of hypersurface of homogeneity 
today, 

k/a 0 2 

3. Cosmological constant, 

A 

4. Radiation density today, p ro (unimportant 
for the present dynamics of the universe, 
and therefore ignored in this chapter) 


B. Observational Parameters 


1. Hubble constant (Hubble expansion rate 
today), 

H 0 = (a t /a) 0 

2. Deceleration parameter, 


3. Density parameter, 

_ 4ff Pmo 
3 H} 


C. Observational Parameters in Terms 
of Relativity Parameters 


H 0 2 = ( 8 w /3)P m0 - k/a g 2 + A/3, 

(1) 

(4w/3)p mo - A/3 

0 (8w/3)p mo - k/a 2 + A/3 ' 

( 2 ) 

(4w/3 ) Pmo 

0 (8^/3)p mo - k/a 2 + A/3 ' 

(3) 

Relativity Parameters in Terms 
of Observational Parameters 


Pmo = (3/4 v)H 0 2 o 0 , 

(4) 

k/a 2 = H 2 (3a 0 - q 0 - 1), 

(5) 

A = 3 H 2 (a 0 - q 0 ). 

( 6 ) 


E. Implications of Specific Parameter Values 


1 . A = 0 (in accord with Einstein’s point of view) if and only if o 0 = q 0 . 

2. Sign of A is same as sign of o 0 — q 0 . 
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Box 29.1 ( continued ) 


3. If A = 0 


(a) q B > 


(b) q 0 = 


=1 h2 —.On / positive curvature; \ 

crit 8w 0 \“closed” universe / 


r'mo ' ^crit — g^ “o ,v w 

universe will eventually recontract; 


_ ___ 3 TJ 2 

Pmo Pcrit = g^ 


k = 0 


'zero curvature; 
v‘flat” universe 


universe will expand forever; 


^ 1 „ ^ 3 rT o /negative curvature; \ 

(c) Cf K — -<=>- Pmo P or ;* = ~—.// 0 /c ^ 0 I ,, ) 

7 ™ 2 1 8?r \ open universe / 


universe will expand forever. 


4. If yl ^ 0 


(a) a 0 > |( 9o + 1 ) k > 0 

and in this case, 


( positive curvature;\ 
“closed” universe / 


1 / g + 1 \ 3 

o 0 ~ un i verse will expand forever, 

U 0 \ 3 / 

1 / <7 + 1 \3 

a 0 — < —-^a o — —j universe will eventually recontract; 


(b) a 0 = |( ?0 + 1) k = 0 
and in this case, 


( zero curvature;\ 
“flat” universe / 


o 0 > q o -«=>- universe will expand forever, 
a 0 < q o -<=>- universe will eventually recontract; 


(C) a 0 < ±(q 0 + l)^k<0 


and in this case. 


/negative curvature; \ 
\“open” universe / 


a o > q o -<=>- universe will expand forever, 
o 0 < q 0 -<=►* universe will eventually recontract. 
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3C 295 (presumably with the same wavelength, A em = 3727 A) and received at Earth, 
it is measured here to have the wavelength A rec = 5447 A. The fractional change 
in wavelength is 

z = (A rec - A em )/A em = 0.4614 for 3C 295. (29.3) 



The cosmological redshift is observed to affect all spectral lines alike, and not 
only lines in the visible spectrum. Thus, the 21-cm line of hydrogen, with 400,000 
times the wavelength of the central region of the visible, undergoes a redshift that 
agrees (within the errors of the measurements) with the redshifts of lines in the visible 
for recession velocities of the order of v ~~ 0.005, according to observation of thirty 
objects by Dieter, Epstein, Lilley, and Roberts (1962) and further observations by 
Roberts (1965). 

No one has ever put forward a satisfactory explanation for the cosmological 
redshift other than the expansion of the universe (see below). The idea has been Why redshift cannot be due 

proposed at various times by various authors that some new process is at work (“tired t0 ” tired light" 

light”) in which photons interact with atoms or electrons on their way from source 
to receptor, and thereby lose bits and pieces of their energy. Ya. B. Zel’dovich (1963) 
gives a penetrating analysis of the difficulties with any such ideas: 


(1) “If the energy loss is caused by an interaction with the intergalactic matter, it is 
accompanied by a transfer of momentum; that is, there is a change of the direction 
of motion of the photon. There would then be a smearing out of images; a distant 
star would be seen as a disc, not a point, and that is not what is observed.” (2) “Let 
us suppose that the photon decays, y — y' + k , giving up a small part of its energy 
to some particle, k. It follows from the conservation laws that k must move in the 
direction of the photon (this, by the way, avoids a smearing out), and must have zero 
rest mass. Because of the statistical nature of the process, however, some photons would 
lose more energy than others, and there would be a spectral broadening of the lines, 
which is also not observed.” 


(3) If there does exist any such decay process, then simple arguments of special 
relativity that Zel’dovich attributes to M. P. Bronshtein, and spells out in detail, 
demand the relationship 

( probability per 
second of 
“photon decay” 


( a universal constant with\ 

__ the dimensions sec -2 / 

(frequency of photon in sec -1 ) 


“Thus,” Zel’dovich concludes, “if the decay of photons is possible at all, those in radio 
waves must decay especially rapidly! This would mean that the Maxwell equation for 
a static electric field would have to be changed.... There is no experimental indication 
of such effects: the radio-frequency radiation from distant sources is transmitted to 
us not a bit more poorly than visible light, and the red shift measured in different 
parts of the spectrum is exactly the same .... Thus, suggestions that there is an 
explanation of the red shift other than Friedmann’s fail completely.” 
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Emission: 

atom excites n -node standing wave; 
universe small, a(i e ) = a em : 
wavelengths small. X(/ e ) = A em . 


Reception: 

universe larger, a{t T ) = a rec : 
wavelengths larger, A(/ r ) = A rec ; 
number of nodes in standing 
wave unchanged; 

n = constant — —■- ^ r — = 


Figure 29.1. 

Redshift as an effect of standing waves. The ratio of wavelengths, A rec /A em , is identical with the ratio 
of dimensions, tf rec /tf em in any closed spherically symmetrical (Friedmann) model universe. The atom 
excites an n-node standing wave in the universe. The number n stays constant during the expansion. 
Therefore wavelengths increase in the same proportion as the dimensions of the universe. One sees 
immediately in this way that the redshift is independent of all such details as (1) why the expansion 
came about (spherical symmetry, but arbitrary equation of state); (2) the rate—uniform or nonuni¬ 
form—at which it came about; and (3) the distance between source and receptor at emission, at reception, 
or at any time in-between. The reasoning in the diagram appears to depend on the closure of the universe 
(standing waves; k = +1 rather than 0 or — 1). That closure is not required for this simple result is 
seen from the further analysis given in the text. 



Derivation of redshift 
formula: 


A cc 


( expansion\ 
factor / 


Not the least among the considerations that lead one to accept the general recession 
of the galaxies as the explanation for the redshift is the circumstance that this general 
recession was predicted [Friedmann (1922)] before the redshift was observed [Hubble 
(1929)]. 

The cosmological redshift is easily understood (Figure 29.1) in terms of the 
standard big-bang model for the universe. A detailed analysis focuses attention on 
three processes: emission of the light, propagation of the light through curved 
spacetime from emitter to receiver, and reception of the light. Emission and reception 
\ occur in the proper reference frames (orthonormal tetrads) of the emitter and 
receiver; they are special-relativistic phenomena. Propagation, by contrast, is a 
general-relativistic process; it is governed by the law of geodesic motion in curved 
spacetime. 

In calculating all three processes—emission, propagation, and absorption—one 
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needs a coordinate system. Use the coordinates ( t, x , 9, <#>) or (tj, x , 9, <j>) introduced 
in Chapter 27; and orient the space coordinates in such a way that the paths of 
the light rays through the coordinate system are simple. This is best done by putting 
the origin of the coordinate system ( x = 0) at the Earth. Then the emitting galaxy 
will lie at some “radius” Xe and some angular position (9 e , <f> e ). The cosmological 
line element 


ds 2 = -dt 2 + a 2 (t)[d x 2 + 2 2 {d9 2 + sin 2 9 d<t> 2 )] 

= a 2 (-q)[ — d-q 2 + d x 2 + S 2 {d9 2 + sin 2 9 d<f> 2 )], (29.4a) 

( sin X if k = + 1, 

X if k = 0, (29.4b) 

sinh x if k = — 1, 

is spherically symmetric about x = 0 (i.e., about the Earth) whether k = -1, 0, or 
+1. Consequently, the geodesics (photon world lines) that pass through both Earth 
and the emitting galaxy must all be radial 


9 = 0* <t> = ^ X = X(0- (29.5) 

(One who wishes to forego any appeal to symmetry can examine the geodesic 
equation in the (t, x ,9,<t>) coordinate system, and discover that if d9/d\ = 
d4>/d\ — 0, then d 2 9/d\ 2 = d 2 <}>/d\ 2 = 0. Consequently a geodesic that is initially 
radial will always remain radial.) 

Consider, now, emission. A galaxy at rest (moving with the “cosmological fluid”) 
at (X e » 9 e , <p e ) emits two successive crests, A and B, of a wave train toward Earth 
at coordinate times t Ae and t Be . It has been arranged that proper time as measured 
on the galaxy is the same as coordinate time (t — t + const, was part of the con¬ 
struction process for the coordinate system in §27.4). Consequently the period of 
the radiation as seen by the emitter is P em = t eB — t eA ; and the wavelength is the 
same as the period when geometrized units are used: 

A em = teB ~ ‘eA- (29.6) 

Next examine propagation. Wave crests A and B propagate along null geodesics. 
This fact enables one to read the world lines of the wave crests, Xx(0 ar >d Xb( 0> 
directly from the line element (29.4): ds 2 = 0 guarantees that a(t) d x = —dt(—, not 

+, because the light propagates toward the Earth at x = 0). Consequently, the world 
lines are 

Xe ~ Xa( { or 7)) = 7) - i) eA = f a- 1 dt, (29.7) 

J ‘,a 

Xe - Xisit or T}) = 1) - i) eB = f ar l dt. 

Finally, examine reception. The receiver on Earth moves with the “cosmological 
fluid,” just as does the distant emitter. (Ignore the Earth’s “peculiar motion” relative 
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to the fluid—motion around the sun, motion around center of our Galaxy, etc.; 
it can be taken into account by an ordinary Doppler correction.) Thus, for receiver 
as for emitter, proper time is the same as coordinate time, and 

*rec = - 'M. ( 29 - 8 ) 


where t rB and t rA are the times of reception of the successive wave crests. 

Now combine equations (29.6), (29.7), and (29.8) to obtain the redshift. The 
receiver is at x = 0- Therefore equations (29.7) say 


o = Xe~ f a x dt, 

J <,A 

a - 1 dt. 

Subtract these equations from each other to obtain 



n *rB 

C * tA 

O 

II 

1 

£ 

1 

j a~ x dt 

t'B 

J t eA 

r *rB 

c * eB 

II 

C5 

1 

1 

f a- 1 dt 

t TA 

Ka 


Kb Ka 

<2(*r) 


Kb Ka . 


(29.9) 


and combine with (29.6) and (29.8) to discover 


Kec _ Km . (29.10) 

a(t r ) a(t 9 )' 


i.e., 

z EE A\/\ = a{t T )/a(t e ) - 1. (29.11) 

These redshift equations confirm the simple result of Figure 29.1: As the light 
ray propagates, its wavelength (as measured by observers moving with the “fluid”) 
increases in direct proportion to the linear expansion of the universe. The ratio of 
the wavelength to the expansion factor ; \/a, remains constant. For important applica¬ 
tions of this result, see Boxes 29.2 and 29.3. 


Exercise 29.2. ALTERNATIVE DERIVATION OF REDSHIFT 

Notice that the only part of the line element that is relevant for the light ray is 

ds 2 = -dt 2 4- a 2 (t ) dx 2 , 

since dd = d<j> = 0 along its world line (spherical symmetry!). Regard the light ray as made 


Box 29.2 COSMOLOGICAL REDSHIFT OF THE PRIMORDIAL RADIATION 


As an important application of the redshift for- in the “fluid,” the temperature of the primordial 
mula radiation is redshifted 

X/a= constant (1) Toe \/a. (2 t 


[equation (29.10)], consider the radiation emerging 
from^^he-hoLbig bang. Because it is initially in 
thermal equilibrium with matter, this primordial 
radiation initially has a Planck black-body spec¬ 
trum. Subsequent interactions with matter cannot 
change the spectrum, because the matter remains 
in thermal equilibrium with the radiation so long 
as interactions are occurring. The cosmological 
redshift can and does change the spectrum, how¬ 
ever. It was shown in exercise 22.17, using kinetic 
theory, that radiation with a Planck spectrum as 
viewed by one observer has a Planck spectrum as 
viewed by all observers; but the observed temper¬ 
ature is redshifted in precisely the same manner 
as the frequency of an individual photon is red¬ 
shifted. Consequently, as seen by observers at rest 


T cc \/a. (2) 

This is true after plasma recombination, when the 

radiation and matter are decoupled, as well as 
before recombination, when they are interacting. 
And it is true not only for the primordial photons 
but also for thermalized neutrinos and gravitons 
emerging from the hot big bang. 

There is another way to derive the redshift 
equation (2). Combine the equation 

Pr * F 4 (3) 

for the energy density of black-body radiation in 
terms of temperature, with the equation 

p T cc (volume) -473 cc (a 3 )” 473 cc a~ 4 (4) 

for the decrease of energy density with adiabatic 
expansion. 


Box 29.3 USE OF REDSHIFT TO CHARACTERIZE DISTANCES AND TIME 


Distance: When discussing objects within the 
Earth’s cluster of galaxies, astronomers typically 
describe distances in units of lightyears or parsecs. 
But when dealing with more distant objects (gal¬ 
axies, quasars, etc.), astronomers find it more con¬ 
venient to describe distance in terms of what is 
actually observed: redshift. For example, the 
statement “the galaxy 3C 295 is at a redshift of 
0.4614” means that “3C 295 is at that distance 
from Earth [given by equation (29.16)] which cor¬ 
responds to a redshift of z — 0.4614.” 


Time: When discussing events that occurred dur¬ 
ing the last few 10 9 years, astronomers usually 
measure time in units of years. Example: “The 
solar system condensed out of interstellar gas 
4.6 X 10 9 years ago” [see Wasserburg and Burnett 
(1968)]. But when dealing with events much nearer 
the beginning of the universe, all of which have 


essentially the same age, of about 12 x 10 9 years, 
astronomers find it more convenient to describe 
time in terms of redshift. Example: “The primor¬ 
dial plasma recombined at a redshift of 1,000” 
means that “If a photon had been emitted at the 
time of plasma recombination, and had propa¬ 
gated freely ever since, it would have experienced 
a total redshift between then and now of 
z = 1,000.” Equivalently, since 1 + z = (tf 0 /a)[see 
equation (29.11)], “the plasma recombined when 
the universe was a factor of 1 + z zz 1,000 smaller 
than it is today.” [Application: In Figure 28.1, 
where the past evolution of the universe is sum¬ 
marized, one can freely replace the horizontal 
scale a/a 0 by 1/(1 + z), and thereby see that pri¬ 
mordial element formation occurred at a redshift 
of z ^ 10 9 .) The conversion from redshift units to 
time units is strongly dependent on the parameters 
Pm* Pm> and k /°o 2 f see §§27.10 and 27.11; also 
equation (29.15)]. 



780 


29. PRESENT STATE AND FUTURE EVOLUTION OF THE UNIVERSE 


Derivation 

relation 


of photons with 4-momenta p. From the geodesic equation (or, for the reader who has studied 
chapter 25, from arguments about Killing vectors), show that 


/> x = p-O/0x) 


is conserved along the photon’s world line. Use this fact, the fact that a photon’s 4-momentum 
is null, p * p =0, and the equation E — —p-u for the energy measured by an observer 
with 4-velocity u , to derive the redshift equation (29.11). _ —— 

Exercise 29.3. REDSHIFT OF PARTICLE DE BROGLIE WAVELENGTHS 

A particle of finite rest mass /i moves along a geodesic world line through the expanding 
cosmological fluid. Let 

P = (P'P) U2 = (X _^ - 2) T72 

be the spatial 4-momentum of the particle as measured by observers at rest in the fluid. 
(The ordinary velocity they measure in their proper reference frames is u.) The associated 
“de Broglie wavelength” of the particle is A = h/p. 

(a) Show that this de Broglie wavelength is redshifted in precisely the same manner as 
a photon wavelength: 

A /a — constant. 

(b) Employing this result, show that, for the molecules of an ideal gas that fills the universe, 
their mean kinetic energy decreases in inverse proportion to a 2 when the gas is nonrelativistic 
and (like photon energies) in inverse proportion to a when the gas is highly relativistic. 


§29.3. THE DISTANCE-REDS HI FT RELATION; 

MEASUREMENT OF THE HUBBLE CONSTANT 

Equation (29.11) expresses the redshift in terms of the change in expansion factor 
between the event of emission and the event of reception. For “nearby” emitters 
(emitters at distances much less than \/H 0 > the “Hubble length”) it is more conven¬ 
ient to express the redshift in terms of the distance between the emitter and Earth. 
That distance (“present distance”) is defined on the hypersurface of homogeneity 
that passes through Earth today, since that hypersurface agrees locally with the 
surface of simultaneity of the receiver today, and it is also, locally, a surface of 
simultaneity for any observer moving today with the “cosmological fluid.” 
distance-redshift The distance between emitter and observer today [the distance along the spatial 
geodesic of constant (t, 6 , <f>) connecting (t r 0, 9 e , <j> e ) and ( t„ x e > <#><.)] can be read 

directly from the line element (29.4): 

l = a(t r )(Xe - Xr) = «(Ox C ' (29.12) 

Using expression (29.9) for x e > one finds 

r *r 

i = a(t r ) I a 1 dt. 


(29.12') 
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a(t) = a(t r ) + (a t ) tr (t - t r ) + ~ ( a n ) tr (t - t r ) 2 + ■■■ 

1 (29.13) 

= + H 0 {t - t T ) - \q 0 H 0 \t - t r f 

where definitions (29.1) for the Hubble constant H 0 and the deceleration parameter 
q 0 have been used. Putting this expression into equation (29.12') and integrating, 
one finds for the distance the expression 

l = Or ~ O + \H 0 {t r - t e f + ... 


or, equivalently, 


t r -t e = l-±H 0 l*+ .... (29.14) 

The redshift [equation (29.11)] can be expressed as a power series in t r —t e by using 
equation (29.13): 

. = gft) - g(Q = Wr - te) ± koHo 2 (tr ~ *e) 2 ± - J 

a (‘e) a{t r )[\ - H 0 (t T -t e ) + ■■■] J 

= H 0 (t r - t e ) + H 2 {\ + if 0 )(/ r - t e f +■... 

Combining this with equation (29.14) for t T — t e in terms of X, one finally obtains 

z — H 0 l + 1(1 + q 0 ){H 0 lf + 0([H 0 Lf). (29.16) 

This is the “ distance-redshift relation ” for the standard big-bang model of the uni¬ 
verse. 

By comparing this distance-redshift relation with astronomical observations (see 
Box 29.4, which is best read after the next section), Allan Sandage (1972a) obtains 
a Hubble constant of 


H 0 — 55 zt 1 km sec -1 Mpc -1 ; 

(29.17) 

H~ x = (18 ± 2) x 10 9 years. 

(29.18) 


(Note: 1 Mpc = one Megaparsec is 3.26 x 10 6 light years, or 3.08 x 10 24 cm.) The 
uncertainty of ±7 km sec -1 Mpc' 1 quoted here is the “one-sigma” statistical uncer¬ 
tainty associated with the distance-redshift data. Systematic errors, not now under¬ 
stood, might be somewhat larger; but the true value of H 0 almost certainly is within 
a factor 2 of Sandage’s value, 55 km sec” 1 Mpc -1 . 


Result for distance-redshift 
relation 


Measurement of Hubble 
constant H 0 
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Value of critical density 


Apparent magnitude defined 


Derivation of 

magnitude-redshift relation 


Note that, if A = 0, then the “critical density” marking the dividing line between 
a “closed” universe and an “open” universe—i.e., between eventual recontraction 
and expansion forever—is 

p crit = ^H 0 2 = 5 X 10-30 g/cm 3. (29.19) 

(As described in Box 29.1, p > p crit “closed” recontraction; p < p crit 
“open” expansion forever.) Comparison with the actual density will be de¬ 
layed until §29.6. 

The distance measurements are not accurate enough to yield useful information 
about the deceleration parameter, q 0 . 


§29.4. THE MAGNITUDE-REDSHIFT RELATION; 

MEASUREMENT OF THE DECELERATION PARAMETER 

Information about q 0 is best obtained by comparing the apparent magnitudes of 
galaxies with their redshifts. 

In astronomy one defines the apparent (bolometric) magnitude, m , of an object 
by the formula 


m = —2.5 log 10 (S/2.52 x 10 5 erg cm 2 sec -1 ) 
= —2.5 log 10 S + constant, 


(29.20) 


where S is the flux of energy (energy per unit time per unit area) that arrives at 
Earth from the object. [Of course, one cannot measure the flux over the entire 
wavelength range 0 < X < oo; so one distinguishes various apparent magnitudes 
(ra a , m B , m v ,...) corresponding to fluxes in various wavelength ranges (“U”=E 
“ultraviolet”; “B” ~ “blue”; “V” = “visual”). However, these subtleties are too far 
from gravitation physics to be treated here.] 

Calculate the apparent magnitude for a galaxy of intrinsic luminosity L and 
redshift z. To simplify the calculation, put the emitter at the origin of the space 
coordinates (x* = 0); and put the Earth at (x r , <£ r ). (Note the reversal of locations 
compared to redshift calculation of §29.2.) On Earth, place a photographic plate 
of area A perpendicular to the incoming light. Then at time t T the plate is a tiny 
segment of a spherical two-dimensional surface (/ = r r , x = x r ; ® an ^ <P vary) about 
the emitting galaxy. The total area of the 2-sphere surrounding the galaxy is 


A = Ma(t r )2(xr)] 2 - 


(29.21) 


Therefore, the ratio of the area of the plate to the area of the 2-sphere is given 
by 


A = A 
a (x r )F 


(29.22) 


The plate catches a fraction A/A of the energy that pours out through the 2-sphere. 
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If there were no redshift, the power crossing the entire 2-sphere at time t r would 
be precisely the luminosity of the emitter at time t e . However, the redshift modifies 
this result in two ways. (1) The energy of each photon that crosses the 2-sphere is 
smaller, as measured in the local Lorentz frame of the fluid there, than it was as 
measured by the emitter: 

E r JE, m = A em /A rec = 1/(1 + z). (29.23) 

(2) Two photons with the same 8 and <p , which are separated by a time At r as 
measured by an observer stationary with respect to the “cosmological fluid” at the 
2-sphere, were separated by a shorter time At e as measured by the emitter: 

At r /At e = X r /X e = 1 + z. (29.24) 

The luminosity, L , as measured at the source, is the sum of the energies E emJ of 
the individual photons (labeled with the index J) emitted in a time interval At e , 
divided by At e : 

L = {\/At e )Y J E, mJ . (29.25) 

J 

The power that crosses the 2-sphere a time t r — t e later, as measured by the fluid 
at the 2-sphere, is 

P = (1/210 £ E Tec j , (29.26) 

J 

where the summation runs over the same set of photons. 

Combining equations (29.23) to (29.26), one sees that the power crossing the 
2-sphere is 


P = L/( 1 + z) 2 . 


Of this, a fraction, 


?A A P 4tf[(l + z)a(0^(x r )] 2 ’ 

crosses the photographic plate; so the flux measured at the Earth is 



L 

4 tt /? 2 (1 + zf 9 


(29.27) 


where R is the “radius of curvature” of the 2-sphere surrounding the emitter and 
passing through the receiver at the time of reception, 

' a 0 sinh (x r - x e ) 

a JXr - Xe\ if k = 0, 

. a„ sin (x r - X«) if * = +1 


R = a 0 I(x r - Xe) = 


(29.28) 
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[recall: x e is 0 according to the present conventions, and a 0 = a(t r )}. The correspond¬ 
ing apparent magnitude [equation (29.20)] is 


m = + 5 log 10 [(l + z)/?] — 2.5 log 10 L + constant. (29.29) 


In order to relate the apparent magnitude to the redshift of the emitter, one must 
express the quantity R in terms ofz. From equation (29.7) for the photon propagation 
(with sign reversed because positions of receiver and emitter have been reversed), 
one knows that 


r tr r a -jr dt T r a(t )' 

Xr -x e =jr ^ 


and from equation (29.11) one knows that 

z = a(t r )/a(t e ) - 1. 


Hence 


_ r 1+ *r a 

* e v, a(t r ). .da. . a _ 


(29.31) 


Equations (4) to (6) of Box 29.1, and (27.40), determine the function dt/da in terms 
of a/a(t T ) and the constants H 0 , q 0 , a 0 . By inserting that result into equation (29.31) 
and integrating, one obtains x r — x e * n terms of the redshift z and the cosmological 
parameters H 0 , q oy a 0 : 


Xr X e — 11 + <? 0 3a o |1/2 j^ [2a 0 «3 + (1 


du 


+ q 0 - 3 <0« 2 + - q„] 1/2 


(29.32a) 


The 2-sphere radius of curvature R is obtained by inserting this expression into the 
equation x 

+ *<*-*) <2, ' 32b ’ 

[equation (29.28), with a 0 evaluated by equation (5) of Box 29.1]. 

Equations (29.29) and (29.32) determine the apparent magnitude, m, in terms of 
redshift, z. 

For the case of vanishing cosmological constant (o 0 — q 0 ; A — 0), the integral 
(29.32a) can be expressed in terms of elementary functions, yielding 


R = 


H: 


-7„ 2 (1 + z) 


[~q 0 + 1 + q 0 z + (q 0 - 1X2 q 0 z + 1) 1/2 ], 


+ q 0 )z + 0(z 2 )], 


(29.33) 


so that 


Result for magnitude-redshift 
relation 


m = 5 log 10 [1 - q 0 + q 0 z + (q 0 - l)(2q 0 z + 1) 1/2 ] - 2.5 log 10 L + const. 

~ 3 l°gio z + 1-086(1 — q 0 )z + 0(z 2 ) — 2.5 log 10 L + const. (29.34) 

for z < 1. 
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(Note: the factor 1.086 is actually 2.5/ln 10.) A power-series solution for nonzero 
A- (for ° 0 5* q 0 ) reveals a dependence on a 0 only at 0(z 2 ) and higher: 

R ~ #<T lz [l - yO + q 0 ) z + (corrections of 0(z 2 ) depending on a 0 and q 0 ) j, 

(29.35 a) 

m ~ 5 log 10 z + 1.086(1 - q 0 )z + 0(z 2 ) - 2.5 log 10 L + const. (29.35b) 

Sheldon (1971) gives the exact solution for A ^ 0 in terms of the Weierstrass elliptic 
function. Refsdal et al. (1967) tabulate and plot the exact solution. 

By comparing the theoretical magnitude-redshift relation (29.35b) with observa- Measurement of deceleration 
tions of the brightest galaxies in 82 clusters, Allan Sandage (1972a,c,d) obtains the Parameter, <7„ 
following value for the deceleration parameter: 

q 0 = 1-0 ±0.5, if o 0 = < 7 o(i e. A = 0). (29.36) 

(Note: 0.5 is the “one-sigma” uncertainty. Sandage estimates with 68 per cent con¬ 
fidence that 0.5 < q o < 1.5, and with 95 per cent confidence that 0 < q 0 < 2— 
providing unknown evolutionary effects are negligible.) The observations leading 
to this result and the uncertainties due to evolutionary effects are described in Box 
29.4. Box 29.5 gives a glimpse of Edwin Hubble, the man who laid the foundations 
for such cosmological measurements. 

(continued on page 794) 
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Box 29.4 MEASUREMENT OF HUBBLE CONSTANT AND 
DECELERATION PARAMETER 

I. Hubble Constant , H 0 

A. Objective: To measure the constant H 0 by comparing observational data with 
the distance-redshift relation 

Z = HJ + 1(1 + q 0 )(H 0 l) 2 + 0{[H 0 lf). 

Here i is distance from Earth to source today; and z is redshift of source 
as measured at Earth. 

B. Key Difficulty: This distance-redshift relation does not apply to stars in our 
Galaxy: the Galaxy is gravitationally bound and therefore is impervious to 
the universal expansion. Nor does the distance-redshift relation apply to the 
separations between our Galaxy and nearby galaxies (the “local group”); 
gravitational attraction between our Galaxy and its neighbors is so great it 
perturbs their motions substantially away from universal expansion. Only on 
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Box 29.4 (continued) 


scales large enough to include many galaxies (scales where each galaxy or 
cluster of galaxies can be thought of as a “grain of dust,” with the grains 
distributed roughly homogeneously)—only on such large scales should the 
distance-redshift relation hold with good accuracy. But it is very difficult to 
obtain reliable measurements of the distances i to galaxies that are so far 
away! 

C. Procedure by which H 0 has been measured [Sandage and Tamman , as summar¬ 
ized in Sandage ( 1972a)]: 

1. Cepheid variables are pulsating stars with pulsation periods (as measured 
by oscillations in light output) that are very closely correlated with their 
luminosities L —or, equivalently, with their absolute (bolometric) magni¬ 
tudes, M: 

/apparent magnitude star would have were it at a\ 

\distance of 10 parsecs = 32.6 light years / (1) 

= —2.5 log 10 (L/3.0 x 10 35 erg sec -1 ) 

[see equation (29.20).] By measurements within our Galaxy, astronomers 
have obtained the “period-luminosity relation” for cepheid variables. 

2. Cepheid variables are clearly visible in galaxies as far away as Mpc 
(4 Megaparsecs = 4 X 10 6 parsecs). In each such galaxy one measures the 
periods of the cepheids; one then infers their absolute magnitudes M from 
the period-luminosity relation; one measures their apparent magnitudes 
m\ and one then calculates their distances i from Earth using the relation 

m-M=5 1og 10 (X/10pc). (2) 

By this means one obtains the distances l to all galaxies within —4 Mpc 
of our own. Unfortunately, such galaxies are too close to participate cleanly 
in the universal expansion. (They include only the “local group,” the “M81 
group,” and the “south polar group.”) Thus, one must push the distance 
scale out still farther before attempting to measure H 0 . 

3. Galaxies of types Sc, Sd, Sm, and Ir within —4 Mpc contain huge clouds 
of ionized hydrogen, which shine brightly in “Ha light.” These clouds, 
called “H II regions,” exhibit a very tight correlation between diameter 
D of the H II region and luminosity L of the galaxy (or, equivalently, 
between D and absolute magnitude of galaxy, M). In fact, for a given 
galaxy luminosity L, the fractional spread in H II diameters is o(AD/D ) 
^ 0.12. Using (a) the distances (<4 Mpc) to these galaxies as determined 
via cepheid variables, (b) the apparent magnitudes of the galaxies, and 
(c) the angular diameters of H II regions in the galaxies, one calculates 
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the actual HII diameters D and galaxy luminosities L, and thereby obtains 
the “diameter-luminosity relation” D(L ). 

4. H II regions are large enough to be seen clearly in galaxies as far away 
as —60 Mpc. By measuring the H II angular diameters a = D/l and 
galaxy apparent (bolometric) magnitudes 

^=-^h k { 2 j 52 xW hx^A < 3 » 

and by combining with the diameter-luminosity relation, one obtains the 
distances X to all galaxies of types Sc, Sd, Sm, and Ir which possess H II 
regions and lie within —60 Mpc of Earth. Unfortunately, this is still not 
far enough away for local motions to be negligible compared with the 
universal expansion. 

5. Within —60 Mpc reside enough galaxies of type Sc I for one to discover 
that their luminosities (absolute magnitudes) are rather constant (difference 
in L from one Sc I galaxy to another <50 per cent). Using the distances 
to such Sc I galaxies, as measured via H II regions, and using measurements 
of their apparent magnitudes, one calculates their universal absolute 
magnitude (measured photographically) to be M pg — —21.2. 



The Sc I galaxy M101 at a distance X — 3 Mpc from Earth, 
as photographed with the 200-inch telescope. (Courtesy of 
Hale Observatories) 



z (,l Soq 
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6. One then examines all known Sc I galaxies with distances greater than 
—70 Mpc. For each, one measures the apparent magnitude and compares 
it with the universal absolute magnitude to obtain the distance l from 
Earth. And for each, one measures the redshift z = ^4A/X of the spectral 
lines. From the resulting redshift-distance relation—and taking into ac¬ 
count the statistical uncertainties in all steps leading up to it—Sandage 
and Tamman (work carried out in 1965-1972) obtain the value 
H 0 — dz/di = 55 ± 7 (km/sec) Mpc -1 = 1/[(18 ± 2) X 10 9 years]. [For 
a review see Sandage (1972a).] The quoted error is purely statistical. Sys¬ 
tematic errors are surely larger—but they almost surely do not exceed a 
factor 2 [i.e., 30 < H 0 < 110 (km/sec) Mpc' 1 ]. 



Magnitude-redshift relation for Sc I galaxies at distances >70 Mpc. Solid line is 
a least-squares fit to the data; dotted line has the theoretical slope of 5. [From 
Sandage and Tamman.] 


II. Deceleration Parameter , q 0 . 

A. Objective: To measure the constant q 0 by comparing observational data with 
the magnitude-redshift relation: 


m = 5 log 10 z + 1.086(1 - q o )z + 0(z 2 ) - 2.5 log 10 L + const. (4) 
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[Note: This relation is valid even if the cosmological constant is nonzero, i.e., 
even if o 0 jz q 0 . Dependence on o 0 occurs only at 0(z 2 ) and higher.] 


B. Key Difficulty: One must use data for objects with the same absolute lumin¬ 
osity L (“standard candles”). But one cannot measure L at distances great 
enough for the effects of q 0 to show up. 

C. The Search for a Standard Candle: One obvious choice for the standard candle 
would be Sc I galaxies, since they were found to all have nearly the same 
L (see above). But they are not bright enough to be seen at distances great 
enough for effects of q 0 to show up. An alternative choice, quasars, are bright 
enough to be seen at very large redshifts (z as large as —3). But their absolute 
luminosities have enormous scatter—or so one infers from the failure of 
quasars to fall on a straight line, even at small z, in the magnitude-redshift 
diagram. The best choice is the brightest type of object that has small scatter 
in L . Sandage (1972a,b,c) chooses the brightest galaxy in “recognized regular 
clusters of galaxies.” Such clusters are composed predominately of E-type 
galaxies, and the brightest members are remarkably similar from one cluster 



The E-type galaxy M87 at a distance i — 11 Mpc from Earth, 
as photographed with the 200-inch telescope. (Courtesy of Hale 
Observatories) 


to another (scatter in L is —25 per cent). The similarity shows up in their 
spectra and in the very precise straight lines they give when one plots angular 
diameter versus redshift (next page), or apparent magnitude versus redshift 
(next page), or angular diameter versus apparent magnitude. 
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Angular diameter versus redshift 
for brightest galaxy in recognized 
regular clusters. From Sandage 
(1972a,b). [These data are not suffi¬ 
ciently precise to yield useful in¬ 
formation about q 0 and a Q \ but 
improvements in 1973 may bring 
the needed precision; see §29.5.] 


Log 10 0 (sec) 



Magnitude versus redshift for brightest 
galaxy in recognized regular clusters. 
V 26 - K v - A v is the apparent magni¬ 
tude with certain corrections taken into 
account. The line plotted corresponds to 
o 0 — q 0 - 1 (straight line of slope 5). 
From Sandage (1972a). 


^26 “ Kv - A v 
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D. Procedure by which q 0 has been measured [Sandage ( 1972a,c )]: 

1. Data on magnitude versus redshift have been gathered for the brightest 
galaxy in 82 recognized regular clusters (see above). 

2. The data, when fitted with a straight line, show a slope of 

dm/dlog 10 z — 5.150 ±: 0268 (rms), (5) 

by comparison with a theoretical slope of 5. 

3. The data, when fitted to the theoretical relation 


m = 5 log 10 z + 1.086(1 — q 0 )z + 0(z 2 ) 4- const, (6) 

[with the correct 0(z 2 ) and higher terms included; see equations (29.29), 
(29.32), and (29.34)] yield 


% = 1 ± °- 5 (one-sigma)] 

= 1±1 (two-sigma) J lf<7 °-^- 0) - 


(7) 


The data are inadequate to determine o 0 and q 0 simultaneously. [The 0(z 2 ) 
terms, which depend on o 0 , play a significant role in the fit to the data. 
For a graphical depiction of their theoretical effects see Figure 2 of Refsdal 
et. al (1967).] 

E. Evolutionary uncertainties 

1. Sandage’s fit of data to theory assumes that the luminosities of his “stand¬ 
ard candles” are constant in time. If, because of evolution of old stars 
and formation of new ones, his galaxies were to dim by 0.09 magnitudes 
per 10 9 years, then galaxies 10 9 light-years away, which one sees as they 
were 10 9 years ago, would be 0.09 magnitudes brighter intrinsically than 
identical nearby galaxies. Correction for this effect would lower the most 
probable value of q 0 from 1 to 0 [Sandage (1972c)]. 

2. Knowledge of the evolution of galaxies in 1972 is too rudimentary to 
confirm or rule out such an effect. [See references cited by Sandage 
(1972c).] 




J 
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Box 29.5 EDWIN POWELL HUBBLE 

November 20, 1889, Marshfield Missouri — 
September 28, 1953, Pasadena, California 



Edwin Hubble, at age 24, earned a law degree 
from Oxford University and began practicing law 
in Louisville, Kentucky, After a year of practice 
he became fed up and, in his own words, “chucked 
the law for astronomy, and I knew that even if 
I were second-rate or third-rate it was astronomy 
that mattered.” He chose the University of Chi¬ 
cago and Yerkes Observatory as the site for his 


astronomy education, and three years later (1917) 
completed a Ph.D. thesis on “Photographic Inves¬ 
tigations of Faint Nebulae.” 

When Hubble entered astronomy, it was sus¬ 
pected that some nebulae lie outside the Galaxy, 
but the evidence was exceedingly weak. During 
the subsequent two decades, Hubble, more than 
anyone else, was responsible for opening to man’s 
purview the extragalactic universe. Working with 
the 60-inch and 100-inch telescopes at Mount Wil¬ 
son, Hubble developed irrefutable evidence of the 
extragalactic nature of spiral nebulae, elliptical 
nebulae, and irregular nebulae (now called gal¬ 
axies). He devised the classification scheme for 
galaxies which is still in use today. He systematized 
the entire subject of extragalactic research: deter¬ 
mining distance scales, luminosities, star densities, 
and the peculiar motion of our Galaxy; and ob¬ 
taining extensive evidence that the laws of physics 
outside the Galaxy are the same as near Earth (in 
Hubble’s words: “verifying the principle of the 
uniformity of nature”). He discovered and quan¬ 
tified the large-scale homogeneity of the universe. 
And—his greatest triumph of all!—he discovered 
the expansion of the universe. 

The details of Hubble’s pioneering work are best 
sketched in his own words: 


"Extremely little is known of the nature of nebulae; and no classification has yet 
been suggested. . . . The agreement [between the velocity of escape from a spiral 
nebula and that from our galaxy ] is such as to lend some color to the hypothesis 
that the spirals are stellar systems at distances to be measured 

often in millions of light years. " 

(1920; Ph.D. THESIS; PUBLICATION DELAYED 3 YEARS BY WORLD WAR I) 


This box is based largely on the biography of Hubble by Mayall (1970). 
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"The present investigation [using Cepheid variables for the first time as an 
indicator of distances beyond the Magellanic clouds] identifies NGC 6822 as an 


isolated system of stars and nebulae of the same type as the Magellanic clouds, 
although somewhat smaller and much more distant. A consistent structure is thus 
reared on the foundation of the Cepheid criterion, in which the dimensions, 
luminosities, and densities, both of the system [NGC 6822] as a whole and its 
separate members, are of orders of magnitude which are thoroughly familiar. The 
distance is the only quantity of a new order. The principle of the uniformity of 
nature thus seems to rule undisturbed in this remote region of space." 


(1925) 


"Critical tests made with the 100-inch reflector, the highest resolving power 
available, show no difference between the photographic images of the so-called 
condensations in Messier 33 and the images of ordinary galactic stars. . . . The 
period-luminosity relation is conspicuous among the thirty-five Cepheids and 
indicates a distance about 8.1 times that of the Small Magellanic Cloud. Using 
S hap ley's value for the fatter, the distance of the spiral 
is about 263,000 parsecs." 

(1926a) 


"[To the present paper (1926b)] is prefaced a general classification of nebulae 
. . . the various types [of extragalactic nebulae] are homogeneously distributed 
over the sky. . . . The data are now available for deriving a value for the order of 
the density of space. This is accomplished by means of the formulae for the 
numbers of nebulae to a given limiting magnitude and for the distance in terms of 

the magnitude. [The result is] 

_ P = 1.5 X 10~ 31 grams per cubic centimeter. 

This must be considered as a lower limit, for loose material scattered between the 
systems is entirely ignored. The mean density of space can be used to determine 
the dimensions of the finite but boundless universe of general reletivity . . . 

R = — - C - ■ —. r 2.7 x 10 10 parsecs. " 

\4ttW Vp (1926b) 

"The data . . . indicate a linear correlation between distances and velocities [for 
extragalactic nebulae]. Two solutions have been made, one using the 24 nebulae 
individually, the other combining them into 9 groups according the proximity in 
direction and distance. The results are . . . 24 objects: K =: 465 ± 50 km/sec per 
/O 6 parsecs ; 9 groups: K = 513 ± 60 km/sec per 10 6 parsecs. ... The 
outstanding feature, however, is the possibility that the velocity-distance relation 
may represent the de Sitter effect, and hence that numerical data may be 
introduced into discussions of the general curvature of space. " 

(1929)* 


Hubble’s value of K (the “Hubble constant”) was later revised downward by the work 
of Baade and Sandage; see section titled The Hubble Time in Box 27.1. 




Box 29.5 (continued) 


"The velocity-distance relation is re-examined with the aid of 40 new 
velocities. ... The new data extend out to about eighteen times the distance 
available in the first formulation of the velocity-distance relation>, but the form of 
the relation remains unchanged except for [ Shapley's 10 per cent] 

revision of the unit of distance. " 

(1931). WITH M. L. HUMASON 

"Many ways of producing such effects [redshifts in extragalactic nebulae ] are 
known ; but of them all, only one will produce large redshifts without introducing 
other effects which should be conspicuous but actually are not found. This one 
known permissible explanation interprets redshifts as due 
to actual motion away from the observer ." 

(1934a) 

"We now have a hasty sketch of some of the general features of the observable 
region as a unit. The next step will be to follow the reconnaissance with a 
survey—to repeat carefully the explorations with an eye to accuracy and 
completeness. The program, with its emphasis on methods, will be a tedious 

series of successive approximations ." 

(1934b) 


Most of the remainder of Hubble’s career was 
dedicated to this “tedious series of successive ap¬ 
proximations.” Shortly before Hubble’s death the 
200-inch telescope went into operation at Palomar 


Mountain; and Hubble’s student, Alan Sandage, 
began using it in a continuation of Hubble’s quest 
into the true nature of the universe. (See Box 
29.4). 


EXERCISES 


Exercise 29.4. m(z) DERIVED USING STATISTICAL PHYSICS 

Derive the magnitude-redshift relation using a statistical description of the photon distribution 
[cf. eq. (22.49) and associated discussion]. 

Exercise 29.5. DOPPLER SHIFT VERSUS COSMOLOGICAL REDSHIFT 

(a) Consider, in flat spacetime, a galaxy moving away from the Earth with velocity u, and 
emitting light that is received at Earth. Let the distance between Earth and galaxy, as 
measured in the Earth’s Lorentz frame at some specific moment of emission, be r; and let 
the Doppler shift of the radiation when it is eventually received be z — 4A/\. Show that 
the flux of energy S received at the Earth is related to the galaxy’s intrinsic luminosity L 
by 

L 


S = 


477T 2 (1 + z) 4 ' 


(29.37) 


[Track-2 readers will find it most convenient to use the statistical formalism of equation 
(22.49).] 

(b) Compare this formula for the flux with formula (29.27), where the redshift is of 
cosmological origin. Why is the number of factors of 1 + z different for the two formulas? 
[Mathematical answer: equation (6.28a) of Ellis (1971).] 
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§29.5. SEARCH FOR “LENS EFFECT” OF THE UNIVERSE 

Curved space should act as a lens of great focal length. The curving of light rays 
has little effect on the apparent size of nearby objects. However, distant galaxies— 
galaxies from a quarter of the way up to halfway around the universe—are expected 
to have greatly magnified angular diameters [Klauder, Wakano, Wheeler, and Willey 
(1959)]. To see a normal galaxy at such a distance by means of an optical telescope 
seems out of the question. However, radio telescopes resolve features in quasistellar 
sources and other radiogalaxies at redshifts of z = 2 or more. Moreover, paired radio 
telescopes at intercontinental distances (for example, Goldstone, California, and 
Woomera, Australia) resolve distant sources to better than 0".001 or 4.8 x 10 -9 
radians or 15 lightyears for an object at a distance of 3 X 10 9 lightyears (Euclidean 
geometry temporarily being assumed). A radio telescope in space paired with a radio 
telescope on earth will be able to do even better on angular resolution. Will one 
be able to find any fiducial distance characteristic of any one class of objects that 
will serve as a natural standard of length, for very great distances (z = 2 to z = 3) 
as well as for galaxies closer at hand? Perhaps not. However, it would seem unwise 
to discount this possibility, with all the advantages it would bring, in view of the 
demonstrated ability of skilled observers to find regularities elsewhere where one 
had no right to expect them in advance. 

Let L denote the actual length of a fiducial element (if any be found) in a galaxy; 
and let 86 (radians!) denote the apparent length of the object, idealized as perpendi¬ 
cular to the line of sight, as seen by the observer. The ratio of these two quantities 
defines the “angle effective distance” of the source, 

(angle effective distance) = r aed = L/86. (29.38a) 

In flat space and for objects with zero relative velocity, this distance is to be identified 
with the actual distance, r, to the source or with the actual time of flight, t, of light 
from source to observer. The situation is changed in an expanding universe. 

To calculate the angle effective distance as a function of redshift, place the Earth 
(receiver) at x r = 0; and place the object under study (emitter) at x e - Let the fiducial 
length L lie on the sphere at x e (perpendicular to line of sight), and let it run from 
6 e to 6 e + 86 [one end of fiducial element at (x e , 0 e , <p e ); other at (x e , 6 e + 86 , <j> e )\. 
Then 

L = a(t e )2 ( Xe ) 86, 

and 

r aed = a(0^(Xe) = [d(t T )Z{x e - X r )\d{t e )/a{t T )\ 
i.e. [see equation (29.28), with x r and x e reversed], 

r aed = R/( 1 + z). (29.38b) 

Here R is given as a function of redshift of source, z, and cosmological parameters 
H 0 , q 0 , < 7 0 , by equations (29.32) in general, or by (29.33) if yl = 0. [Equation (29.38b) 
is modified if the beam preferentially traverses regions of low mass density (“vacuum 
between the galaxies”); see equation (22.37) and Gunn (1967).] 


The hope for a fiducial length 
in distant objects 


Angle effective distance 
defined 


Angle effective distance as 
function of redshift 
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Figure 29.2. 

Angle effective distance versus redshift for two typical cosmo¬ 
logical models—one open (0 < o 0 = q Q < 1); the other closed 
(a 0 = q o = 1); both with zero cosmological constant; both with 
H°~ l = 18 x 10 9 lyr. 


Angle effective distance as a 
tool for determining whether 
universe is closed 


Figure 29.2 shows angle effective distance as a function of redshift for a few 
selected choices of the relevant parameters. It is evident that the angle effective 
distance has a maximum for a redshift roughly of the order z ~~ 1, provided that 
the universe is closed. However, there is a big difference if the universe is open 
(Figure 29.2). The rapid improvements taking place in radio astronomy make 
increasingly attractive the possibility it provides for testing whether the universe is 
closed, as Einstein argued it should be [Einstein (1950), pp. 107-108]. Moreover, 
even with optical telescopes, in 1973 one may be on the verge of measuring q 0 by 
studies of angle effective distance: preliminary studies [Sandage (1972b)] suggest that 
the optical size of the brightest E-type galaxies may be a usable fiducial length. 


§29.6. DENSITY OF THE UNIVERSE TODAY 


Measurements of mean mass 
density of universe: 

(1) luminous matter in 
galaxies 


It is exceedingly difficult to measure the mean density p^ of the universe today. 
A large amount of matter may be in forms that astronomers have not yet managed 
to observe (intergalactic matter, black holes, etc.). Therefore, the best one can do 
is to add up all the luminous matter in galaxies and regard the resulting number 
as a lower limit on p^. Even adding up the luminous matter is a difficult and risky 
task, so difficult that even today no analysis is more definitive than the classic work 
of Oort (1958). [See, however, the very detailed review of the problem in Chapter 
4 of Peebles (1971)]. Oort’s result is 


^luminous matter 


(2 X 10 -31 g/cm 3 )(// 0 /55 km sec 1 Mpc 1 ) 2 , (29.39) 


corresponding to 


o 0 > 0.02 (independent of the value of H 0 ). 


(29.40) 
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As an example (albeit an atypical one) of the danger inherent in any such estimate, 
Oort points to the Virgo cluster of galaxies. If the Virgo cluster is not gravitationally 
bound, then its —2,500 galaxies will go flying apart, destroying any semblance of 
a cluster, in about one billion years. If it is gravitationally bound, then the mean 
velocity of its galaxies relative to each other, when combined with the virial theorem, 
yields an estimate of the cluster’s total mass. That estimate is 25 times larger than 
the value one gets by Oort’s method of adding up the luminous mass of the cluster. 

Although one has no definitive evidence for or against large amounts of matter 
(enough to close the universe) in intergalactic space, one has tentative indirect limits: 

(1) If A = 0 (in accord with Einstein), then o 0 = q 0 \ so Sandage’s value of q 0 < 
1—stretched to q 0 < 10 under the most wild of assumptions about galaxy evolu¬ 
tion—implies 

Pig < 10“ 28 g/cm 3 (o 0 = q 0 < 10). 

(2) Gott and Gunn (1971) point out that, if the density of gas in intergalactic space 
were >10~ 30 g/cm 3 (i.e., if o 0 were >0.1), one would expect gas falling into the 
Coma cluster of galaxies to form a shock wave, which would emit large amounts 
of X-rays. From the current X-ray observations, one can place a limit on the amount 
of such infalling matter—and therefrom a limit 

P ig < 10~ 3o g/cm 3 (a 0 < 0.1) 

on the density of gas in intergalactic space. But these limits, like others obtained 
in other ways [see Chapter 4 of Peebles (1971) for a review] are far from definitive; 
they depend too much on theoretical calculations to make one feel fully comfortable. 


§29.7. SUMMARY OF PRESENT KNOWLEDGE ABOUT 
COSMOLOGICAL PARAMETERS 

The best data available in 1973 [equations (29.18), (29.36), (29.40)] reveal 

Hfi 1 — (18 ± 2) x 10 9 years, 

q 0 = 1 ± 0.5 (one-sigma) if o 0 = q 0 (A = 0), (29.41) 

( 7 0 > 0 . 02 , 

for the observational parameters of the universe. These numbers are inadequate to 
reveal whether the universe is closed or open, and whether it will continue to expand 
forever or will eventually slow to a halt and recontract. 

If one is disappointed in this lack of knowledge, one can at least be consoled by 
the following. (1) There is excellent agreement between theory and observation for 
the linear (low-z) parts of the distance-redshift, magnitude-redshift, and angular 
diameter-redshift relations (Box 29.4). (2) There is remarkably good agreement 
between (a) the age of the universe (18 billion years if q 0 — o 0 < 1; 12 billion years 
if q 0 = o 0 — i) as calculated from the measured value of H 0 \ (b) the ages of the 


(2) matter in intergalactic 
space 


Summary of observational 
parameters of universe 


Some quantitative triumphs 
of cosmology 
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The bright prospects for 
observational cosmology 


EXERCISES 


oldest stars (—10 X 10 9 years) as calculated by comparing the theory of stellar 
evolution with the properties of the obser/ed stars; (c) the time (—9 billion years) 
since nucleosynthesis of the uranium, thorium, and plutonium atoms that one finds 
on Earth, as calculated from the measured relative abundances of various nucleides; 
and (d) the ages (4.6 billion years) of the oldest meteorites and oldest lunar rock 
samples, as calculated from measured relative abundances of other nucleides. For 
further detail see, e.g., Sandage (1968, 1970), Wasserburg et al. (1969), Wasserburg 
and Burnett (1968), and Fowler (1972). (3) Observations of the cosmic microwave 
radiation and measurements of helium abundance are now capable of giving direct 
information about physical processes in the universe at redshifts 2 > 1 (Chapter 28). 
(4) One may yet find “fiducial lengths” in radio sources, visible out to z > 1, with 
which to measure q 0 and o 0 by the angle-effective-distance method (§29.5). (5) The 
enigmas of the nature of quasars and of their peculiar distribution with redshift (great 
congregation at 2 — 2; absence at z > 3) may yet be cracked and may yield, in the 
process, much new information about the origin of structure in the universe 
(Box 28.1). (6) The next decade may well bring as many great observational surprises, 
and corresponding new insights, as has the last decade. 


Exercise 29.6. SOURCE COUNTS 

Suppose that one could find (which one cannot) a family of light or radio sources that 
(1) are all identical with intrinsic luminosities L, (2) are distributed uniformly throughout 
the universe, and (3) are born at the same rate as they die so that the number in a unit 
comoving coordinate volume is forever fixed. 

(a) Show that the number of such sources N(z) with redshifts less than z, as observed 
from Earth today, would be 

N(z) = (constant) • z 3 [l - y(l + q 0 )z + 0(z 2 ) j. (29.42) 

(b) Show that the number of sources N(S) with fluxes greater than S as observed at Earth 
today would be 


t_= z 2 + 0(z 3 ) I [first-order correction 1 
[independent of q 0 and o 0 J 

[Answer: See §15.7 of Robertson and Noonan (1968).] 

Exercise 29.7. COSMIC-RAY DENSITY (Problem devised by Maarten Schmidt) 

Suppose the universe has contained the same number of galaxies indefinitely into the past. 
Suppose further that the cosmic rays in the universe were created in galaxies and that a 
negligible fraction of them have been degraded or lost since formation. Derive an expression 
for the average density of energy in cosmic rays in the universe today in terms of: (1) the 
number density of galaxies, N 0 , today; and (2) the nonconstant rate, dE/dz , at which the 
average galaxy created cosmic-ray energy during the past history of the universe. [At redshift 
z in range dz, the average galaxy liberates energy (dE/dz) dz into cosmic rays.] 
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Exercise 29.8. FRACTION OF SKY COVERED BY GALAXIES 

Assume that the redshifts of quasars are cosmological. Let the number of galaxies per unit 
physical volume in the universe today be N 0 , and assume that no galaxies have been created 
or destroyed since a redshift of >7. Let D be the average angular diameter of a galaxy. 
Calculate the probability that the light from a quasar at redshift z, has passed through at 
least one intervening galaxy during its travel to Earth. [For a detailed discussion of this 
problem, see Wagoner (1967).] 



CHAPTER 


ANISOTROPIC AND 
INHOMOGENEOUS COSMOLOGIES 



This chapter is entirely Track 2. 

The main text requires no 
special preparation, although 
Chapters 27-29 would be 
helpful. 

Box 30.1 contains more 
technical sections: ideal 
preparation for it would be 
Chapters 4, 9-14, 21, and 
27-29, plus §25.2; minimal 
preparation would be 
exercises 9.13, 9.14, and 
25.2, Chapter 21 through 
§21.8, and §§27.8, 27.11, 
and 29.2. 

Chapter 30 is not needed as 
preparation for any later 
chapter. 


Motivation for studying 
inhomogeneous and 
anisotropic cosmologies: Why 
is universe so uniform? 


§30.1. WHY IS THE UNIVERSE SO HOMOGENEOUS 
AND ISOTROPIC? 

The last three chapters studied the Friedmann cosmological models and the relatively 
satisfactory picture they give of the universe and its evolution. This chapter describes 
less simplified cosmological models, and uses them to begin answering the question, 
“Why are the very simple Friedmann models satisfactory?” This question is intended 
to probe more deeply than the first, obvious answer—namely, that the models are 
satisfactory because they do not contradict observations. Accepting the agreement 
with observations, we want to understand why the laws of physics should demand 
(rather than merely permit ) a universe that is homogeneous and isotropic to high 
accuracy on large scales. Because this question cannot be answered definitively in 
1972, many readers will prefer to omit this chapter on the first reading and return 
to it only after they have surveyed the major results in other areas such as black 
holes (Chapter 33), gravitational waves (Chapters 35-37), and solar-system experi¬ 
ments (Chapter 40). 

The approach described here to the question “Why is the universe so highly 
symmetric?” is to ask Einstein’s equations to describe what would have happened 
if the universe had started out highly irregular. 

The first step in this approach is to ask what would have happened if the universe 
had started a little bit irregular. This problem can be tackled by analyzing small 
perturbations away from the high symmetry of the Friedmann models. Such an 
analysis is most fruitful in its discussion of the beginnings of galaxy formation, and 
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in its ability to relate small upper limits on the present-day anisotropy of the 
microwave background radiation to limits on density and temperature irregularities 
that might have existed ten billion years ago, when the radiation was emitted. These 
studies are described so well in the book by Zel’dovich and Novikov (1974) [see 
also Field (1973), Peebles (1969), Peebles and Yu (1970), Jones and Peebles (1972), 
and references cited therein] that we omit them here. 

Another approach is to allow large deviations from the symmetry of the Friedmann 
universes, but to put the asymmetries into only a few degrees of freedom. 


§30.2. THE KASNER MODEL FOR 

AN ANISOTROPIC UNIVERSE 

Kasner metric: an example of 
an anisotropic model 
universe 

which was first studied as a cosmological model by Schucking and Heckmann (1958). 

In this metric the p i are constants satisfying 

Pi + P 2 + Pa = C Pi ) 2 + (P 2) 2 + (Psf = 1. (30.2) 

Each t = constant hypersurface of this cosmological model is a fiat three-dimensional 
space. The world lines of constant x,y, z are timelike geodesics along which galaxies 
or other matter, treated as test particles, can be imagined to move. This model 
represents an expanding universe, since the volume element 

v=i = y/®g = t 

is constantly increasing. But it is an anisotropically expanding universe. The separa¬ 
tion between two standard (constant x,y, z) observers is t pi Ax if only their x-coor- 
dinates differ. Thus, distances parallel to the x-axis expand at one rate, X, oc t pl , 
while those along the y-axis can expand at a different rate, i 2 cc t P2 . Most remarkable 
perhaps is the fact that along one of the axes distances contract rather than expand. 

This contraction shows up mathematically in the fact that equations (30.2) require 
one of the p’s, say p v to be nonpositive: 

-j</>i<0. (30.3) 

As a consequence, in a universe of this sort, if black-body radiation were emitted 
at one time t and never subsequently scattered, later observers would see blue shifts 
near one pair of antipodes on the sky and red shifts in most other directions. In 
terms of this example, the fundamental cosmological question is why the Friedmann 
metrics should be a more accurate approximation to the real universe than is this 
Kasner metric. 


The prototype for cosmological models with great asymmetry in a few degrees of 
freedom is the Kasner (1921a) metric, 

ds 2 — —dt 2 + t 2p idx 2 + t 2p i dy 2 + t 2p *dz 2 , (30.1) 
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Kasner model with 
becomes isotropic i 
age" 


Anisotropy energy 


Adiabatic cooling 
anisotropy 


§30.3. ADIABATIC COOLING OF ANISOTROPY 

In seeking an answer, ask a question. Ask, in particular, what would become of a 
universe that starts out near / = 0 with a form described by the Kasner metric of. 
equation (30.1). This metric is an exact solution of the vacuum Einstein equation 
G — 0. It approximates a situation where the matter terms in the Einstein equations 
are negligible by comparison with typical non-zero components of the Riemann 
tensor. Schiicking and Heckmann (1958) give solutions with matter included as 
matter a pressureless fluid. In this situation, the curvature of empty spacetime dominates 

n old both the geometry and the expansion rate at early times, t —0; but after some 

characteristic time t m the matter terms become more important, and the metric 
reduces asymptotically to the homogeneous, isotropic model with k = 0. 

This example illustrates the possibility that the universe might achieve a measure 
of isotropy and homogeneity in old age, even if it were bom in a highly irregular 
state. Whether the symmetry of our universe can be explained along these lines is 
not yet clear in 1972. The model universe just mentioned is only a hint, especially 
since the critical parameter t m can be given any value whatsoever. 

The standard Einstein general-relativity physics of this model can be described 
in other language (Misner, 1968) by ascribing to the anisotropic motions of empty 
spacetime an “effective energy density” so , which enters the Coo component of 
the Einstein equation on an equal footing with the matter-energy density, and 
thereby helps to account for the expansion of the universe: 

H 2 = (y|ln V^) 2 = ^(p aniso + P matter ). (30.4) 

The anisotropy energy density is found to have an equation of state 

Paniso a (3, 2 -1 = (volume)- 2 , 


while 

Pmatter a i3) g~'" 2 = (volume)^. 

For pressureless matter y = 1; for a radiation fluid y = 4/3; for a nonrelativistic 
ideal gas y = 5/3). 

This arrangement of the Einstein equation allows one to think of the anisotropy 
motions as being adiabatically cooled by the expansion of the universe, just as the 
thermal motions of an ideal gas would be. Since the adiabatic index for homogeneous 
anisotropy is y = 2, the anisotropy will be the dominant source of “effective energy” 
in a highly compressed state, whereas the matter will dominate in an expanded state. 


§30.4. VISCOUS DISSIPATION OF ANISTROPY 

The model universe sketched above can be further elaborated by introducing dissi¬ 
pative mechanisms that convert anisotropy energy into thermal energy. Suppose that 
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such an anisotropic universe were filled at one time with thermal radiation. If the 
radiation were collisionless or nearly so, the quanta moving parallel to the contracting 
x-axis would get blueshifted and would develop an energy distribution corresponding 
to a high temperature. The quanta moving parallel to the other (expanding) axes 
would be redshifted to an energy distribution corresponding to a low temperature. 
Any collisions taking place between these two systems of particles would introduce 
a “thermal contact” between them, and would transfer energy from the hot system 
to the cold one, with a corresponding large production of entropy. This provides 
an irreversible dissipative process, which decreases so and increases p radiadon 
relative to the values they would have had under conditions of adiabatic expansion. 
[For further details, see, e.g., Matzner and Misner (1972).] 

It is possible that both the adiabatic cooling of anisotropy and the dissipation 
of anisotropy by its action on a gas of almost collisionless quanta have played 
significant roles in the evolution of our universe. In particular, neutrinos above 10 10 K 
may have undergone sufficient v-e scattering to have provided strong dissipation 
during the first few seconds of the life of the universe. 


§30.5. PARTICLE CREATION IN 

AN ANISOTROPIC UNIVERSE 

Adiabatic cooling and viscous dissipation might not be the chief destroyers of 
anisotropy in an expanding universe. More powerful still might be another highly 
dissipative process, which might occur at still earlier times, very near the initial 

“singularity.” This is a process of particle creation which was first treated by DeWitt Creation of particles by 

(1953), then explored by Parker (1966 and 1969) for isotropic cosmologies and finally anisotropy of expansion 

by ZePdovich (1970) in the present context of anisotropic cosmologies. In this process 

one again turns to the Kasner metric for the simplest example, but now quantum- 

mechanical considerations enter the picture. One realizes that not only would real 

quanta propagating in different directions be subject to red shifts and blue shifts, 

but that virtual quanta must be considered as well. Vacuum fluctuations (zero-point 

oscillations) entail a certain minimum number of virtual quanta, which are subject 

to the redshifting and blueshifting action of the strong gravitational fields. Virtual 

quanta that are blueshifted sufficiently violently can materialize as real particles, 

thanks to their energy gain. In this context “sufficiently violently” means not adia - 

batically. 

In an adiabatic expansion, the number of particles does not change, although the 
energy of each one does. This adiabatic limit is just the geometric-optics approxima¬ 
tion to wave equations, which was discussed in §22.5. There one saw that, if spacetime 
were not fiat on the scale of a wavelength, then the wave equation could not be 
replaced by a particle description with conserved particle numbers. Thus, the adia¬ 
batic limit (geometric-optics approximation) is violated in the conditions of high 
curvature near the singularity at the beginning of the universe. 

By studying wave equations in the Kasner background metric, ZePdovich and 
Starobinsky (1971) find quantitatively the consequences of the failure of the adia- 
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Anisotropy might have created 
the matter content of our 
universe, damping itself out 
in the process 


Inhomogeneous cosmological 
models: 

(1) with spherical symmetry 


(2) with (rather symmetric) 
gravitational waves 


(3) near a singularity, with 
few or no symmetries 


batic approximation near the singularity. Classically, the amplitudes of waves at 
frequencies comparable to the Hubble constant for any given epoch increase faster 
than a simple blue-shift calculation would imply (amplification through parametric 
resonance). Quantum-mechanically, the same amplification, applied to zero-point 
oscillations, leads to the creation of particle-antiparticle pairs. The calculations indi¬ 
cate that this effect is very strong at the characteristic time t q = V Gfi/c 5 ~ 10~ 43 
sec. (All calculations performed thus far are inadequate when the effect becomes 
strong, thus for t < t q ). 

For the creation of massless particles, it is essential that an anisotropically ex¬ 
panding universe be postulated (except for scalar particles, for which particle creation 
occurs already in the Friedmann universe, unless the particle satisfies the conformal- 
invariant wave equation). The isotropic Friedmann universes are all conformally 
fiat, so that solutions of the wave equation for a field of zero rest mass can be given 
in terms of solutions for fiat-space wave equations where there is no particle creation. 
There is some particle creation even in the isotropic Friedmann universe when the 
particle has finite rest mass and low energy. However, the particle-creation process 
normally uses anisotropy energy as the energy supply that it converts into radiation 
energy. 

The pioneering work by Parker and Zel’dovich suggests that one should study 
in detail cosmological models in which the initial conditions are a singularity, and 
in which quantum effects near the time t = t q dissipate all anisotropies and simulta¬ 
neously give rise to the matter content of the model. This program of research, which 
is in its infancy, seems to require extrapolating laws of physics down to the very 
natural looking but preposterously small dimension V Gti/c 5 ~ 10~ 43 sec, or equiv¬ 
alently V Gfi/c 3 - 10" 33 cm. - - — 


§30.6. INHOMOGENEOUS COSMOLOGIES 

The model universes considered above were all homogeneous although anisotropic. 
It is also crucial to study inhomogeneous cosmological models, in which the metric 
has a nontrivial dependence on the space coordinates. One class of such models is 
spherically symmetric universes, where the matter density, expansion rate, and all 
other locally measurable physical quantities have spherical symmetry about some 
preferred origin. Models of this sort were first considered by Lemaitre (1933a,b), 
Tolman (1934b), and Datt (1938), and were also treated by Bondi in 1947. These 
models provide a means for studying density perturbations of large amplitude. 

A recent tool is making it possible to study large-amplitude, spatially varying 
curvature perturbations of other symmetries; this tool is the Gowdy (1971, 1973) 
metrics. These metrics, which are exact solutions of the Einstein equations, represent 
closed universes with various topologies (S' 3 , S 1 X S 2 , T 3 ) containing gravitational 
waves. The wave form in these solutions is essentially arbitrary, but all the waves 
propagate along a single preferred direction and have a common polarization. 

A rather different approach to understanding the behavior of inhomogeneous and 
anisotropic solutions of the Einstein equations has been developed by Khalatnikov, 
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Lifshitz, and their colleagues. Rather than truncate the Einstein theory by limiting 
attention to specialized situations where exact solutions can be obtained, they have 
sought to study the widest possible class of solutions, but to describe their behavior 
only in the immediate neighborhood of the singularity. These studies give a greatly 
enhanced significance to some of the exact solutions, by showing that phenomena 
found in them are in fact typical of much broader classes of solutions. 

Thus, in the first large class of solutions studied [Lifshitz and Khalatnikov (1963)], 
it was found that near the singularity solutions containing matter showed no features 
not already found in the vacuum solutions. Furthermore, space derivatives in the 
Einstein equations became negligible near the singularity in these solutions, with 
the consequence that a metric of the Kasner form [equation (30.1)] described the 
local behavior of spacetime near the singularity, but with a different set of p i values 
possible at each point of the singular hypersurface. Subsequently, broadened studies 
of solutions near a singularity [Belinsky and Khalatnikov (1970)] showed that the 
mixmaster universe [Misner (1969b); Belinsky, Khalatnikov, and Lifshitz (1970)] is 
a still better homogeneous prototype for singularity behavior than the Kasner metric. 


§30.7. THE MIXMASTER UNIVERSE 

The simplest example of a mixmaster universe is described in Box 30.1. It shows 
how, near the singularity, the Kasner exponents p i can become functions of time. 
The result is most simply described in terms of the Khalatnikov-Lifshitz param¬ 
eter u: 

p x - -w/0 + u + « 2 X 

= 0 + «)/(! + u + « 2 )> (30.5) 

p 3 = m(1 + u)/{ 1 + u + w 2 ). 

As one extrapolates backward in time toward the singularity, one finds that the 
expansion rates in the three principal directions correspond to those of the Kasner 
metric of equation (30.1), with p i values corresponding to some fixed u parameter. 
In these mixmaster models, however, the metric is not independent of the space 
coordinates (the spacelike hypersurfaces can, for instance, have the same 3-sphere 
topology as the closed Friedmann universes). 

The Kasner-like behavior at fixed u can persist through many decades of volume 
expansion before effects of the spatial derivatives of the metric come into play. The 
role then played by the space curvature is brief and decisive. The expansion is 
converted from a type corresponding to a parameter value u = w 0 to a type corre¬ 
sponding to the value u — — u 0 (which is equivalent, under a relabeling of the axes, 
to the value w = w 0 — 1). Extrapolating still farther back toward the singularity, one 
finds a previous period with u = w 0 — 2, Throughout an entire sequence u = w 0 , 
w 0 — 1, w 0 — 2, u 0 — 3,..., with u 0 > 1, nearly the entire volume expansion is due 
to expansion in the 3-direction, whereas the 1- and 2-directions change very little, 
alternating at each step between expansion and contraction. Sufficiently far in the 
past, however, such a sequence leads to a value of u between 0 and 1. This value 


Mixmaster universe: 

(1) "anisotropy oscillations 
explained in terms of 
Kasner model 
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(2) as a prototype for generic 
behavior near 
singularities 


Are there any other generic 
types of behavior near 
singularities? 


can be interpreted as the starting point for another, similar sequence, through the 
transformation u —1/m, which interchanges the names of axes 2 and 3. 

The extrapolation of the universe’s evolution back toward the singularity at / = 0 
therefore shows an extraordinarily complex behavior, in which similar but not 
precisely identical sequences of behavior are repeated infinitely many times. In terms 
of a time variable which is approximately log (log r 1 ), these behaviors are quasi- 
periodic. In the generic example to which the Khalatnikov-Lifshitz methods lead, 
one has a metric whose asymptotic behavior near the singularity is at each point 
of the singular hypersurface described by a mixmaster-type behavior, but with the 
principal axes of expansion changing their directions as well as their roles (as 
characterized by the u parameter) at each step, and with the mixmaster parameters 
spatially variable. [For more details see Belinsky, Lifshitz, and Khalatnikov (1971), 
and Ryan (1971, 1972).] 

It is not yet (1972) known whether there are important solutions or classes of 
solutions relevant to the cosmological problem, with asymptotic singularity behavior 
not described by the Khalatnikov-Lifshitz generic case. The difficulty in reaching 
a definitive assessment here is that Khalatnikov and Lifshitz use essentially local 
methods, confined to a single coordinate patch, whereas the desired assessment poses 
an essentially global question. The global approaches (described in Chapter 34) have 
not, however, provided any comparable description of the nature of the singularity 
whose necessity they prove. One attempt to bridge these differences in technique 
and content is the work by Eardley, Liang, and Sachs (1972). 

(continued on page 8J5) 
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Box'30.1 THE MIXMASTER 
COSMOLOGY 


The Mixmaster Cosmology is a valuable example. 
As described in §30.7, it shows a singularity be¬ 
havior which illustrates most of the features of the 
most general examples known. In particular, it 
shows how properties of empty space reminiscent 
of an elastic solid become evident near the cos¬ 
mological singularity. 

The mathematical path to this example, as given 
in this box, also illustrates several important tech¬ 
niques in using the variational principles for the 
Einstein equations to elucidate the solution of 
these equations. The Mixmaster example can also 
be used to provide simple examples of superspace 
ideas and of quantum formulations of the laws of 
gravity [Misner (1972a)]. 



A Generalized Kasner Model 

Two generalizations must be implemented in order 
to progress from the Kasner example (30.1) of a 
cosmological singularity to the Mixmaster exam¬ 
ple. The first is to allow a more general time- 
dependence while preserving some of the sim¬ 
plicity of the conditions (30.2) on the exponents p { . 
Note that these exponents satisfy, e.g., p 2 = 
d\ng 22 /d\n g. Therefore one is led to parametrize 
the 3x3 spatial metric as 

g« = **{*% ( 1 ) 

or equivalently, (In g) if = 2a 8 i§ + 2 fi ij9 where 
is a traceless 3x3 symmetric matrix, and the 
exponential is a matrix power series, so det e 2f * = 1 
and 

Vg = e 3 *. (2) 

For the purposes of this paragraph only, define 
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p i; - = d(\n gXj/d In det g. Then from equations (1) 
and (2), one computes 

Pi} = j[8« + (dPif/da)]; (3) 

so the one Kasner condition 

- -JL = y. P i = trace p if = 1 + — trace (dp/da) 

i ~ - - 

is an identity in view of trace /? i; - = 0. The second 
condition on the Kasner exponents is trace ( p 2 ) = 
1, and becomes (dfi^/da) 2 = 6 by equation (3). 
This is not an identity, but a consequence of 
the Einstein equations in empty space. For the 
(Bianchi Type I) metric 

ds 2 =: — dt 2 4- dx* dx\ (4) 

and in the case when /? i; - is diagonal, the Einstein 
equations are, 

(ff = T[ T0 ° + -ih w « /d,)2 ] < 5 > 

and 



together with a redundant equation involving Tfcfc 
and the equation T 0k — 0* [The stress components 
here refer to an orthonormal frame with basis 
1-forms oo* = e a (eP) i jdx j .] From equation (5) one 
immediately derives 

Panisod, = (c 2 /16 mGXdPy/dt? (7) 

as a formula for the effectiveness of Type I anisot¬ 
ropy in contributing to the Hubble constant H = 
da/dt on a basis comparable to matter energy, 
as in equation (30.4). Similarly, for equation (6) 
in the case of fluid matter (isotropic pressures), the 
stress terms vanish, and one obtains Panisod)^ = 
const., as in the equation following (30.4). The 
Kasner condition 2p 2 = 1 or (dfi^/da) 2 — 6 fol¬ 
lows from equation (5) whenever T 00 < Paul¬ 
in the diagonal case, /? i; - has only two independ¬ 



ent components, and it is convenient at times to 
define them explicitly by the parameterization 

fin = fi+ + V5/L> 

p 22 = P + - V3/L, (8) 

^33 = — 2 /?+• 

For these the Kasner condition (dfi^/da) 2 = 6 
becomes 

{dpjdaf + (dpjda) 2 =1. (9) 

The p ± are related to the Kasner exponents p i or 
the u parameter of equations (30.5) by 

dpjda =1(1 -3p 3 ) 

= -1 + (3/2)(l +U + U*)-' 

dpjda = i V3( Pl - p 2 ) 

= - j V3(l + 2«)(1 + u + u 2 )~\ 

Introducing Space Curvature 

The first step in generalizing the Kasner metric has 
focused attention on the “velocity” j3' = (dfi + /da 9 
dfi_/da) which is a derivative of anisotropy with 
respect to expansion. The effects of matter or, as 
will soon appear, space curvature can change the 
magnitude lijB'll from the Kasner value of unity. 
The second step of generalization is to introduce 
space curvature. This one achieves in a simple 
example by retaining the metric components of 
equation (1), but employing them in a non-holo- 
nomic basis. Use the basis vectors introduced in 
exercises 9.13 and 9.14 on the rotation group 
S0(3), whose dual 1-forms are 

a 1 = cos d9 -f sin \p sin 8 d<j>, 
o 2 = sin d8 — cos \p sin 8 d<p, (11) 

a 3 = d\p + cos 8 d<p , 
to form the metric 

ds 2 = — N 2 dt 2 -f e 2a (e 2 ^) iJ - 0 i a J , (12) 
where N, a, and /? i; - are functions of t only. When 
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Box 30.1 (continued) 

a — 0 = fity the three-dimensional space metric 
here reduces to the one studied in exercise 13.15, 
which is the metric of highest symmetry on the 
group space 50(3). The simply connected covering 
space has the 3-sphere topology, and is obtained 
by extending the range of the Euler angle ^ to give 
it a 477 period [50(2) or spin \ covering of the 
rotation group]. With N — 1, \a — e a , and = 0, 
one obtains from equation (12) the same metric 
(in different coordinates) as that treated in exercise 
14.4 and in Chapter 27 in discussions of the closed 
Friedmann cosmological model. A non-zero value 
for /? i; - allows the 3-sphere to have a different cir¬ 
cumference on great circles in each of 3 mutually 
orthogonal principal directions, thus destroying its 
isotropy but not its homogeneity. 

Let us consider only the case with /? i; diagonal, 
as in equation (8). Then the 7 00 Einstein equation 
becomes (with N = 1 as a time-coordinate condi¬ 
tion) 

3(d 2 - fl + 2 - K 2 ) + j ( 3 *ix) = ^T 00 , (13) 
where only the term 

s R lx = y e~ 2a trace (2e~ 2(3 — e 4/? ) (14) 

is different from equation (5). This term [see equa¬ 
tion (21.92)] is the scalar curvature of a three- 
dimensional slice, t = const [which has symmetry 
properties known as “Bianchi Type IX” for the 
metric of equations (11) and (12)]. If equation (13) 
is interpreted in terms of an anisotropy energy 
density contributing, with T 00 , to the volume ex¬ 
pansion a 2 , then there are not only kinetic energy 
terms ft 2 [as in equations (5) and (7)], but also a 
potential energy term. This term shows that nega¬ 
tive scalar curvature, which can be produced by 
anisotropy (fi ^ 0), is equivalent to a positive po¬ 
tential (or “internal”) energy, and suggests that 
empty space has properties with analogies to an 
elastic solid and resists shear strains. The more 
detailed analysis which follows shows that, near 



the singularity, the scalar curvature is always neg¬ 
ligible when positive. 

Negative curvatures, however, arise in this 
closed universe from large shear (/?) deformations 
near the singularity and become large enough to 
reverse one Kasner shear motion [w-value, etc.; 
equation (10)] and change it to another. 

These conclusions and further details of the 
time-evolution of the “Mixmaster” metric (11, 12) 
require, in principle, the study of all the Einstein 
equations, not just equation (13) for 7 00 . As de¬ 
scribed in Chapter 21, however, this T 00 constraint 
equation is central, and actually contains implicitly 
the full content of the Einstein equations when 
formulated properly. 

Variational Principles 

One adequate formulation, adopted here, involves 
treating equation (13) not as an energy equation 
(involving velocities), but as a Hamiltonian (in¬ 
volving momenta). Take the Einstein variational 
principle (21.15) in ADM form (21.95) and carry 
out the space integration, using 

J o 1 A o 2 A o 3 = J sin 6 d<p A d0 A d\p = (4t7) 2 , 

to obtain the action integral in the form 

I={R) + 

+ e_6o (y( wi t) 2 - 

When introducing the specific form (1) and (8) for 
g i; , it is convenient also to parameterize the diago¬ 
nal matrix as follows: 



with 

6 P\=P+ +P-V- 3, 
6p\=p + -p_V3, (17) 

6 P 3 s = “2 P+ 
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[see equation ( 8 )]. The result is 

I = f p+df] + + p_ dp_ + p a da 

l~Pl+P + 2 +P- 2 
- 24wV*( 3 /? IX )] dt. 


Ne 


24 w 


This is cleaned up for further study as follows. 
Write 


3 *ix = fe- 2a (i - n 


(18) 


where 


straint for its conjugate Hamiltonian. Here an ob¬ 
vious and satisfactory choice is to set t = a, and 
solve % — 0 for 

#ADM = ~Pa = W + P 2 + e*(V ~ 1)] 1/2 - 

(23) 

The a equation [vary p a in equation (21)] is 

d = -(3 t r/2f/ 2 Ne-^p a (24) 

and shows that the choice a = t (so a = 1 ) re¬ 
quires 

JVadm = (2/3»)” W/W (25) 


V = V(P) = J trace (1 - le'W + e 4 ») (19) 

so K( 0 ) = 0 ; and adjust the zero of a (<x — : »- 
a — a 0 ) so that e 2a —( 67 r) -1 ^ 2 ®. Then the metric 
is 

ds 2 = —N 2 dt 2 (67 T)~ 1 e 2a (e 2 ^) ij o x o i , ( 20 ) 

and the variational integral is 

/= fp+dp + +p_dp_+p a da 
- ($7r/2) 1/2 Ne- 3a X dt, 

with 

2; % = -p a 2 + p 2 + />_ 2 + - 1). (22) 

Owe demands SI = 0 for arbitrary independent var¬ 
iations of p ± , p a , P ±9 a, N to obtain the Einstein 
equations. From varying A, one obtains the funda¬ 
mental constraint equation % - 0 [which would 
reduce to the vacuum version of equation (13) 
when the momenta are replaced by velocities (via 
equations obtained by varying the p's) if the coor¬ 
dinate condition N — 1 were imposed.] 

ADM Hamiltonian 


The reduced, canonical, variational principle 
which results when equation (23) is used to elimi¬ 
nate p a reads SI red = 0 with 

^red = f P+ + P- dP- ~ ^ADM d& (26) 

and must be supplemented by equation (25). 

Super-Hamiltonian 

A more convenient approach here is one .more 
closely related to the Dirac Hamiltonian methods 
than those of ADM. Note, however, that one does 
not remove the arbitrariness in the lapse function 
by taking it to be some specified function N(t) of 
the coordinates. Instead the procedure adopted 
here is to eliminate N from the variational princi¬ 
ple ( 21 ) by choosing it (coordinate condition!) to 
be some chosen function of the field variables and 
momenta, N = N (a, P±,p a ,p±)- Any such choice, 
inserted in equation ( 21 ), leaves a variational inte¬ 
gral in canonical Hamiltonian form. The content 
of this new variational principle becomes equiva¬ 
lent to the original one only when supplemented 
by the constraint 

ar = o, (27) 


The standard ADM prescription for reducing this 
variational principle to canonical (Hamiltonian) 
form is to choose one of the field variables or 
momenta as a time-coordinate, and solve the con¬ 



which can no longer be derived from the varia¬ 
tional principle. [The other Euler-Lagrange equa¬ 
tions for these two principles differ only by terms 
proportional to and thus are equivalent when 
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Box 30.1 (continued) 

DC = 0 is imposed on the initial conditions.] The 
choice 

N = (2/2*) l/2 e* (28) 

is obvious and convenient. It makes DC become a 
super-Hamiltonian in the resulting variational 
principle 

/ = f p+ d(i + + p_ dp_ + p a da — DC d\ (29) 

where t = X has been written to label the specific 
time-coordinate choice that equation (28) implies. 


Mixmaster Dynamics 

If matter terms with no additional degrees of free¬ 
dom are included, the super-Hamiltonian in equa¬ 
tion (29) is modified simply. For an example, 
choose 

T 00 _ _ T Q q _ (3/4)2(^-Sa + f e - 4«) (30) 

for the energy density of matter in a frame with 
time-axis e$ = N~\d/dt ). The two terms represent 
a nonrelativistic perfect fluid (p oc V~ 1 ) and a 
radiation fluid (p oc F~ 4/3 ), respectively, and lead 
to 


and 

(33) 

Thus the sign of the p a 2 term causes a to accelerate 
toward (rather than away from) higher values of 
the “potential” terms e**(V — 1) + pe 3a + Fe 2a . 
When \ V\ 4. 1 (small anisotropy), equation (33) is 
identical to its form in the isotropic Friedmann 
model, and allows a deceleration only when a is 
large enough that the positive curvature term 
( —e 4 ") dominates over matter (pe 3 ®) and radiation 
(/V*). Near the singularity (a —— oo), the 
positive curvature term is always negligible 
compared to radiation and matter. 

For studies of the singularity behavior, it is 
sufficient to study the simplified super-Hamilton¬ 
ian 

~- p a 2 + p + 2 + p- 2 + (34) 

since the other terms obviously vanish for a —*- 
— oo. This form retains only the V term in 3 R IX = 
§e~ 2a (l — V), which dominates when the cur¬ 
vature of this closed universe becomes negative, 
1, If the term in V(/3) were also negligible, 
then DC = — p a 2 + p + 2 -f p_ 2 would make each 
p a , p ± constant, giving the Kasner behavior with 


2% = -p 2 + p + 2 + pj + e*°(V - 1 ) 

+ i ue 3a + re 2 *. ( ’ 

This Hamiltonian, with its simple quadratic mo¬ 
mentum dependence, differs in only two ways 
from the Hamiltonians of elementary mechanics, 
namely, (1) in the sign of the p 2 term and (2) in 
the detailed shape of the “potential” term as func¬ 
tion of a and the study of which reduces to 
a study of the function Hamilton’s equa¬ 

tions, from varying a, p a , and p ± in equation 
(29), yield 


dp ± /da — p ± /p a ~ const 

and \dfi/da\ 2 = 1 as expected (since matter and 
curvature have been neglected). To proceed fur¬ 
ther, a study of V(fi) is required, based on equa¬ 
tions (19) and (8), and their immediate conse¬ 
quence: 

V(P) = - e~ sf3 * - - e~ 2f} * cosh 2 \/30_ 

3 , (35) 

+ 1 -f — e 4 ^+(cosh 4 \/3/L — 1). 


d 2 P± _ __ dDt __ ]_ e 4 a d Y_ Y32) One fi n ^ s that V(/3) is a positive definite “potential 

d\ 2 ~ d/3 ± ~ 2 well” which has the same symmetries as an equi- 



§30.7. THE MIXMASTER UNIVERSE 


811 


r~ 

lateral triangle in the /?+/?_ plane. Near the origin, 
fi ± — 0, the equipotentials are circles, since 


V(P) = 8 (P + * + P. 

- 2 ) + 0(p 3 ). 

(36) 

For large (3 values, one finds 



p + —>- -00, 

(37) 

and 



V(p) ~ 1 + 16 

P+ -+ 00, 

\p-\ < 1- 

(38) 


These two asymptotic forms, together with the 
triangular symmetry, give a complete asymptotic 
description of F(/?), as sketched in the figure, 
where on successive levels separated by zl/3 == 1, 
the potential F increases by a factor of e 8 = 
3 X 10 3 . 



“Bounce” Interrupts Kasner-like Steps 
Toward the Singularity 

The dominant feature of the F(/?) potential is 
evidently its steep (exponential) triangular walls, 
with equation (37) representing the typical one for 
study. Under the influence of this potential wall, 
the evolution of this model universe is governed 
by the super-Hamiltonian 

2% ~ -p tt 2 + p + 2 + pJ + j e iia ~ (39) 

If a —*- —oo with dfijda > 1/2 [recall d$+/da 
— const., \dfi/da\ = 1, when the last term in (39) 
is small], then the potential term grows and will 
eventually become large enough to influence the 
motion. A simple “Lorentz” transformation, sug¬ 
gested by the superspace metric (coefficients of the 





Some equipotentials, V(j3) = constant, are shown for 
the function defined in equation (35). Equipotentials 
near the origin of the /?-plane are closed curves for 
V < 1 and are omitted here. 
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Box 30.1 (continued) 

quadratic in the momenta) simplifies the compu¬ 
tation further. Set 

£ + = (/? + -ya)/V374; 

« = (« 


and find 

2% = -p a 2 + P 2 + P- + j exp (-4 vT/? + ). 

(40) 

For this super-Hamiltonian both_ p a and p_ are 
constants of motion, whereas the /^-Hamiltonian, 
p+ 2 + represents a simple bounce against 

a one-dimensional potential wall with the initial 
and final values of p + different only in sign. The 
behavior of the anisotropy parameters (3 ± near the 
singularity thus consists of a simple Kasner step 
(where dfijda — const., with the dfi+/da > or 
conditions equivalent by symmetry, satisfied rela¬ 
tive to one of the three walls), followed by a 
bounce against that wall, beginning a new Kasner 
step with other Kasner parameters. [The most de¬ 
tailed description of this behavior and its relation 
to more general cosmological models can be found 
in Belinsky, Khalatnikov, and Lifshitz (1970)—see 
also the briefer report, Khalatnikov and Lifshitz 
(1970)—using quite different methods. For de¬ 
tailed developments by Hamiltonian methods, 
which supercede the partial Lagrangian methods 
of Misner (1969b), see Misner (1970, 1972a), and 
Ryan (1972a,b).] 


Steady-State, Quasiperiodic Infinity of 
"Bounces" Approaching the Singularity 

Some comprehensive features of the singularity 
behavior, involving many Kasner-like steps, can 
be exhibited by another transformation of the 
parameter space (superspace) of the metric field. 


The transformation introduces a “radial” /- 
coordinate out from the origin of a/3 ± space, while 
respecting the metric properties of this superspace 
implied by the form of the super-Hamiltonian. 
Thus one defines (for any constant a 0 ) 

a 0 — a = e l cosh 

P + — e 1 sinh f cos <f>, (41) 

/?_ — e l sinh f sin<J>, 

and finds 


2X= e~ 2 \(-p 2 4- P y 2 + p 2 sinh“ 2 f) + e*e*°V]. 

(42) 


The advantage of this transformation is that in the 
limit t —oo (a —^ — oo, singularity) the po¬ 
tential terms become, in first approximation, inde¬ 
pendent of t. Thus equation (37) gives, for one 
potential wall, 


e 2t e 4ay^ _L e 2t 


exp 



— $e l ^sinh £ cos $ -f- y cosh £ j . 


(43) 


For t —oo this expression evidently tends to 
either zero or infinity, depending on the sign of 
the expression in parentheses. Therefore define the 
asymptotic potential walls by 

tanh £ + y sec <p = 0 (44) 


in the sector \<j> - tt\ < tt/3, and equivalent for¬ 
mulae in which <p is replaced by <j> zt (2 tt/3) for 
the other sides of the triangle. Consequently, an 
asymptotic approximation to the super-Hamil¬ 
tonian is 

2% — e~ 2t [-p t 2 + py 2 -h /? 0 2 sinh“ 2 f + K'(f, <*>)], 

(45) 

where V'(£, <p) vanishes inside the asymptotic walls 
(44) and equals + oo outside. Because the remain¬ 
ing r-dependence is a common factor in (45), a 
simple change of independent variable e~ 2t dX = 
dX' in equation (29)—equivalent to the choice 

N — (2/37 r) 1/2 e~ 2t exp [3(a 0 — ^ cosh f)] (46) 
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in place of equation (28)—gives a new super- 
Hamiltonian %’ s= e 2t % with the variational 
integral 

1 = f p t dt +Psd$ +Ptd<t>- X'd\’. (47) 

In the asymptotic approximation where 

2X' = -p 2 + p? + p 2 sinh" 2 f + K'(f, *), (48) 

one immediately sees that is a constant of mo¬ 
tion, and that the “bouncing” of the f<f> values 
within the asymptotic potential walls is a station¬ 
ary, quasi-periodic process in this time-coordinate 
X' (or t, since dt/dX ' = —p t — const). [More de¬ 
tailed studies based on this asymptotic super- 
Hamiltonian show that the motion is even ergodic, 
with £<(> approaching arbitrarily close to any given 
value infinitely many times as t —oo; see Chitre 
(1972a).] 

Summary 

One has found the singularity behavior in this 
Mixmaster example to be extraordinarily active. 
In the simple Kasner singularity, two axes col¬ 
lapse, but the third is stretched in a simple tidal 
deformation accompanied by volume compres¬ 
sion. But in the Mixmaster example, every such 
collapse attempt is defeated by the high negative 
curvature it implies. Or rather it is diverted to 
another attempt as compression continues in¬ 
exorably, but the tidal deformations attempt first 
one configuration, then another, in an infinitely 
recurring probing of all possible configurations. 

Speculations on Time and the Singularity 

The cosmological singularity (in all examples 
where its character is not known to be unstable) 
involves infinite curvature and infinite density. 
One’s abhorence of such a theoretical prediction 
is particularly heightened by the correlative pre¬ 
diction that these infinities occurred at a finite 
proper time in the past, and would—if they 




recur—occur again at some finite proper time in 
the future. The singularity prediction would be 
more tolerable if the infinite densities could be 
removed to the infinitely distant past. The universe 
could then, as now, find its natural state to be one 
of expansion, so every finite density will have been 
experienced at some suitably remote past time, but 
infinite density becomes a formal abstraction never 
realized in the course of evolution. 

To push infinite curvature out of the finite past 
might be achieved in two ways. It is not known 
which, if either, works. One way is to change the 
physical laws which require the singularity, chang¬ 
ing them perhaps only in obvious and desirable 
ways, such as stating the laws of gravity in a proper 
quantum language. Computations of quantum ge¬ 
ometry are not yet definitive, however, and some 
(perhaps inadequate) approximations [Misner 
(1972a)] do not remove the singularity problem. 

Another way to discard the singularity is to 
accept the mathematics of the classical Einstein 
equations, but reinterpret it in terms of an infinite 
past time. There are, of course, simple and utterly 
inadequate ways to do this by arbitrary coordinate 
transformations such as t = In r which change a 
r = 0 singularity into one at t — — oo. But an 
arbitrary coordinate is without significance. The 
problem is that the singularity occurs at a finite 
proper time in the past, and proper time is the most 
physically significant, most physically real time we 
know. It corresponds to the ticking of physical 
clocks and measures the natural rhythms of actual 
events. To reinterpret finite past time as infinite, 
one must attack proper time on precisely these 
grounds, and claim it is inadequately physical. On 
a local basis, where special relativity is valid, no 
challenge to the physical significance of proper 
time can succeed. It is on a more global scale that 
the physical primacy of proper time needs to be 
reviewed. 

“The cosmological singularity occurred ten 
thousand million years ago.” In this statement, 
take time to mean the proper time along the world 
line of the solar system, ephemeris time. Then the 
statement would have a most direct physical sig- 
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Box 30.1 (continued) 

nificance if it meant that the Earth had completed 
10 10 orbits about the sun since the beginning of 
the universe. But proper time is not that closely 
tied to actual physical phenomena. The statement 
merely implies that those 5 X 10 9 orbits which the 
earth may have actually accomplished give a 
standard of time which is to be extrapolated in 
prescribed ways, thus giving theoretical meaning 
to the other 5 X 10 9 years which are asserted to 
have preceeded the formation of the solar system. 

A hardier standard clock changes the details of 
the argument, but not its qualitative conclusion. 
To interpret 10 10 years in terms of SI (Systeme 
Internationale) seconds assigns a past history con¬ 
taining some 3 X 10 27 oscillations of a hyperfine 
transition in neutral Cesium. But again the critical 
early ticks of the clock (needed to locate the 
singularity in time by actual physical events) are 
missing. The time needed for stellar nucleo¬ 
synthesis to produce the first Cesium disqualifies 
this clock on historical grounds, and the still earlier 
high temperatures nearer the singularity would 
have ionized all Cesium even if this element had 
predated stars. 

Thus proper time near the singularity is not a 
direct counting of simple and actual physical 
phenomena, but an elaborate mathematical 
extrapolation. Each actual clock has its “ticks” 
discounted by a suitable factor—3 X 10 7 seconds 
per orbit from the Earth-sun system, 1.1 X 10~ 10 
seconds per oscillation for the Cesium transition, 
etc. Since no single clock (because of its finite size 
and strength) is conceivable all the way back to 
the singularity, a statement about the proper time 
since the singularity involves the concept of an 
infinite sequence of successively smaller and 
sturdier clocks with their ticks then discounted and 



added. “Finite proper time,” then, need not imply 
that any finite sequence of events was possible. It 
may describe a necessarily infinite number of 
events (“ticks”) in any physically conceivable his¬ 
tory, converted by mathematics into a finite sum 
by the action of a non-local convergence factor, 
the “discount” applied to convert “ticks” into 
“proper time.” 

Here one has the conceptual inverse of Zeno’s 
paradox. One rejects Zeno’s suggestion that a sin¬ 
gle swing of a pendulum is infinitely complicated— 
being composed of a half period, plus a quarter 
period, plus 2~ n ad infinitum —because the terms 
in his infinite series are mathematical abstractions, 
not physically achieved discrete acts in a drama 
that must be played out. By a comparable stand¬ 
ard, one should ignore as a mathematical abstrac¬ 
tion the finite sum of the proper-time series for 
the age of the universe, if it can be proved that 
there must be an infinite number of discrete acts 
played out during its past history. In both cases, 
finiteness would be judged by counting the num¬ 
ber of discrete ticks on realizable clocks, not by 
assessing the weight of unrealizable mathematical 
abstractions. 

Whether the universe is infinitely old by this 
standard remains to be determined. The quantum 
influences, in particular, remain to be calculated. 
The decisive question is whether each present- 
epoch event is subject to the influence of infinitely 
many previous discrete events. In that case statisti¬ 
cal assumptions (large numbers, random phases, 
etc.) could enter in stronger ways into theories of 
cosmology. The Mixmaster cosmological model 
does have an infinite past history in this sense, 
since each “bounce” from one Kasner-like motion 
to another is a recognizable cosmological event, 
of which infinitely many must be realized between 
any finite epoch and the singularity. 
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§30.8. HORIZONS AND THE ISOTROPY OF 
THE MICROWAVE BACKGROUND 


The fundamental cosmological question—“Must a universe that is bom chaotic 
necessarily become as homogeneous and isotropic as our universe is, and do so before 
life evolves?”—entails one further issue. This issue is horizons. As was discussed in 
§27.10, at any given epoch in the expansion of a Friedmann universe (e.g., the present 
epoch), there may be significant portions of the universe from which no fight signal 
or other causally propagating influence will have yet reached Earth in the time 
available since the initial singularity. “If we should live so long,” the question would 
arise, “will the new portions of the universe which first come into view during the 
next ten billion years look statistically identical to the neighboring portions which 
are already being seen?” 

Fortunately, this question need not be posed only for the future. It can be asked 
as of some past time, and the answer then is yes. Microwave background radiation 
arrives at the earth from all directions in the sky with very nearly the same tempera¬ 
ture. [The data of Boughn, Fram, and Partridge (1971) and of Conklin (1969) show 
AT/~T-< 0.004.] The plasma that emitted the microwave radiation coming to earth 
from one direction in the sky had not been able, before the epoch of emission, to 
communicate causally with the plasma emitting the radiation that arrives from other 
directions. If one adopts a Friedmann model of the universe, then different sectors 
of the microwave sky are disjoint from each other in this sense if they are separated 
from each other by more than 30°, even if the microwaves were emitted as recently 
as z — 7. (The critical angle is much smaller if the microwaves were last scattered 
at z — 1,000.) From this, one concludes that the foundations for the homogeneity 
and isotropy of the universe were laid long before the universe became approximately 
Friedmann, for if statistical homogeneity and isotropy of the universe had not already 
been achieved at the longest wavelengths earlier, these horizon limitations would 
have prevented any further synchronization of conditions over large scales while 
the universe was in a nearly Friedmann state, and small amplitude (10%) deviations 
from isotropy should be observed now. 

The mixmaster universe received its name from the hope that it could contribute 
to the solution of this problem. The very large u values that occur sporadically an 
infinite number of times near the singularity in a mixmaster universe give a geometry 
close to that of the Kasner model with p l — 1, p 2 = P 3 = 0- This model can be 
written in the form 


Horizons in a Friedmann 
universe 


Observed isotropy of 
microwave radiation proves 
foundations for homogeneity 
were laid before universe 
became Friedmann-like 


What made the universe 
homogeneous and isotropic? 

(1) Mixmaster oscillations?— 
probably not 


ds 2 = e 2v ( — drf 2 4 dx 2 ) 4 - dy 2 4 dz 2 y (30.6) 

where tj = In t. If this metric is converted into a closed-universe model by inter¬ 
preting x, y , z as angle coordinates each with period 47r, then one sees that a fight 
ray can circumnavigate the universe in the x-direction in a time interval Ar] = 4tt, 
which corresponds to a volume expansion by a factor V“ gi/V —~ e47r - Unfor¬ 
tunately, a quantitative analysis of the degree and frequency with which the mix- 
master universe achieves this specific Kasner form suggests that the horizon breaking 
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is inadequate to explain the present state of the universe [Doroshkevich, Lukash, 
(2) particle creation near and Novikov (1971); Chitre (1972)]. It may turn out that particle creation near the 

singularity? singularity can solve this horizon question, as well as provide for the dissipation 

of anisotropy. Hope is provided by the fact that particle creation, when described 
in purely classical terms, has some acausal appearances, even though it is a strictly 
causal process at the quantum level [Zel’dovich (1972)]. 



™VII 


GRAVITATIONAL COLLAPSE 
AND BLACK HOLES 

Wherein the reader is transported to the land of black holes, and 
encounters colonies of static limits, ergospheres, 
and horizons—behind whose veils are hidden gaping, 

ferocious singularities. 



CHAPTER 


SCHWARZSCHILD GEOMETRY 




§31.1. INEVITABILITY OF COLLAPSE FOR 
MASSIVE STARS 


There is no equilibrium state at the endpoint of thermonuclear evolution for a star 
containing more than about twice the number of baryons in the sun (A > A max — 2A Q ). 
This is one of the most surprising—and disturbing—consequences of the discussion 
in Chapter 24. Stated differently: A star with A > A max — 2A Q must eject all but 
^max i ts baryons—e.g., by nova or supernova explosions—before settling down 
into its final resting state;-Otherwise there will be no final resting state for it to settle 
down into. 

What is the fate of a star that fails to eject its excess baryons before nearing the 
endpoint of thermonuclear evolution? For example, after a very massive supernova 
explosion, what will become of the collapsed degenerate-neutron core when it 
contains more than ^ max baryons? Such a supercritical mass cannot explode, since 
it is gravitationally bound and it has no more thermonuclear energy to release. Nor 
can it reach a static equilibrium state, since there exists no such state for so large 
a mass. There remains only one alternative; the supercritical mass must collapse 
through its “gravitational radius,” r — 2M, leaving behind a gravitating “black hole” 
in space. 

The phenomenon of collapse through the gravitational radius, as described by 
classical general relativity, will be the subject of the next chapter. However, before 
tackling it, one must understand more fully than heretofore the Schwarzschild 
spacetime geometry, which surrounds black holes and collapsing stars as well as 
static stars. 

This chapter will concern itself with two topics that, at first sight, appear to be 
disconnected. One is thejfall of a test particle in a preexisting Schwarzschild geom¬ 
etry, which is regarded as static, but can also be visualized as all that remains of 
a star that underwent collapse some time ago. The second topic is the physical 


This chapter, on 
Schwarzschild geometry, is 
key preparation for 
understanding gravitational 
collapse (next chapter) and 
black holes (following 
chapter) 
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character of this geometry, regarded in and by itself. For the exploration of this 
geometry, the test particle serves as the best of all explorers. But the test particle 
may also be regarded in another light. It can be viewed as a rag-tag johnny-come- 
lately piece of the matter of the falling star. Regarded in this way, it provides the 
simplest of all illustrations of an asymmetryin the distribution of mass of a collapsing 
star. That this asymmetry irons itself out will therefore give one some preliminary 
insight into how more complicated asymmetries also iron themselves out. In brief, 
the motion of the test particle and the dynamics of the Schwarzschild geometry (for 
this geometry will prove to be dynamic), two apparently different problems, have 
the happy ability to throw light on each other. 


§31.2. THE NONSINGULARITY OF 

THE GRAVITATIONAL RADIUS 


The Schwarzschild spacetime geometry 

ds 2 = - (l - —) dt 2 + dr *,, + r2 (M 2 + sin 2 # <^ 2 ) ( 3L1 ) 

\ r / 1 — 2 M/r 


The Schwarzschild line 
element becomes singular at 
r = 2 M ("gravitational 
radius") 


appears to behave badly near r = 2M; there g tt becomes zero, and g„ becomes 
infinite. However, one cannot be sure without careful study whether this pathology 
in the line element is due to a pathology in the spacetime geometry itself, or merely 
to a patholo gy of the (t, r, 6, o) coordinate system near r = 2 M. (As an example of 
a coordinate-incluced pathology, consider the neighborhood of 6 = 0 on one of the 
invariant spheres, t - const and r = const. There g 00 becomes zero because the 
coordinate system behaves badly; however, the intrinsic, coordinate-independent 
geometry of the sphere is well-behaved there. For another example, see Figure 1.4. 

The worrisome region of the Schwarzschild geometry, r = 2 M, is called the “grav¬ 
itational radius,” or the “Schwarzschild radius,” or the “Schwarzschild surface,” 
or the “Schwarzschild horizon,” or the “Schwarzschild sphere.” It is also called the 
“Schwarzschild singularity” in some of the older literature; but that is a misnomer, 
since, as will be shown, the spacetime geometry is not singular there. 

To determine whether the spacetime geometry is singular at the gravitational 
radius, send an explorer in from far away to chart it. For simplicity, let him fall 
freely and radially into the gravitational radius, carrying his orthonormal tetrad with 
him as he falls. His trajectory through spacetime [“parabolic orbit”; radial geodesic 
of metric (31.1)] is 


t 

2M 


T 

2M 


2 l r \ 3/2 

— [ -2—) + constant, 

3 \2M} 


2 / r y»/2 ^ r \i/2 (r/2M) l/2 + 1 

T\2M/ “ 2 \2M) + (r/2M) in - 1 


-h constant. 


(31.2) 


[See §25.5 and especially equation (25.38) for derivation and discussion.] One obtains 
the r coordinate of the explorer in terms of the proper time measured on a clock 
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he carries, r(T), by inverting the first equation; one finds his r coordinate in terms 
of coordinate time, r\t ), by inverting the second equation. 

Of all the features of the traveler’s trajectory, one stands out most clearly and 
disturbingly: to reach the gravitational radius, r — 2M, requires a finite lapse of 
proper time, but an infinite lapse of coordinate time: 


r/2M — 1 — (t + constant)/2Af when near r — 2M\ 
r/2M = 1 + constant X exp (~t/2M) in limit as t —oo. 


(31.3) 


An infalling observer reaches 
r — 2 M in finite proper time 
but infinite coordinate time 


(see Fig. 25.5.) Of course, proper time is the relevant quantity for the explorer’s 
heart-beat and health. No coordinate system has the power to prevent him from 
reaching r = 2 M. Only the coordinate-independent geometry of spacetime could 
possibly do that; and equation (31.3) shows it does not! 

Let the explorer approach and reach r = 2M, then. What spacetime geometry does 
he measure there? Is it singular or nonsingular? Restated in terms of measurements, 
do infinite tidal gravitational forces tear the traveler apart and crusliliim as he 
approaches r = 2M, or does he feef only finite tidal forces which in principle his 
body can witM^nd? 

The tidal forces felt by the explorer as he passes a given radius r are measured 
by the components of the Riemann curvature tensor with respect to his orthonormal 
frame there (equation of geodesic deviation). To calculate those curvature compo¬ 
nents at r, proceed in two steps. (1) Calculate the components, not in the traveler’s 
frame, but rather in the “static” orthonormal frame 



dt, w r — 


_ dr _ 

(1 - 2M/r) 1/2 ’ 


w° = r </0, cv* = rsinO d§ 


(31.4a) 


located at the event through which he is passing; the result [obtainable from equa¬ 
tions (14.50) and (14.51) by setting e 2(p = e~ 2A — 1 — 2 M/r\ is 


II 

< K 

-2 M 

r 3 ’ 

D __ D__ _ M 

^tete “ ^(0f0 — r s > 

^0^0 — 

2 M 

r 3 • 

Rrdrd — ^r0r0 — ^3 


(31.4b) 


all other R^y% vanish except those obtainable 
from the above by symmetries of Riemann . 


(2) Calculate the components in the explorer’s frame by applying to the “static- 
frame” components (31.4b) the appropriate transformation—forr > .2 M, a Lorentz 
boost in the e- direction with ordinary velocity v T \ for r < 2M , not a “boost,” but 
a transformation given by the standard boost formula (Box 2.4) with v T > 1. Here 


(grr) 1/2 dr dr/dt /2AA 172 

(~gtt) 1/2 dt 1 - 2M/r V r / 


(3L5) 


The amazing result (a consequence of special algebraic properties of the Schwarzs- 
child geometry, and somewhat analogous to what happens—or, rather, does not hap- 
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pen—to the components of the electromagnetic field, E and B y when they are both 
parallel to a boost) is this: all the components of Riemann are left completely 
unaffected by the boost. If e- is the traveler’s radial basis vector, and e r — u is 
his time basis vector, then 

Rrbri, = - 2 A i/r\ R- T o-rO = *** = M/r\ (3 j £) 

Rem = 2M / r3 ’ Ryebe = R bi& = ~ 


The infalting observer does 
not feel infinite tidal forces at 
r = 2 M 


Thus, the spacetime 
geometry is well behaved at 
r = 2 M, but the coordinate 
system is pathological 


(See exercise 31.1.) 

The payoff of this calculation: according to equations (31.6), none of the compo¬ 
nents of Riemann in the explorer’s orthonormal frame become infinite at the 
gravitational radius. The tidal forces the traveler feels as he approaches r = 2 M are 
finite; they do not tear him apart—at least not when the mass M is sufficiently great, 
because at r = 2M the typical non-zero component of the curvature tensor 

is of the order 1 /A/ 2 . The gravitational radius is a perfectly well-behaved, nonsingular 
region of spacetime, and nothing there can prevent the explorer from falling on 
inward. 

By contrast, deep inside the gravitational radius, at r = 0, the traveler must 
encounter infinite tidal forces, independently of the route he uses to reach there. 
One says that “r = 0 is a physical singularity of spacetime.” To see this, one need 
only calculate from equation (31.4b) or (31.6) the “curvature invariant”: 


I=R^y h R«^ 1 = 48M 2 /r 6 . 


(31.7) 


( 

Box 31.1 THE "SCHWARZSCHILD SINGULARITY": HISTORICAL REMARKS 


"\ 


Although Eddington (1924) was the first to con¬ 
struct a coordinate system that is nonsingular at 
r — 2 M, he seems not to have recognized the sig¬ 
nificance of his result. Lemaitre (1933c, especially 
p. 82) appears to have been the first to recognize 
that the so-called “Schwarzschild singularity” at 
r = 2M is not a singularity. He wrote, “La singu¬ 
larity du champ de Schwarzschild est done une 
singularite Active, analogue k celle qui se presen- 
tait a l’horizon du centre dans la forme originale 
de l’univers de de Sitter”. He also provided a 
coordinate system to go through r — 2M. How¬ 
ever, his coordinate system, like Eddington’s, cov¬ 
ered only half of the Schwarzschild geometry: 

_i_ 


regions I and II of Figure 31.3. Synge (1950) was 
the first to discover the incompleteness in the Ed¬ 
dington and Lemaitre coordinate systems, and to 
provide coordinates that cover the entire geometry 
(regions I, II, III, IV of Figure 31.3). Fronsdal 
(1959), unaware of Synge’s work, rediscovered the 
global structure of the Schwarzschild geometry by 
means of embedding diagrams and calculations. 
The coordinate system that provides maximum 
insight into the Schwarzschild geometry is the one 
generally known as the Kruskal-Szekeres coordi¬ 
nate system. It was constructed independently by 
Kruskal (1960) and by Szekeres (1960). 
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In every local Lorentz frame this will be a sum of products of curvature components, 
and it will have the same value 48Af 2 /r 6 . Thus, in every local Lorentz frame, 
including the traveler’s, Riemann will have one or more infinite components as 
r —>- 0; i.e., tidal forces will become infinite. 


Exercise 31.1. TIDAL FORCES ON INFALLING EXPLORER 

(a) Carry out the details of the derivation of the Riemann tensor components (31.6). 

(b) Calculate, roughly, the critical mass A/ Crit such that, if M > A/ Crit the explorer’s body 
(a human body made of normal flesh and bones) can withstand the tidal forces at r — 2Af, 
but if M < M crii his body is mutilated by them. [Answer: M crit — 1000A/ 0 . Evidently, if 
M — M q the physicist should transform himself into an ant before taking the plunge! For 
details see §32.6.] 


§31.3. BEHAVIOR OF SCHWARZSCHILD 
COORDINATES AT r = 2M 

Since the spacetime geometry is well behaved at the gravitational radius, the singular 
behavior there of the Schwarzschild metric components, g tt = — (1 — 2M/r) and 
g n = (1 — 2M/r)~ l , must be due to a pathology there of the Schwarzschild coordi¬ 
nates t, r, 0 , (f>. Somehow one must find a way to get rid of that pathology—i.e., one 
must construct a new coordinate system from which the pathology is absent. Before 
doing this, it is helpful to understand better the precise nature of the pathology. 

The most obvious pathology at r = 2M is the reversal there of the roles of t and 
r as timelike and spacelike coordinates. In the region r > 2M, the t direction, d/dt, 
is timelike (g tt < 0) and the r direction, d/dr , is spacelike ( g^ > 0); but in the region 
r < 2M, d/dt is spacelike ( g tt > 0) and d/dr is timelike ( g^ < 0). 

What does it mean for r to “change in character from a spacelike coordinate to 
a timelike one”? The explorer in his jet-powered spaceship prior to arrival at r — 2M 
always has the option to turn on his jets and change his motion from decreasing 
r (infall) to increasing r (escape). Quite the contrary is the situation when he has 
once allowed himself to fall inside r = 2 M. Then the further decrease of r represents 
the passage of time. No command that the traveler can give to his jet engine will 
turn back time. That unseen power of the world which drags everyone forward 
willy-nilly from age twenty to forty and from forty to eighty also .drags the rocket 
in from time coordinate r — 2M to the later value of the time coordinate r — 0. 
No human act of will, no engine, no rocket, no force (see exercise 31.3) can make 
time stand still. As surely as cells die, as surely as the traveler’s watch ticks away 
“the unforgiving minutes,” with equal certainty, and with never one halt along the 
way, r drops from 2M to 0. 

At r — 2 A/, where r and t exchange roles as space and time coordinates, g tt vanishes 
while g^ is infinite. The vanishing of g u suggests that the surface r — 2M , which 


At r — 0 the curvature is 
infinite 


EXERCISE 


Nature of the coordinate 
pathology at r = 2 M: 


‘,(1) t and r reverse roles as 
j timelike and spacelike 
coordinates 


A 

v; 

S' 
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(2) the region r — 2 M, 

— oo < t < 4- sc is 
two-dimensional rather 
than three 


appears to be three-dimensional in the Schw aris child coordinate system (— oo < t < 
+ oo, 0 < d < 77, 0 < ^ < 2 k) has zero volume and thus is actually only two-di¬ 
mensional, or else is null; thus. 

f \gug9»gtt\ vt dt de d <> - °; 

T ~ 2M (31.8) 

f \geeg^\ in de d4> = 4 t(2M) 2 . 

(t~ 2 M,t — const) 

The divergence of at r = 2M does not mean that r — 2M is infinitely far from 
all other regions of spacetime. On the contrary, the proper distance from r = 2 M 
to a point with arbitrary r is 

[r(r - 2M)] 1/2 + 2M\n \(r/2M - 1) 1/2 + (r/2M) 1/2 \ 

when r > 2 M, 

(31.9) 

-2Mcor l [r l/2 /(2M - r) l/2 } - [r(2M - r)} 1/2 

when r < 2 M, 



which is finite for all 0 < r < cc. 

Just how the region r < 2M is physically connected to the region r > 2 M can 
be discovered by examining the radial geodesics of the Schwarzschild metric. Focus 
attention, for concreteness, on thelrajectory of a test particle that gets ejected from 
the singularity at r = 0 , flies radially outward through r = 2 M, reaches a maximum 
radius r max (“top of orbit”) at proper time r = 0 and coordinate time t = 0 , and 
then falls back down through r — 2M to r = 0. The solution of the geodesic equation 
for such an orbit was derived in §25.5 and described in Figure 25.3. It has the “cycloid 
form” (with the parameter 77 running from — m to + 77 ), 

r = j r max (l + cost,), (31.10a) 

t = (r maK 3 /SM) 1/2 (Tj + sin?,), (31.10b) 

..w, I C'-max/ 2M ~ 1Z2 + tan ^/ 2 ) 

' ( r max/ 2 ^ — 1) 1/2 — tan (tj/ 2) (31 1Qc) 


(3) radial geodesics reveal 
that the regions r = 2 M, 
t = ±oo are "finite" 
parts of spacetime 


Figure 31.1 plots this orbit in the r , f-coordinate plane (curve F-F'-F "), along with 
several other types of radial geodesics. 

Every radial geodesic except a “set of geodesics of measure zero” crosses the 
gravitational radius at t — -{-oo (or at / = — oo, or both), according to Figure 31.1 
and the calculations behind that figure (exercises for the student! See Chapter 25). 
One therefore suspects that all the physics at r — 2M is consigned to t — ±oo by 
reason of some unhappiness in the choice of the Schwarzschild coordinates. Aj3etter_ 
coordinate system, one begins to believe, will take these two “points at infinity” and 
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_L A\B\D\ F' 
M (at i = oo) 



Figure 31.1. 

Typical radial geodesics of the Schwarzschild geometry, as charted in Schwarzschild coordinates (sche¬ 
matic). FF'F " [see equations (31.10)] is the timelike geodesic of a test particle that starts at rest at r — 5.2 M 
and falls straight in, arriving in a finite proper time at the singularity r — 0 (zig-zag marking). The 
unhappiness of the Schwarzschild coordinate system shows in two ways: (1) in the fact that t goes to 
oo partway through the motion; and (2) in the fact that / thereafter decreases as r (not shown) continues 
to increase. The course of the same trajectory prior to t = 0 may be constructed by reflecting the diagram 
in the horizontal axis (“time inversion”)- The time-reversed image of F" marks the ejection of the test 
particle from the singularity. AA'A " is a timelike geodesic which comes in from r = + oo. BB'B " is the 
null geodesic travelled by a photon that falls straight in (no summit; never at rest!). DD'D" is a spacelike 
radial geodesic. So is CC', but E'E" is timelike. Neither of the latter two ever succeed in crossing r — 2M. 
(Unanswered questions about the_se geodesics will answer themselves in Figure 31.4, where the same 
world lines are charted in a “Kruskal-Szekeres diagram”). 

Described mathematically via equation (31.10), the geodesic F" inverse F' Averse FF'F" starts with ejection 
at 


(r / r \ it / t 3 . 1/2 

r = 0 at / = — 1} (-^ + 1), T = _ ; 

V 2 M ) V AM / 2 \ 2 M ) 

it flies outward with increasing proper time t, but decreasing coordinate time, /, until it reaches the 
gravitational radius 

r = 2 M at / = — oo, r ■ 

it then continues to fly on outward, but with coordinate time now increasing from t = — oo, until it 
reaches its maximum radius 


('■”“ 3 V /2 cos-‘( 4M -l 

\ r ( 


' 8^ ’ ''•max 

1 ^max l 

r } 

' max ' 


r = r max at / = 0, r = 0 (event F in diagram); 
it then falls inward, with t continuing to increase, until it crosses the gravitational radius again 


2 M \ 1/2 


(event F* in diagram); 

and it finally falls on in with decreasing t (but, of course, still increasing t) to 

r = 0 at / = +2lV 7^+ l\ T= 

\2 M 1 \ AM I 2 V 2 M ) 

(event F" in diagram). 
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spread them out into a line in a new (r new , / new )-plane; and will squeeze the “line” 
(r — 2 A/, t from — oo to + 00) into a__smgle point m the (> ne ^,j new )-plane- One is 
the more prepared to accept this tentative conclusion and act on it because one has 
already seen (equation 31.8) that the region covering the (i9, <» 2-sphere at r = 2 M, 
and extending from t — — oo to / = -boo, has zeroj)rpper_volume. What timelier 
indication could one want that the “line” r = 2A/, — oo < / < oo, is actually a point? 


Novikov coordinates: 
(1) how constructed 


(2) line element 


§31.4. SEVERAL WELL-BEHAVED COORDINATE SYSTEMS 


The well-behaved coordinate system that is easiest to visualize is one in which the 
radially moving test particles of equations (31.10) remain always at rest (“comoving 
coordinates”). Such coordinates were first used by Novikov (1963). Novikov attaches 
a specific value of his radial coordinate, R *, to each test particle as it emerges from 
the singularity of infinite tidal forces at r — 0, and insists that the particle carry that 
value of R* throughout its “cycloidal life”—up through r = 2 M to r — r max , then 
back down through r = 2M to r — 0. For definiteness, Novikov expresses the R* 
value for each particle in terms of the peak point on its trajectory by 

R* = (r max /2M - 1) 1/2 . (31.11) 

As a time coordinate, Novikov uses proper time r of the test particles, normalized 
so t = 0 at the peak of the orbit. Every particle in the swarm is ejected in such 
a manner that it arrives at the summit of its trajectory (r = r max , r = 0) at one and 
the same value of the Schwarzschild coordinate time; namely, at t — 0. 

Simple though' they may be conceptually, the Novikov coordinates are related 
to the original Schwarzschild coordinates by a very complicated transformation: (1) 
combine equations (31.10b) and (31.11) to obtain 17(7, i?*); (2) combine t/(t, R*) with 
(31.10a) and (31.11) to obtain r(r, R *); (3) combine 17(7, R *) with (31.10c) and (31.11) 
to obtain t(r, R*). The resulting coordinate transformation, when applied to the 
Schwarzschild metric (31.1), yields the line element 

ds 2 = -dr 2 + 1 ) dR* 2 + r 2 (d6 2 + sin 2 0 d<t> 2 ). (31.12a) 

(“ Schwarzschild geometry in Novikov coordinates”.) Here r is no longer a radial 
coordinate; it is now a metric function Hr, R*) given implicitly by 


2 M 


= ± (R* 2 + 



{r/2Mf 1 1/2 

R* 2 + 1 J 


+ (R* 2 + 


I) 3 / 2 COS - 



r/2M 
R* 2 + 1 



(31.12b) 


Figure 31.2 shows the locations of several key regions of Schwarzschild spacetime 
in this coordinate system. The existence of two distinct regions with r = 0 (singulari¬ 
ties) and two distinct regions with r —*- oo (asymptotically flat regions; recall that 
4mr 2 = surface area!) will be discussed in §31.5. 



r 



Figure 31.2. 

The Novikov coordinate system for Schwarzschild 
spacetime (schematic). The dashed curves are curves of 
constant r (recall: far 2 - — surface area about center of 
symmetry). The region shaded gray is not part of 
spacetime; it corresponds to r < 0, a region that cannot 
be reached because of the singularity of spacetime at 
r = 0. Notice that the “line” (r = 2A/, — oo < 

/<+ oo) of the Schwarzschild coordinate diagram 
(Figure 31.1) has been compressed into a point here, 
in accordance with the discussion at the end of §31.3. 



Although Novikov’s coordinate system isjvery _simplej:qnceptually, the mathe¬ 
matical expressions for the metric, components in it are rather unwielcfy. Simpler, 
more usable expressions have been obtained in a different coordinate system 
(“Kruskal-Szekeres coordinates”) by Kruskal (I960), and independently by Szekeres 

(1960). ----- - .- 

Kruskal and Szekeres use a dimensionless radial coordinate it and a dimensionless 



u = (r/2M — \y/2 e r/4M cos h ( 7 / 4 ^) 1 
V = (r/2M - \) 1/2 e r/ * M sinh (t/AM) j When T > 2M ’ 


(31.13a) 


u — (1 — r/2M) 1/2 e r/4M sinh (t/4M)) 
v = (1 - r/2M) 1/2 e r/4M cosh (t/4M) J When ^ < 1M ' 


(31.13b) 


Kruskal-Szekeres coordinates 


XMbtivation for mtroducing such coordinates is given in Box 31.2.) By making this 
change of coordinates in the Schwarzschild metric (31.1), one obtains the following 
line element: 


ds°- = (32 M 3 /r)e~ r/2M (-dv 2 + du 2 ) + r\d6 2 + sin 2 0 ^ 2 ) (31.14a) 

(“Schwarzschild geometry in Kruskal-Szekeres coordinates ”). Here r is to be regarded 
as a function of u and v defined implicitly by 

(r/2M - l)e T/2M = u 2 - v 2 (31.14b) 


[cf. equations (31.13)]. 


(continued on page 833) 
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\ 

Box 31.2 MOTIVATION FOR KRUSKAL-SZEKERES COORDINATES* 

A. EDDINGTON-FINKELSTEIN COORDINATES 

The motivation for the Kruskai-Szekeres system begins by introducing a different 
coordinate system, first devised by Eddington (1924) and rediscovered by Finkelstein 
(1958). Eddington and Finkelstein use as the foundation of their coordinate system, 
not freely falling particles as did Novikov, but freely falling photons. More par¬ 
ticularly, they introduce coordinates U and V\ which are labels for outgoing and 
ingoing, radial, null geodesics. The geodesics are given by 

ds 2 — 0 — —(1 — 2M/r) dt 2 + (1 — 2M/r)~ l dr 2 . 

Equivalently, outgoing geodesics are given by U = const, where 

U=t - r*\ (la) 

and ingoing geodesics are given by V = const, where 

V~t + r\ (lb) 

Here r* is the “tortoise coordinate” of §25.5 and Figure 25.4: 

r* = r + 2Mln \r/2M - 1|. . (lc) 


Ingoing Eddington-Finkelstein Coordinates—Adopt r and V as 
coordinates in place of r and t 


The Schwarzschild metric becomes. 

ds 2 = -(1 - 2 M/f) dV 2 + 2 dVdr + r 2 dQ 2 . 
The radial light cone, ds 2 = 0, has one leg 

dV/dr - 0, 


and the other leg 


dV _ 2 

dr 1 — 2 M/r ’ 

From this, and this alone, one can infer all features of the drawing. 


( 2 ) 

(3a) 


(3b) 


This box is based on Misner (1969a). 
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Box 31.2 ( continued) 

The radial light cone, ds 2 = 0, has one leg 

dD/dr = 0, 


and the other leg 


dU_ _ _ 2 

dr 1 - 2 M/r ' 

From this, and this alone, one can infer all features of the drawing. 



(5a) 


(5b) 



r = 2 M is world line 
of infalling photons 


Outgoing Eddington-Finkelstein coordinates (one rotational degree of free¬ 
dom is suppressed). (Surfaces of constant V, being outgoing null surfaces, 
are plotted on a 45-degree slant, just as they would be in flat spacetime.) 
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Notice that both Eddington-Finkelstein coordinate systems are better behaved at 
the gravitational radius than is the Schwarzschild coordinate system; hu t^hey are 
not fully well-behave d. The outgoing coordinates ( U,r,6 ,$) describe in a non- 
pathological manner the ejection of particles outward from r = 0 through r = 2M; 
but their description._oLinfall thro ugh r = 2M has the sa me pathology as the d e- 
scription„given,.h.y,.-Schwarzschild coordinates (Figure 31.1). Similarly, the ingoing 
coordinates (F, r, 6, <t>) describe well the'infalTori particle through r = 2M, but they 
give a pathological description of outgoing trajectories. Moreover, the contrast 
between the two diagrams seems paradoxical: in one the gravitational radius is made 
up of world lines of outgoing photons; in the other it is made up of world lines 
of ingoing photons! To resolve the paradox, one must seek another, better-behaved 
coordinate system. [But note: because the ingoing Eddington-Finklestein coordinates \ \ 
describe infall so well, they are used extensively in discussions of gravitational f' j 
collapse (Chapter 32) and black holes (Chapters 33 and 34).] V j 


B. TRANSITION FROM EDDINGTON-FINKELSTEIN 
TO KRUSKAL-SZEKERES 

Perhaps one would obtain a fully well-behaved coordinate system by dropping 
r from view and using U, V, as the two coordinates in the radial-time plane. The 
resulting coordinate system is related to Schwarzschild coordinates by [see equations 
(1)1 

V-U = 2 r *, (6a) 

V + U = 2t\ (6b) 

and the line element in terms of the new coordinates reads 

ds 2 = -(1 - 2 M/r) dUd?+ r 2 (d6 2 + sin 2 0 d<f> 2 ). (7) 

Contrary to one’s hopes, this coordinate system is pathological at r = 2M. 

Second thoughts about the construction reveal the trouble: the surfaces U = 
constant (outgoing null surfaces) used in constructing it are geometrically well- 
defined, as are the surfaces V - constant (ingoing null surfaces); but the way of 
labeling them is not. Any relabeling, u = F( U) and v = G( V), will leave the surfaces 
unchanged physically. What one needs is a relabeling that will get rid of the singular 
factor 1 — 2 M/r in the line element (7). A successful relabeling is suggested by the 
equation 


exp [(V — U)/4M] = exp (r*/2M) = (r/2M - 1) exp (r/2M), (8) 




Box 31.2 (continued) 


which follows from equations ( 6 a) and (1c). Experimenting with this relation quickly 
reveals that the relabeling 

u = - e ~ D/4M = —(r/2M - \)m e r/4M e' t/4M , (9a) 

v = e + ^ /4U ~ (r/2M— l) 1/2 e r/43 V /43/ , (9b) 

will remove the offending 1 — 2 M/r from the metric coefficients. In terms of these 
new coordinates, the line element reads 

ds 2 = — (3>2M 3 /r)e~ r/2M dv du + r 2 (d6 2 + sin 2 0 d(p 2 ). ( 10 a) 

Here r is still defined by 4 vr 2 = surface area, but it must be regarded as a function 
of v and u: 

(r/2M - \)e T/2M = -uv. (10b) 

One can readily verify that this equation determines r uniquely (recall: r > 0!) in 
terms of the product uv [details in Misner (1969a)]. 

The coordinates, % v , which label the ingoing and outgoing null surfaces, are null 
coordinates; i.e., 

d/du • d/du — = 0, d/dv * d/dv = = 0 

[see equation (10a)]. If one is not accustomed to working with null coordinates, it 
is helpful to replace u and v by spacelike and timelike coordinates, u and v (Kruskal- 
Szekeres coordinates!) defined by 

M =I(y _ u) = (r/2M — l) 1 / V /4if cosh (11a) 

y = 1 ( v + u) = ( r /2M - iy/2 e r/4M sinh (t/4M), (lib) . 

so that 

dv 2 — du 2 = dv du. ( 12 ) 

In terms of these coordinates, the line element has the Kruskal form (31.14), which 
is fully well-behaved at the gravitational radius. 

Although the Kruskal-Szekeres line element is well behaved at r = 2M, the 
transformation (11) from Schwarzschild to Kruskal-Szekeres is not; it becomes 
meaningless (u and v “imaginary”) when one moves from r > 2 M to r < 2M. Of 
course, this is a manifestation of the pathologies of Schwarzschild coordinates. By 
trial and error, one readily finds a new transformation, to replace ( 11 ) at r <C 2 Af, 
leading from Schwarzschild to Kruskal-Szekeres coordinates: 

u = (\ - r/2M) 1/2 e T/4M sinh (f/4M), (11 c) 

v = (\ - r/2M) 1/2 e T/4M cosh (t/4M). (lid) 
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§31.5. RELATIONSHIP BETWEEN KRUSKAL-SZEKERES 

COORDINATES AND SCHWARZSCHILD COORDINATES 


In the Kiuskal-Szekeres coordinate system, the singularity r — 0 is located at 
v 2 — u 2 = 1. Thus there are actually two singularities, not one; both 

v = + (1 + u 2 ) 1/2 and v = —(1 + w 2 ) 172 correspond to r = 0! (31.15) 

This is not the only surprise that lies hidden in the Kruskal-Szekeres line element 
(31.14). Notice also that r > 2Af (the region of spacetime far outside the gravitational 
radius) is given by u 2 > v 2 . Thus there are actually two exterior regions*; both 


u > + \v\ and u < —|u| correspond to r > 2Af! (31.16) 

How can this be? When the geometry is charted in Schwarzschild coordinates, 
it contains one singularity and one exterior region; but when expressed in Kruskal- 
Szekeres coordinates, it shows two of each. The answer must be that the Schwarzs- 
chi ld coor dinates cover only part of the spacetime manifold] they must be only a 
local coordinate patch on the full manifold. Somehow, by means of the coordinate 
transformation that leads to Kruskal-Szekeres coordinates, one has analytically 
extended the limited Schwarzschild solution for the metric to cover all (or more 
nearly all) of the manifold. 

To understand this covering more clearly, transform back from Kruskal-Szekeres 
coordinates to Schwarzschild coordinates (see Figure 31.3). The transformation 
equations, as written down in (31.13) were valid only for the quadrants u > \v\ 
[equation (31.13a)] and v > |w| [equation (31.13b)] of Kruskal coordinates. Denote 
these quadrants by the numerals I and II; and denote the other quadrants by III 
and IV (see Figure 31.3). In the other quadrants, one can also transform the Kruskal- 
Szekeres line element (31.14) into the Schwarzschild line element (31.1); but slightly 
different transformation equations are needed. One easily verifies that the following 
sets of transformations work: 


(I) 


u ~ (r/2M — \) l/2 e T/4M cosh (t/4M) 
V = (r/2M — iy/2 e r/4M giHh (//4 jV/) ’ 


f u — (1 — r/2M) 1/2 e T/4M sinh ( t/AM ) 
[u = (1 — r/2M) 1/2 e T/4M cosh(r/4A/)’ 


(III) 


v 


~(r/2M — l) 1/2 e r/4Jf cosh (f/4A/) 
— ( r/2M — 1 ) 1 /2 e r / 4M gjjjj My ’ 


(IV) 


u 
K V 


— (1 — r/2M) 1/2 e T/4M $inh(t/4M) 

- (1 - r/2M) 1/2 e T/4M cosh (t/4M) * 


(31.17a) 

(31.17b) 

(31.17c) 

(31.17d) 


*The global structure of the Schwarzschild geometry, including the existence of two singularities and 
two exterior regions, was first discovered by Synge (1950). See Box 31.1. 


Kruskal-Szekeres coordinates 
reveal that Schwarzschild 
spacetime has two “r — 0 
singularities” and two 
'V—oo exterior regions” 


Transformation between 
Schwarzschild coordinates 
and Kruskal-Szekeres 
coordinates 
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coordinates. Two Schwarzschild coordinate patches I, II, and III, IV (illustrated in the upper and lower 
portions of Figure 31.5,a) are required to cover the complete Schwarzschild geometry, whereas a single 
Kruskal-Szekeres coordinate system suffices. The Schwarzschild geometry consists of four regions I, II, 
III, IV. Regions I and III represent two distinct, but identical, asymptotically flat universes in which 
r > 2 M\ while regions II and IV are two identical, but time-reversed, regions in which physical singulari¬ 
ties (r = 0) evolve. The transformation laws that relate the Schwarzschild and Kruskal-Szekeres coordi¬ 
nate systems to each other are given by equations (31.17) and (31.18). In the Kruskal-Szekeres u,u-plane, 
curves of constant r are hyperbolae with asymptotes u — dzv, while curves of constant I are straight 
lines through the origin. 


Two Schwarzschild 
coordinate patches are 
required to cover all of 
spacetime 


The inverse transformations are 

(r/2M - l)e r/2M = u 2 - v* in I, II, III, IV; 

4 M tanh _1 (i;/w) in I and III, 

4A/ tanh -1 (w/i;) in II and IV. 

These coordinate transformations are exhibited graphically in Figure 31.3. Notice 
that two Schwarzschild coordinate patches, I, II, and III, IV, are required to cover 
the entire Schwarzschild geometry; but a single Kruskal coordinate system suffices. 
Schwarzschild patch I, II, is divided into two regions—region I, which is outside 
the gravitational radius (r > 2 M), and region II, which is inside the gravitational 
radius (r < 2M). Similarly, Schwarzschild patch III, IV, consists of an exterior 
region (III) and an interior region (IV). 



(31.18a) 

(31.18b) 
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Schwarzschild Kruskal-Szekeres 

Figure 31.4. 

(a) Typical radial timelike (A, E, F), lightlike (5), and spacelike (C, D ) geodesics of the Schwarzschild 
geometry, as seen in the Schwarzschild coordinate system (schematic only). This is a reproduction of 
Figure 31.1. 

(b) The same geodesics, as seen in the Kruskal-Szekeres coordinate system, and as extended either to 
infinite length or to the singularity of infinite curvature at r — 0 (schematic only). 


Equations (31.18) reveal that the regions of constant r (constant surface area) are 
hyperbolae with asymptotes u = ±v in the Kruskal-Szekeres diagram, and that 
regions of constant t are straight lines through the origin. 

Several radial geodesics of the complete Schwarzschild geometry are depicted in Properties of the 

the Kruskal-Szekeres coordinate system in Figure 31.4. Notice how much more\ Kruskal-Szekeres coordinate 

- system 

reasonable the geodesic curves look in Kruskal-Szekeres coordinates than inf' 

Schwarzschild coordinates. Notice also that radial lightlike geodesics (paths of radial ) 
light rays) are 45-degree lines in the Kruskal-Szekeres coordinate system . This can 
be seen from the Kruskal-Szekeres line element (31.14), for which du = ±dv guar¬ 
antees ds — 0. Because of this 45-degree property, the radial light cone in a ^ 

Kruskal-Szekeres diagram has the sam e form as i n the s pace-time diagram of special 
relativity. Any radial curve that points “generally upward” (i.e., makes an angle of 
less than 45 degrees with the vertical, v , axis) is timelike; and curves that point 
“generally outward” are spacelike. This property enables a Kruskal-Szekeres diagram 
to exhibit easily the causality relation between one event in spacetime and another 
(see exercises 31.2 to 31.4). 
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Exercise 31.2. 


NONRADIAL LIGHT CONES 


Show that the world line of a photon traveling nonradially makes an angle less than 45 
degrees with the vertical u-axis of a Kruskal-Szekeres coordinate diagram. From this, infer 
that particles with finite rest mass, traveling nonradially or radially, must always move 
“generally upward” (angle less than 45 degrees with vertical u-axis). 


Exercise 31.3. THE CRACK OF DOOM 

Use a Kruskal diagram to show the following. 

(a) If a man allows himself to fall through the gravitational radius r = 2A/, there is no 
way whatsoever for him to avoid hitting (and being killed in) the singularity at r — 0. 

(b) Once a man has fallen inward through r = 2A/, there is no way whatsoever that he 
can send messages out to his friends at r > 2 M, but he can still receive messages from them 
(e.g., by radio waves, or laser beam, or infalling “CARE packages”). 


Exercise 31.4. HOW LONG TO LIVE? 

Show that once a man falling inward reaches the gravitational radius, no matter what he 
does subsequently (no matter in what directions, how long, and how hard he blasts his rocket 
engines), he will be pulled into the singularity and killed in a proper time of 

T < T max = = 1.54 x 10“ 5 (M/A/ o ) seconds. (31.19) 


[Hint: The trajectory of longest proper time lapse must be a geodesic. Use the mathematical 
tools of Chapter 25 to show that the geodesic of longest proper time lapse between r — 2 M 
and r — 0 is the radial geodesic (31.10a), with r raax = 2 A/, for which the time lapse is t:M] 

Exercise 31.5. EDDINGTON-FINKELSTEIN AND KRUSKAL-SZEKERES COMPARED 

Use coordinate diagrams to compare the ingoing and outgoing Eddington-Finkelstein coor¬ 
dinates of Box 31.2 with the Kruskal-Szekeres coordinates. Pattern the comparison after that 
between Schwarzschild and Kruskal-Szekeres in Figures 31.3 and 31.4. 

Exercise 31.6. ANOTHER COORDINATE SYSTEM 

Construct a coordinate diagram for the U, V, 0, <p coordinate system of Box 31.2 [equations 
(6) and (7)]. Show such features as (1) the relationship to Schwarzschild and to Kruskal- 
Szekeres coordinates; (2) the location of r = 2M\ and (3) radial geodesics. 


§31.6, DYNAMICS OF THE SCHWARZSCHILD GEOMETRY 

What does the Schwarzschild geometry look like? This question is most readily 
answered by means of embedding diagrams analogous to those for an equilibrium 
star (§23.8; Figure 23.1; and end of Box 23.2) and for Friedmann universes of 
positive and negative spatial curvature [equations (27.23) and (27.24) and Box 27.2]. 

Examine, first, the geometry of the spacelike hypersurface v — 0, which extends 
from u = + oo (r = oo) into u — 0 (r = 2 M) and then out to u = — oo (r = oo). 
In Schwarzschild coordinates this surface is a slice of constant time, t = 0 [see 
equation (31.18b)]; it is precisely the surface for which an embedding diagram was 
calculated in equation (23.34b). The embedded surface, with one degree of rotational 
freedom suppressed, is described by the paraboloid of revolution 


7 = 2M 4- z 2 /SM 


(31.20) 





Figure 31.5. 

(a) The Schwarzschild space geometry at the “moment of time” t = v = 0, with one degree of rotational 
freedom suppressed (6 = rr/2). To restore that rotational freedom and obtain the full Schwarzschild 
3-geometry, one mentally replaces the circles of constant? = (jc 2 + J 2 ) l/2 with spherical surfaces of area 
A'nr 2 . Note that the resultant 3-geometry becomes flat (Euclidean) far from the throat of the bridge in 
both directions (both “universes”). 

(b) An embedding of the Schwarzschild space geometry at “time” t = v s= 0, which is geometrically 
identical to the embedding (a), but which is topologically different. Einstein’s field equations fix the 
local geometry of spacetime, but they do not flx its topology; see the discussion at end of Box 27.2. 
Here the Schwarzschild “wormhole” connects two distant regions of a single, asymptotically flat universe. 
For a discussion of issues of causality associated with this choice of topology, see Fuller and Wheeler 
(1962). 


in the flat Euclidean space with metric 

do 2 = dr 2 + dz 2 + 7 2 d<j> 2 . (31.21) 


(See Figure 31.5.) 

Notice from the embedding diagram of Figure 31.5,a, that the Schwarzschild 
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The 3-surface v — t = 0 is a 
"wormhole" connecting two 
asymptotically flat universes, 
or two different regions of 
one universe 


Schwarzschild geometry is 
dynamic in regions r < 2 M 


Time evolution of the 
wormhole: creation; 
expansion; recontraction; and 
pinch-off 


Communication through the 
wormhole is impossible: it 
pinches off too fast 


geometry on the spacelike hypersurface t — const consists of a bridge or “wormhole” 
connecting two distinct, but identical, asymptotically flat universes. This bridge is 
sometimes called the “Einstein-Rosen bridge” and sometimes the “Schwarzschild 
throat” or the “Schwarzschild wormhole.” If one so wishes, one can change the 
topology of the Schwarzschild geometry by connecting the two asymptotically flat 
universes together in a region distant from the Schwarzschild throat [Fuller and 
Wheeler (1962); Fig. 31.5b]. The single, unique universe then becomes multiply 
connected, with the Schwarzschild throat providing one spacelike path from point 
d to point and the nearly flat universe providing another. For concreteness, focus 
attention on the interpretation of the Schwarzschild geometry, not in terms of 
Wheeler’s multiply connected single universe, but rather in terms of the Einstein- 
Rosen double universe of Figure 31.5,a. 

One is usually accustomed to think of the Schwarzschild geometry as static. 
However, the static “time translations,” t —»- t + At, which leave the Schwarzschild 
geometry unchanged, are time translations in the strict sense of the words only in 
regions I and III of the Schwarzschild geometry. In regions II and IV, 
t —t + At is a spacelike motion, not a timelike motion (see Fig. 31.3). Conse¬ 
quently, a spacelike hypersurface, such as the surface t — const of Figure 31.5,a, 
which extends from region I through u = v = 0 into region III, is not static. As 
this spacelike hypersurface is pushed forward in time (in the +v direction of the 
Kruskal diagram), it enters region II, and its geometry begins to change. 

In order to examine the time-development of the Schwarzschild geometry, one 
needs a sequence of embedding diagrams, each corresponding to the geometry of 
a spacelike hypersurface to the future of the preceding one. But how are the hyper¬ 
surfaces to be chosen? In Newtonian theory or special relativity, one chooses hyper¬ 
surfaces of constant time. But in dynamic regions of curved spacetime, no naturally 
preferred time coordinate exists. This situation forces one to make a totally arbitrary 
choice of hypersurfaces to use in visualizing the time-development of geometry, and 
to keep in mind how very arbitrary that choice was. 

Figure 31.6 uses two very different choices of hypersurfaces to depict the time- 
development of the Schwarzschild geometry. (Still other choices are shown in Figure 
21.4.) Notice that the precise geometry of the evolving bridge depends on the 
arbitrary choice of spacelike hypersurfaces, but that the qualitative nature of the 
evolution is independent of the choice of hypersurfaces. Qualitatively speaking, the 
two asymptotically flat universes begin disconnected, with each one containing a 
singularity of infinite curvature (r = 0). As the two universes evolve in time, their 
singularities join each other and form a nonsingular bridge. The bridge enlarges, 
until it reaches a maximum radius at the throat of r = 2M (maximum circumference 
of 4nM; maximum surface area of 167tA/ 2 ). It then contracts and pinches off, leaving 
the two universes disconnected and containing singularities (r = 0) once again. The 
formation, expansion, and collapse of the bridge occur so rapidly, that no particle 
or light ray can pass across the bridge from the faraway region of the one universe 
to the faraway region of the other without getting caught and crushed in the throat 
as it pinches off. (To verify this, examine the Kruskal-Szekeres diagram of Figure 
31.3, where radial light rays move along 45-degree lines.) 
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V 





Spacelike slices 


History 

A-B-C-D-E-F-G 


History 

A-W-X-D-Y-Z-G 


Figure 31.6. 

Dynamical evolution of the Einstein-Rosen bridge of the vacuum Schwarzschild geometry (schematic). 
Shown here are two sequences of embedding diagrams corresponding to two different ways of viewing 
the evolution of the bridge—History A-B-C-D-E-F-G, and History A-W-X-D-Y-Z-G. The embedding 
diagrams are skeletonized in that each diagram must be rotated about the appropriate vertical axis in 
order to become two-dimensional surfaces analogous to Figure 31.5,a. [Notice that the hypersurfaces 
of which embedding diagrams are given intersect the singularity only tangentially. Hypersurfaces that 
intersect the singularity at a finite angle in the w,i/-plane are not shown because they cannot be embedded 
in a Euclidean space. Instead, a Minkowski space (indefinite metric) must be used, at least near r = 0. 
For an example of an embedding in Minkowski space, see the discussion of a universe with constant 
negative spatial curvature in equations (27.23) and (27.24) and Box 27.2C.] Figure 21.4 exhibits embed¬ 
ding diagrams for other spacelike slices in the Schwarzschild geometry. 


From the Kruskal-Szekeres diagram and the 45-degree nature of its radial light 
rays, one sees that any particle that ever finds itself in region IV of spacetime must 
have been “created” in the earlier singularity; and any particle that ever falls into 
region II is doomed to be crushed in the later singularity. Only particles that stay 
forever in one of the asymptotically flat universes I or III, outside the gravitational 
radius (r > 2 A/), are forever safe from the singularities. 

Some investigators, disturbed by the singularities at r — 0 or by the “double-uni¬ 
verse” nature of the Schwarzschild geometry, have proposed modifications of its 
topology. One proposal is that the earlier and later singularities be identified with 
each other, so that a particle which falls into the singularity of region II, instead 
of being destroyed, will suddenly reemerge, being ejected, from the singularity of 
region IV. One cannot overstate the objections to this viewpoint: the region r — 0 
is a physical singularity of infinite tidal gravitation forces and infinite Riemann 
curvature. Any particle that falls into that singularity must be destroyed by those 


Creation and destruction in 
the singularities 


Nonviable proposals for 
modifying the topology of 
Schwarzschild spacetime 
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forces. Any attempt to extrapolate its fate through the singularity using Einstein's 
field equations must fail; the equations lose their predictive power in the face of 
infinite curvature. Consequently, to postulate that the particle reemerges from the 
earlier singularity is to make up an ad hoc mathematical rule, one unrelated to 
physics. It is conceivable, but few believe it true, that any object of finite mass will 
modify the geometry of the singularity as it approaches r — 0 to such an extent 
that it can pass through and reemerge. However, whether such a speculation is correct 
must be answered not by ad hoc rules, but by concrete, difficult computations within 
the framework of general relativity theory (see Chapter 34). 

A second proposal for modifying the topology of the Schwarzschild geometry is 
this: one should avoid the existence of two different asymptotically flat universes 
by identifying each point (u, w. 0, <>) with its opposite point ( —u, — h, 0, <t>) the 
Kruskal-Szekeres coordinate system. Two objections to this proposal are: (1) it pro¬ 
duces a sort of “conical” singularity (absence of local Lorentz frames) at ( v, u) = 
(0,0), i.e., at the neck of the bridge at its moment of maximum expansion; and 
(2) it leads to causality violations in which a man can meet himself going backward 
in time. 

One good way for the reader to become conversant with the basic features of 
the Schwarzschild geometry is to reread §§31.1-31.4 carefully, reinterpreting every¬ 
thing said there in terms of the Kruskal-Szekeres diagram. 


Exercise 31.7. SCHWARZSCHILD METRIC 

IN ISOTROPIC COORDINATES 

(a) Show that, rewritten in the isotropic coordinates of Exercise 23.1, the Schwarzschild metric 
reads 

ds 2 = - ( ] ~ df- + (l + [dT 2 + V + sin 2 # d* 2 )]; (31.22) 

and derive the transformation 


r — r(l 4- M/27) 2 


(31.23) 


between the two radial coordinates. 

(b) Which regions of spacetime (I, II, III, IV; see Figure 31.3) are covered by the isotropic 
coordinate patch, and which are not? 

(c) Calculate and construct an embedding diagraniTbr the _ spacelike^Tiypersurfatc i —T), 
0 < 7 < oo. 

(d) Find a coordinate transformation that interchanges the region near 7 = 0 with the 
region near 7 — oo, while leaving the metric coefficients in their original form. 

Exercise 31.8. REISSNER-NORDSTR0M GEOMETRY 

(a) Solve the Einstein field equations for a spherically symmetric, static gravitational field 


ds 2 = _^20<r) ^2 _j_ e 2A(r) rf r 2 _j_ r ^dQ 2 + sin 2 # d$ 2 ). 
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with no matter present, but with a radial electric field B- 0, E = /(r)e f in the static 
orthonormal frame 

w l = e* dt, ut r = e A dr, w § = r d6, at* — r sin 8 d$. 

Use as a source in the Einstein field equations the stress-energy of the electric field. [Answer: 

E = ( Q/r 2 )e p, (31.24a) 

* 2 = - (! - yy- + ys) dt * + ( 1 - + yr) * dr2 + r2( - d02 + sin20 d( i> 2 )- ( 3 1 -24b) 

This is called the “Reissner (1916)-Nordstr0m (1918) metric”.] 

(b) Show that the constant Q is the total charge as measured by a distant observer (r > 2M 
and r > Q ), who uses a Gaussian flux integral, or who studies the coulomb-force-dominated 
orbits of test charges with charge-to-mass ratio e/n > M/Q. What is the charge-to-mass ratio, 
in dimensionless units, for an electron? Show that the constant Mis the total mass as measured 
by a distant observer using the Keplerian orbits of electrically neutral particles. 

(c) Show that for Q > M, the Reissner-Nordstrom coordinate system is well-behaved from 
r = oo down to r = 0, where there is a physical singularity and infinite tidal forces. 

(d) Explore the nature of the spacetime geometry for Q < M, using all the techniques 
of this chapter (coordinate transformations, Kruskal-like coordinates, studies of particle 
orbits, embedding diagrams, etc.). 

[Solution: see Graves and Brill (1960); also Fig. 34.4 of this book.] 

(e) Similarly explore the spacetime geometry for Q = M. [Solution: see Carter (1966b).] 

(f) For the case of a large ratio of charge to mass [Q > M as in part (c)], show that the 
region near r = 0 is unphysical. More precisely, show that any spherically symmetric distri¬ 
bution of charged stressed matter that gives rise to the fields (31.24) outside its boundary 
must modify these fields for r < r 0 = Q 2 /2M . [Hint: Study the quantity m(r) defined in 
equations (23.18) and (32.22h), noting its values deduced from equation (31.24), on the one 
hand, and from the appropriate Einstein equation within the matter distribution, on the other 
hand. See Figure 26 of Misner (1969a) for a similar argument.] 
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GRAVITATIONAL COLLAPSE 


Now, here, you see, it takes ail the running you can do, to keep in the same 
place. If you want to get somewhere else, you must run twice as fast as that. 

The Red Queen, in Through the Looking Glass, 
LEWIS CARROLL (1871) 




§32.1. RELEVANCE OF SCHWARZSCHILD GEOMETRY 

The story that unfolded in the preceeding chapter was fantastic! One began with 
the innocuous looking Schwarzschild line element 

ds 2 = - (l - m) dt 2 + - + r 2 (d8 2 + sin 2 # d<j> 2 ), (32.1) 

which was derived originally as the external field of a static star. One asked what 
happens if the star is absent; i.e., one probed the nature of the Schwarzschild 
geometry when no star is present to generate it. One might have expected the 
geometry to be that of a point mass sitting at r = 0. But it was not. It turned out 
to represent a “wormhole” connecting two asymptotically flat universes. Moreover, 
the wormhole was dynamic. It was created by the “joining together” of two ‘V = 0” 
singularities, one in each universe; it expanded to a maximum circumference of 4ttM; 
it then recontracted and pinched off, leaving the two universes disconnected once 
again, each with its own ‘V = 0” singularity. 

The roles and relevance of As a solution to Einstein’s field equations, this expanding and recontracting 

the Schwarzschild geometry wormhole must be taken seriously. It is an exact solution; and it is one of the simplest 

of all exact solutions. But there is no reason whatsoever to believe that such worm- 
holes exist in the real universe! They can exist only if the expanding universe, 
—10 x 10 9 years ago, was “bom” with the necessary initial conditions—with ‘V = 0” 
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Schwarzschild singularities ready and waiting to blossom forth into wormholes. There 
is no reason at all to believe in such pathological initial conditions! 

Why, then, was so much time and effort spent in Chapter 31 on understanding 
the Schwarzschild geometry? (1) Because it illustrates clearly the highly non- 
Euclidean character of spacetime geometry when gravity becomes strong; 
(2) because it illustrates many of the techniques one can use to analyze strong 
gravitational fields; and most importantly (3) because, when appropriately truncated, 
it is the spacetime geometry of a black hole and of a collapsing star—as well as 
of a wormhole. 

This chapter explores the role of the Schwarzschild geometry in gravitational 
collapse; the next chapter explores its role in black-hole physics. 


§32.2. BIRKHOFF'S THEOREM 


That the Schwarzschild geometry is relevant to gravitational collapse follows from 
Birkhoff’s (1923) theorem: Let the geometry of a given region of spacetime (1 ) be 
spherically symmetric , and (2) be a solution to the Einstein field equations in vacuum. 
Then that geometry is necessarily a piece of the Schwarzschild geometry . The external 
field of any electrically neutral, spherical star satisfies the conditions of Birkhoff’s 
theorem, whether the star is static, vibrating, or collapsing. Therefore the external 
field must be a piece of the Schwarzschild geometry. 

Birkhoff’s theorem is easily understood on physical grounds. Consider an equilib¬ 
rium configuration that is unstable against gravitational collapse and that, like all 
equilibrium configurations (see §23.6), has the Schwarzschild geometry as its external 
gravitational field. Perturb this equilibrium configuration in a spherically symmetric 
way, so that it begins to collapse radially. The perturbation and subsequent collapse 
cannot affect the external gravitational field so long as exact spherical symmetry 
is maintained. Just as Maxwell’s laws prohibit monopole electromagnetic waves, so 
Einstein’s laws prohibit monopole gravitational waves. There is no possible way for 
any gravitational influence of the radial collapse to propagate outward. 

Not only is Birkhoff’s theorem easy to understand, but it is also fairly easy to 
prove. Consider a spherical region of spacetime. Spherical symmetry alone is suffi¬ 
cient to guarantee that conditions (i), (ii), and (iii) of Box 23.3 are satisfied, and 
thus to guarantee that one can introduce Schwarzschild coordinates 


The uniqueness of the 
Schwarzschild geometry: 
Birkhoff’s theorem 


The physics underlying 
Birkhoff's theorem 


Proof of Birkhoff's theorem 


ds 2 = -e 2 * dt°- + e 2a dr 2 + r°-(d6 2 + sin 2 6 d<j> 2 ), 

<P = <P(t , r), and A = A(t, r). (32.2) 

[See Box 23.3 for proof; and notice that: (1) for generality one must allow g tt = - e 20 
an d Stt = e2A 1° be positive or negative (no constraint on sign!); (2) at events where 
the gradient of the “circumference function” r is zero or null, Schwarzschild coor¬ 
dinates cannot be introduced. The special case (Vr) 2 = 0 is treated in exercise 32.1.] 
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Impose Einstein’s vacuum field equation on the metric (32.2), using the orthonormal 
components of the Einstein tensor as derived in exercise 14.16: 


G ii = 

r~ 2 (l 

- e~ 2A ) + 2(A, r 

/r)e~ 2A = 0, 

(32.3a) 

G» = 

G ri ~ 

2(A, l /r)e~ (A + ,t ' ) 

= 0 , 

(32.3b) 


2{<PJr)e- 2A + r-\e~ 

2A - 1) = 0, 

(32.3c) 

II 

^00 “ 

+($,„ + <P, r 2 

- <P, r A, r + <PJr - A, r /r)e~ 2A 




- (A« + A .t 2 

- A t <P t )e~ 2,p = 0. 

(32.3d) 


Equation (32.3b) guarantees that A is a function of r only, and equation (32.3a) 
then guarantees that A has the same form as for the Schwarzschild metric: 

A = -yln|l - 2M/r\. (32.4a) 

Equations (32.3c,d) then become two equivalent equations for <P(t, r) —equivalent 
by virtue of the Bianchi identity, ▼ • G = 0—whose solution is 

<P = iln |1 - 2M/r\ + f{t). (32.4b) 

Here /is an arbitrary function. Put expressions (32.4) into the line element (32.2); 
thereby obtain 

ds 2 = -e 2fW ( 1 - dt 2 + - d C, -— + r\d6 2 + sin 2 0 d<f> 2 ). 

\ r / 1 — IM/r 

Then redefine the time coordinate 

{ — F dt 

‘new J Uh 

and thereby bring the line element into the Schwarzschild form 

ds 2 = - (l - 1M-) dt 2 + - dr l --— + r 2 (d8 2 + sin 2 ^ d<f> 2 ). 

Conclusion: When the spacetime surrounding any object has spherical symmetry 
and is free of charge, mass, and all fields other than gravity, then one can introduce 
coordinates in which the metric is that of Schwarzschild. Conclusion restated in 
coordinate-free language: the geometry of any spherically symmetric vacuum region 
of spacetime is a piece of the Schwarzschild geometry (Birkhoff’s theorem), q.e.d. 


Exercise 32.1. UNIQUENESS OF REISSNER-NORDSTR0M 
GEOMETRY [Track 2] 

Prove the following generalization of Birkhoff’s theorem. Let the geometry of a given region 
of spacetime (1) be spherically symmetric, and (2) be a solution to the Einstein field equations 
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with an electromagnetic field as source. Then that geometry is necessarily a piece of the 
Reissner-Nordstrom geometry [equation (3124b)] with electric and magnetic fields, as meas¬ 
ured in the standard static orthonormal frames 


E = ( Q e /r 2 )e h B = (Q m /r 2 )e h Q = ( Q e 2 + QJ) " 2 . 

[Hints: (1) First consider regions of spacetime in which (Vr) 2 ^ 0, using the same methods 
as the text uses for BirkhofFs theorem. The result is the Reissner-Nordstrom solution. 
(2) Any region of dimensionality less than four, in which (Vr) 2 = 0 (e.g., the Schwarzschild 
radius), can be treated as the join between four-dimensional regions with (Vr) 2 ^ 0. More¬ 
over, the geometry of such a region is determined uniquely by the geometry of the adjoining 
four-dimensional regions (“junction conditions”; §21.13). Since the adjoining regions are 
necessarily Reissner-Nordstrom (step 1), then so are such “sandwiched” regions. (3) Next 
consider four-dimensional regions in which Vr = dr is null and nonzero. Show that in such 
regions there exist coordinate systems with 

ds 2 = —24' drdt + r 2 (d8 2 + sin 2 0^ 2 ), 

where 'F = 'Ffr, /). Show further that the Ricci tensor for this line element has an orthonor- 
malized component 


Ree = 1/r 2 , 

whereas the stress-energy tensor for a spherically symmetric electromagnetic field has 

8 vT re = 8 ir[r r) - = Q 2 /r\ Q - const. 

These quantities, R§§ and 8must be equal (Einstein’s field equation) but cannot be 
because of their different r-dependence. Thus, an electromagnetic field cannot generate 
regions with dr ^ 0, dr* dr = 0. (4) Finally, consider four-dimensional regions in which 
dr — 0. Denote the constant value of r by a, and show that any event can be chosen as 
the origin of a locally well-behaved coordinate system with 

ds 2 = tf 2 ( — dr 2 -f- e 2X dz 2 + dd 2 + sin 2 0d£ 2 ), 

X = A(r, z), A(t = 0, z) = 0, A(r = 0, z) = 0. 

[Novikov-type coordinate system; see §31.4.] Show that, in the associated orthonormal frame, 
spherical symmetry demands 

E = ( QJa 2 )e h B = {QJa 2 )e h Q = (Q 2 + Qj)" 2 , 

and that the Einstein field equations then require Q = a and e x = cos r, so that 
ds 2 = Q 2 ( — dr 2 + cos 2 ? dz 2 + dd 2 + sin 2 d d<f> 2 ). 

(5) This solution of the field equations [sometimes called the “Bertotti (1959)-Robinson (1959a) 
Electromagnetic Universe,” and explored in this coordinate system by Lindquist (I960)] is 
actually the throat of the Reissner-Nordstrom solution for the special case Q = M. Verify 
this claim by performing the following coordinate transformation on the Reissner-Nordstrom 
throat region [equation (31.24b) with Q = M and \r — Q\ < Q]: 


r — Q — Qe z cos r. 


t = Qe z tan?. 
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(6) Thus, each possible case leads either to no solution at all, or to a segment of the 
Reissner-Nordstrom geometry, q.e.d.1 Note: The missing case. (Vr) 2 = 0, in the text’s proof 
of Birkhoff's theorem, is resolved by noting that, for 0 = 0, steps (3) and (4) above lead 
to no solutions at all. We thank G. F. R. Ellis for pointing out the omission of the case 
(V>) 2 = 0 in the preliminary version of this book. 


§32.3. EXTERIOR GEOMETRY OF A COLLAPSING STAR 


Gravitational collapse 
analyzed by examining the 
star's exterior, Schwarzschild 
geometry 


The gravitational radius as a 
point of no return, and the 
"crushing" at r = 0 


Consider a star that is momentarily static, but will subsequently begin to collapse. 
Its space geometry at the initial moment of Schwarzschild coordinate time, t = 0, 
has two parts: in the exterior, vacuum region (r > R > 2M), it is the Schwarzschild 
geometry (BirkhofTs theorem!); but in the star’s interior, it is some other, totally 
different geometry. Whatever the interior geometry may be, it has an embedding 
diagram at time t — 0 which is qualitatively like that of Figure 23.1. (For discussion 
and proof of this, see §23.8.) Notice that the star’s space geometry is obtained by 
discarding the lower universe of the full Schwarzschild geometry (Figure 31.5,a), and 
replacing it with a smooth “bowl” on which the matter of the star is contained. 

To follow the subsequent collapse of this star in the Schwarzschild coordinate 
system, or in the Kruskal-Szekeres coordinate system, or in an ingoing Eddington- 
Finkelstein coordinate system, one can similarly discard that part of the coordinate 
diagram which lies inside the star’s surface, and keep only the exterior Schwarzschild 
region. (See Figure 32.1.) In place of the discarded interior Schwarzschild region, 
one must introduce some other coordinate system, line element, and diagram that 
correctly describe the interior of the collapsing star. 

From truncated coordinate diagrams (such as Figures 32.1,a,b,c), one can readily 
discover and understand the various peculiar features of collapse through the gravi¬ 
tational radius. 

(1) No matter how stiff may be the matter of which a (spherical) star is made, 
once its surface has collapsed within the gravitational radius, the star will continue 
to collapse until its surface gets crushed in the singularity at r = 0. This one discovers 
by recalling that the star’s surface cannot move faster than the speed of light, so 
its world line must always make an angle of less than 45 degrees with the u-axis 
of the Kruskal-Szekeres diagram. 

(2) No signal (e.g., photon) emitted from the star’s surface after it collapses inside 
the gravitational radius can ever escape to an external observer. Rather, all signals 
emitted from inside the gravitational radius get caught and destroyed by the collapse 
of the surrounding geometry into the singularity at r = 0 as space “pinches off” 
around the star. 

(3) Consequently, an external observer can never see the star after it passes the 
gravitational radius; and he can never see the singularity that terminates its col¬ 
lapse—unless he chooses to fall through the gravitational radius himself and pay 
the price of death for the knowledge gained. 
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Does this mean that the collapsing star instantaneously and completely disappears 
from external view as it reaches the gravitational radius? No, not according to the 
analysis depicted in Figure 31.1,c: Place an astrophysicist on the surface of a 
collapsing star, and have him send a series of uniformly spaced signals to a distant 
astronomer, at rest at r > 2 M, to inform him of the progress of the collapse. These 
signals propagate along null lines in the spacetime diagram of Figure 31.1c. The 
signals originate on the world line of the stellar surface, and they are received by 
the distant astronomer when they intersect his world line, r — constant > M. As the 
star collapses closer and closer to its gravitational radius, R = 2M, the signals, which 
are sent at equally spaced intervals according to the astrophysicist’s clock, are 
received by the astronomer at more and more widely spaced intervals. The astrono¬ 
mer does not receive a signal emitted just before the gravitational radius is reached 
until after an infinite amount of time has elapsed; and he never receives signals 
emitted after the gravitational radius has been passed. Those signals, like the astro¬ 
physicist who sends them, after brief runs get caught and destroyed by the collapsing 
geometry in the singularity, at r = 0. It is not only the star that collapses. The 
geometry around the star collapses. 

Hence, to the distant astronomer, the collapsing star appears to slow down as it 
approaches its gravitational radius: light from the star becomes more and more 
red-shifted. Clocks on the star appear to run more and more slowly. It takes an 
infinite time for the star to reach its gravitational radius; and, as seen by the distant 
astronomer, the star never gets beyond there. 

The optical appearance of a collapsing star was first analyzed mathematically, 
giving main attention to radially propagating photons, by J. R. Oppenheimer and 
H. Snyder (1939). More recently a number of workers have reexamined the problem 
[see, e.g., Podurets (1964), Ames and Thome (1968) and Jaffe (1969)]. The most 
important quantitative results of these studies are as follows. In the late stages of 
collapse, when the distant astronomer sees the star to be very near its gravitational 
radius, he observes its total luminosity to decay exponentially in time 


The redshift of signals 
emitted just before passage 
through the gravitational 
radius 


Optical appearance of the 
collapsing star 


L oc exp 


( 


——)• 
3\/3 2 Ml 


(32.5) 


Simultaneously, photons that travel to him along radial trajectories arrive with 
exponentially increasing redshifts 

z = A\/X oc e t/4M . (32.6) 

However, the light from the star is dominated in these late stages, not by photons 
flying along radial trajectories from near the gravitational radius, but by photons 
that were deposited by the star in unstable circular orbits as its surface passed 
through r — 3M (see §25.6 and Box 25.7). As time passes, these photons gradually 
leak out the diffuse spherical shell of trapped photons at r = 3 A/and fly off to the 
distant observer, who measures them to have redshift z ~ 2. Consequently, in the 
late stages of collapse the star’s spectral lines are broadened enormously, but they 
are brightest at redshift z zz 2. 

(continued on page 850) 
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(a) Schwarzschild 


(b) Kruskal-Szekeres 


Figure 32.1. 

The free-fall collapse of a star of initial radius R { = 10 M as depicted alternatively in (a) Schwarzschild 
coordinates, (b) Kruskal-Szekeres coordinates, and (c) ingoing Eddington-Finkelstein coordinates (see 
Box 31.2). The region of spacetime inside the collapsing star is grey, that outside it is white. Only 
the geometry of the exterior region is that of Schwarzschild. The curve separating the grey and white 
regions is the geodesic world line of the surface of the collapsing star (equations [31.10] or [32.10] with 
r max = = 10 M). This world line is parameterized by proper time, t, as measured by an observer 

who sits on the surface of the star; the radial light cones, as calculated from ds 2 — 0, are attached to it. 

Notice that, although the shapes of the light cones are not all the same relative to Schwarzschild 
coordinates or relative to Eddington-Finkelstein coordinates, they are all the same relative to Kruskal- 
Szekeres coordinates. This is because light rays travel along 45-degree lines in the u,u-plane (dv = ±du ), 
but they travel along curved paths in the r,/-plane and r, K-plane. 

The Kruskal-Szekeres spacetime diagram shown here is related to the Schwarzschild diagram by 
equations (31.13) plus a translation of Schwarzschild time: t —*-/ + 42.8 M. The Eddington-Finklestein 
diagram is related to the Schwarzschild diagram by 

V=t + r* = t + r + 2M\n \r/2 M — 1[ 

(see Box 31.2). 

It is evident from these diagrams that the free-fall collapse is characterized by a constantly diminishing 
radius, which drops from R = 10 M to R = 0 in a finite and short comoving proper time interval, 
At = 35.1 M. The point R = 0 and the entire region r = 0 outside the star make up a physical “singular¬ 
ity” at which infinite tidal gravitational forces—according to classical, unquantized general relativity—can 
and do crush matter to infinite density (see end of §31.2; also §32.6). 




(c) Eddington-Finkelstein 

The Eddington-Finkelstein diagram depicts a series of photons emitted radially from the surface of 
the collapsing star, and received by an observer at r = /? initial — 10 M. The observer eventually-receives 
all photons emitted radially from outside the gravitational radius; all photons emitted after the star passes 
through its gravitational radius eventually get pulled into the singularity at r = 0; and any photon emitted 
radially at the gravitational radius stays at the gravitational radius forever. 

Non-free-fall collapse is similar to the collapse depicted here. When pressure gradients are present, 
only the detailed shape of the world line of the star’s surface changes. 
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Notice how short is the characteristic e-folding time for the decay of luminosity 
and for the radial redshift: 


T cha r = 2M- 1 X 10 ~ 5 (M/M q ) sec 

( light-travel time across a flat-space 
distance equal to the gravitational radius 

Here A/ 0 denotes one solar mass. 


(32.7) 


EXERCISE 



Exercise 32.2. REDSHIFTS DURING COLLAPSE 

(a) Let a radio transmitter on the surface of a collapsing spherical star emit monochromatic 
waves of wavelength \ e : and let a distant observer, at the same 6 , <£, as the transmitter, receive 
the waves. Show that at late times the wavelength received varies as 

\ /\ cc e t/4M (32.8a) 

v rec' v em v 7 

[equation (32.6)], where t is proper time as measured by the distant observer. 

(b) [Track 2] Use kinetic theory for the outgoing photons (conservation of density in phase 
space: Liouville’s theorem; §22.6) to show that the energy flux of the radiation received 
(ergs/cm 2 sec) varies as 


F oc e~ t/M . 


(32.8b) 


- \ 

The rest of this chapter is 
Track 2. No, previous Track-2 
material is needed as 
preparation for it, but it is 
needed as preparation for 

(1) the Track-2 part of 
Chapter 33 (black holes), and 

(2) Chapter 34 (singularities 
and global methods). 

_ J 


(c) Suppose that nuclear reactions at the center of the collapsing star generate neutrinos 
of energy £ e , and that these neutrinos flow freely outward (negligible absorption in star). 
Show that the energy of the neutrinos received by a distant observer decreases at late times 
as 

E n JE e oc (32.9a) 

(d) Show that the flux of neutrino energy dies out at late times as 

F oc e ~ t/2M . (32.9b) 

(e) Explain in elementary terms why the decay laws (32.8a) and (32.9a) for energy are 
the same, but the decay laws (32.8b) and (32.9b) for energy flux are different. 

(f) Let a collapsing star emit photons from its surface at the black-body rate 


dN /, r irm photons \ w /surface area\ ^ /temperature^ 3 
— - yl.5 x 10 cm 2 sec £3/ x \ Q f star / x \of surface / 


Let a distant observer count the photons as they pass through his sphere of radius r M. 
Let him begin his count (time t = 0) when he sees (via photons traveling radially outward) 
the center of the star’s surface pass through the radius r = 3M. Show that, in order of 
magnitude, the time he and his associates must wait, until the last photon that will ever 
get out has reached them, is 

t = (A//A/ g )[8 X lO" 4 + 5 X 10“ 5 log 10 (r n A//A/ o )] seconds, (32.9c) 

where T n is the star’s surface temperature in units of 10 11 K. 
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§32.4. COLLAPSE OF A STAR WITH UNIFORM DENSITY 
AND ZERO PRESSURE 


When one turns attention to the interior of a collapsing star and to the precise world 
line that its surface follows in the Schwarzschild geometry, one encounters rather 
complicated mathematics. The simplest case to treat is that of a “star” with uniform 
density and zero pressure; and, indeed, until recently that was the only case which 
had been treated in detail. The original—and very complete—analysis of the collapse 
of such a uniform-density “ball of dust” was given in the classic paper of Oppen- 
heimer and Snyder (1939). More recently, other workers have discussed it from 
slightly different points of view and using different coordinate systems. The ap¬ 
proach taken here was devised by Beckedorff and Misner (1962). 

Because no pressure gradients are present to deflect their motion, the particles 
on the surface of any ball of dust must move along radial geodesics in the exterior 
Schwarzschild geometry. For a ball that begins at rest with finite radius, R — R ix 
at time t = 0, the subsequent geodesic motion of its surface is given by equations 

(31.10): 

R = (Ri/2)(\ + cos?)), (32.10a) 


t = 2M\n 


(RJ2M - 1 ) 1/2 + tan (t)/2) 


(RJ2M - 1) 1/2 - tan (tj/2) 

+ 2M(RJ2M — l) 1/2 h + (V4 AD(t] + sin rj)]. 


(32.10b) 


The collapse, from rest, of a 
uniform-density ball of 
"dust": 


(1) world line of ball's 
surface in exterior 
Schwarzschild 
coordinates 


Here R is the Schwarzschild radial coordinate (i.e., 4 t tR 2 is the star’s surface area) 
at Schwarzschild time t. This world line is plotted in Figure 32.1 for R x = 10 M, in 
terms of Schwarzschild coordinates, Kruskal-Szekeres coordinates, and Eddmgton- 
Finkelstein coordinates. The proper time read by a clock on the surface of the 
collapsing star is given by equation (31.10b): 

t = (fl?/8A0 1/2 (7) + sini)). (32.10c) 

Note that the collapse begins when the parameter rj is zero (R = R t , t = t — 0), 
and it terminates at the singularity (R = 0, ij = w) after a lapse of proper time, as 
measured on any test particle falling with the dust, equal to 


At = ir(R?/8M) 1/2 . 


It is interesting, though coincidental, that this is precisely the time-lapse required 
for free-fall collapse to infinite density in Newtonian theory [see equation (25.27'), 
Figure 25.3, and associated discussion]. 

What is the behavior of the interior of the ball of dust as it collapses? A variety 
of different interiors for pressureless dust can be conceived (exercise 32.8). But here 
attention focuses on the simplest of them: an interior that is homogeneous and 
isotropic everywhere, except at the surface—i.e., an interior locally identical to a 
dust-filled Friedmann cosmological model (Box 27.1). Is the Friedmann interior to 
be “open” (k = -1), “flat” (k = 0), or “closed” (k = +1)? Only the closed case 


(2) interior of ball is identical 
to a portion of a closed 
Friedmann universe 
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is appropriate, since one has already demanded [equation (32.10)] that the star be 
at rest initially (initial rate of change of density equals zero; "‘moment of maximum 
expansion”). 

Using comoving hyperspherical coordinates, x> f° r star’s interior, and 
putting the origin of coordinates at the star’s center, one can write the line element 
in the interior in the familiar Friedmann form 


the join between 
Friedmann interior and 
Schwarzschild exterior 


ds 2 = —dr 2 + a 2 (r)[dx 2 + s\n 2 x(d6 2 + sin 2 8 d$ 2 )]. 
Here a(r) is given by the familiar cycloidal relation, 

a = y a mO + COST)), 

T = + sinrj); 

and the density is given by 

p = (3a m /87T)a- 3 = (3/87ra m 2 )[T(i + cost))] 


(32.11) 


(32.12) 


(32.13) 


[see equations (1), (9), (4), and (5) of Box 27.1, with ?) replaced by tj + 7r], 

There is one possible difficulty with this interior solution. In the cosmological case, 
the solution was homogeneous and isotropic everywhere. Here homogeneity and 
isotropy are broken at the star’s surface—which lies at some radius 

X = Xo (32.14) 


for all r, as measured in terms of the hyperspherical polar angle x, a comoving 
coordinate (first picture in Box 27.2). At that surface (i.e., three-dimensional world 
tube enclosing the star’s fluid) the interior Friedmann geometry must match smoothly 
onto the exterior Schwarzschild geometry. If the match cannot be achieved, then 
the Friedmann line element (32.11) cannot represent the interior of a collapsing star. 
An example of a case in which the matching could not be achieved is an interior 
of uniform and nonzero pressure, as well as uniform density. In that case there would 
be an infinite pressure gradient at the star’s surface, which would blow off the outer 
layers of the star, and would send a rarefaction wave propagating inward toward 
its center. The uniform distribution of density and pressure would quickly be de¬ 
stroyed. 

For the case of zero pressure, the match is possible. As a partial verification of 
the match, one can examine the separate and independent predictions made by the 
interior and exterior solutions for the star’s circumference, C = as a function 
of proper time r at the star’s surface. The external Schwarzschild solution predicts 
the cycloidal relation, 

C = 2irR = 2*r(JV2)(l + cost]), 

* (32.15) 

T = + sin 7]) 


[equations (32.10)]. The interior Friedmann solution predicts a similar cycloidal 
relation: 
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C = 2? tR = 2rra sin x 0 = 2?r (y a m sin Xo)( 1 + cos tj), 

T = + sin V)- (32.16) 

The two predictions agree perfectly for all time if and only if 

R i = a m sin Xo- 

M = ya m sin 3 x 0 . 

A more complete verification of the match is given in exercise 32.4. 

For further insight into this idealized model of gravitational collapse, see Box 32.1. 


(32.17a) 

(32.17b) 


Exercise 32.3. EMBEDDING DIAGRAMS AND PHOTON PROPAGATION 
FOR COLLAPSING STAR 

Verify in detail the features of homogeneous collapse described in Box 32.1. 

Exercise 32.4. MATCH OF FRIEDMANN INTERIOR 
TO SCHWARZSCHILD EXTERIOR 

The Einstein field equations are satisfied on a star’s surface if and only if the intrinsic and 
extrinsic geometries of the surface’s three-dimensional world tube are the same, whether 
measured on its interior or on its exterior (see §21.13 for proof and discussion). Verify that 
for the collapsing star discussed above, the intrinsic and extrinsic geometries match at the 
join between the Friedmann interior and the Schwarzschild exterior. [Hints: (a) Use tj , 0 , <£, 
as coordinates on the world tube of the star’s surface, and show that the intrinsic geometry 
has the same line element 


ds 2 = a 2 (7i)l-drj 2 + sin 2 x 0 (^ 2 + sin 2 0 d$ 2 )], (32.18a) 

whether measured in the Schwarzschild exterior or in the Friedmann interior, (b) Show that 
the extrinsic curvature of the world tube has the same components 


K vv = K ve = = K h = 0, 

K ee = ^/sin 2 6> = -a(rj) sin Xo cos x 0 , 

whether measured in the Schwarzschild exterior or in the Friedmann interior.] 


(32.18b) 


Exercise 32.5. STARS THAT COLLAPSE FROM INFINITY 

(a) Patch together a truncated Schwarzschild geometry and the geometry of a truncated “flat” 
(j k — 0), dust-filled Friedmann universe to obtain a model of a star that collapses from rest 
at an infinite initial radius. [Hint: The world line of the star’s surface in the Schwarzschild 
geometry is given by equations (31.2).] 

(b) Similarly patch together a truncated Schwarzschild geometry and the geometry of a 
truncated “open” (k — — 1), dust-filled Friedmann universe to obtain a star which collapses 
from infinity with finite initial inward velocity. 


EXERCISES 


{continued on page 857) 
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Box 32.1 AN IDEALIZED COLLAPSING STAR 
WITH FRIEDMANN INTERIOR 
AND SCHWARZSCHILD EXTERIOR 


(See §32.4 and exercises 32.3 and 32.4 
for justification of the results 
described here.) 


Initial State 

(1) Take a F riedmann universe of radius a — a m at its moment of maximum expansion, 
r) = 0; and slice off and discard the region Xo < X < w > where x 0 is some angle less 
than it/ 2. (2) Take a Schwarzschild geometry of mass M = (aj 2) sin 3 x 0 at the 
moment t - 0; and slice off and discard the region r < R { = a m sin x 0 - (3) Glue 
the remaining pieces of Friedmann and Schwarzschild geometry together smoothly 
along their cut surfaces. The resultant object will be a momentarily static star of 
uniform density p ; = 3/(87 ra m 2 ), of mass M = (fl m /2) sin 3 x 0 . an< i °f radius R t = 
a m sin Xo- 


Subsequent Evolution 

Release this star from its intial state, and let it collapse in accord with Einstein’s 
field equations. The interior, truncated Friedmann universe and the exterior, truncated 
Schwarzschild geometry will evolve just as though they had never been cut up and 
patched together; and this evolution will preserve the smoothness of the match 
between interior and exterior! 


Details of the Collapse 

Probe the details of the collapse using sequences of embedding diagrams (histories 
ABCD and AWXY), and using photons that propagate radially outward (photons 
a, /?, y, S, «). The example shown here has Xo = 0-96 and R { /M = 2/sin 2 x 0 = 3. 


/ 


Interior 

(Friedmann) 



Exterior 



a 


History of Collapse as Probed 
by Hypersurfaces ABCD: 

(1) Initial configuration, A — A' — A'\ is that 
constructed by cutting and sewing at times 
rj = t = 0. 

(2) Each subsequent configuration has as its inte¬ 
rior a slice of constant Friedmann time rj. 




(3) The interior remains always a spherical cup of 
half-angle Xo> but it contracts from radius 
R = R. = a m sin x 0 t0 R = 0 as time increases. 

(4) The matter in the star is all crushed simultane¬ 
ously to infinite density when R reaches zero, 
and the external Schwarzschild “funnel” de¬ 
velops a cusp-like singularity at that point. 

(5) As time increases further, this cusp pulls the 
region r < 2M of the funnel into r — 0 so fast 
that the outward-traveling photon 8 is gobbled 
up and crushed. 



P 



These embedding diagrams must be rotated about the vertical 
axes in order to become 2-dimensional surfaces analogous to 
Figure 23.1. 





Box 32.1 {continued) 


A" 


History as Probed by Hypersurfaces AWXY 

(1) Initial configuration, A — A' — A", is again 
that constructed by cutting and sewing at 

7 ] — / — 0 . 

(2) Subsequent hypersurfaces are very different 
from 7] = const. 

(3) As time passes, a neck develops in the geom¬ 
etry just outside the surface of the star. 

(4) This neck becomes tighter and tighter and then 
pinches off, leaving the star completely iso¬ 
lated from the rest of the universe, and leaving 
a deadly cusp-like singularity in the exterior 
geometry where the star used to be. 

(5) The isolated star, in its own little closed uni¬ 
verse, continues to contract until it is crushed 
to infinite density, while the external geometry 
begins to develop another neck and the cusp 
quickly gobbles up photon 8. 





x 



The extreme difference between histories ABCD and AWXY dramatizes the 
“many-fingered time” of general relativity. The hypersurface on which one explores 
the geometry can be pushed ahead faster in time in one region, at the option of 
the party of explorers. Thus whether one region of the star collapses first, or another, 
or the entire star collapses simultaneously, is a function both of the spacetime 
geometry and of the choice of slicing. The party of explorers has this choice of 
slicing at their own control, and thus they themselves to this extent govern what 
kind of spacelike slices they will see as their exploration moves forward in time. 

The spacetime geometry that they slice, however, is in no way theirs to control or 
to change. To the extent that their masses are negligible and they serve merely as 
test objects, they have no influence whatsoever on the spacetime. It was fixed 
completely by the specification of the initial conditions for the collapse. In brief, 
spacetime is four-dimensional and slices are only three-dimensional (and in the — 
pictures here look only two-dimensional or one-dimensional). Any one set of slices 
captures only a one-sided view of the whole story. To see the entire picture one 
must either examine the dynamics of the geometry as it reveals itself in varied choices 
of the slicing or become accustomed to visualizing the spacetime geometry as a whole. 
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§32.5. SPHERICALLY SYMMETRIC COLLAPSE WITH 
INTERNAL PRESSURE FORCES 

So far as the external gravitational field is concerned, the only difference between Spherical collapse with 
a freely collapsing star and a collapsing, spherically symmetric star with internal P^ssure is qualitatively the 
pressure is this: that the surfaces of the two stars move along different world lines same as w,thout pressure 
in the exterior Schwarzschild geometry. Because the exterior geometry is the same 
in both cases, the qualitative aspects of free-fall collapse as described in the last section 
can be carried over directly to the case of nonnegligible internal pressure . 

An important and fascinating question to ask is this: can large internal pressures 
in any way prevent a collapsing star from being crushed to infinite density by infinite 
tidal gravitational forces? From the Kruskal-Szekeres diagram of Figure 32.1,b, it 
is evident that, once a star has passed inside its gravitational radius (R < 2M), no 
internal pressures, regardless of how large they may be, can prevent the star’s surface 
from being crushed in a singularity. The surface must move along a time-like world 
line, and all such world lines inside r = 2 M hit r = 0. Although there is no such 
theorem now available, one can reasonably conjecture that, if the surface of a 
spherical configuration is crushed in the r = 0 singularity, the entire interior must 
also be crushed. 

The details of the interior dynamics of a spherically symmetric collapsing star 
with pressure are not so well-understood as the exterior Schwarzschild dynamics. 

However, major advances in one’s understanding of the interior dynamics are now 
being made by means of numerical computations and analytic analyses [see Misner 
(1969a) for a review]. In these computations and analyses, no new features (at least, 
no unexpected ones) have been encountered beyond those that occurred in the simple 
uniform-density, free-fall collapse of the last section. 


Exercise 32.6. GENERAL SPHERICAL COLLAPSE: METRIC EXERCISES 

IN COMOVING COORDINATES 

Consider an inhomogeneous star with pressure, undergoing spherical collapse. Spherical 
symmetry alone is enough to guarantee the existence of a Schwarzschild coordinate system 
(t, r, 6, <p) throughout the interior and exterior of the star [see equation (32.2) and preceding 
discussion]. Label each spherical shell of the star by a parameter a , which tells how many 
baryons are contained interior to that shell. Then r(a, t) is the world line of the shell with 
label a. The expression for these world lines can be inverted to obtain a(t , r), the number 
of baryons interior to radius r at time t. Show that there exists a new time coordinate ?(/, r), 
such that the line element (32.2), rewritten in the coordinates (7, a, 0,0), has the form 

ds~ = -e 2 * dt 2 + + r 2 dS2 2 , (32.19a) 

$ — <7(7, a ), r = r(t, a ), r — F(7, a). (32.19b) 

These are “comoving, synchronous coordinates” for the stellar interior. 
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Exercise 32.7. ADIABATIC SPHERICAL COLLAPSE: EQUATIONS 
OF EVOLUTION [Misner and Sharp (1964)] 

Describe the interior of a collapsing star by the comoving, synchronous metric (32.19), by 
the number density of baryons h, by the total density of mass-energy p, and by the pressure 
p. The 4-velocity of the star’s fluid is 

u — e~* c/dt. (32.20) 

since the fluid is at rest in the coordinate system. Let a dot denote a proper time derivative 
as seen by the fluid—e.g., 

h = u[n] — e~*(dn/dt) a ; 

and let a prime denote a partial derivative with respect to baryon number,—e.g. 

n' =E (dn/da)j. 

Denote by U the rate of change of (1/2 it) X (circumference of shell), as measured by a 
man riding in a given shell: 

U = h (32.21a) 

and denote by m(t ‘ a) the “total mass-energy interior to shell a at time 7: 

m(t, a) = f 4 t7t 2 p(/, ci)r' da. (32.21b) 

J o 

(See Box 23.1 for discussion of this method of localizing mass-energy.) Assume that the 
collapse is adiabatic (no energy flow between adjacent shells; stress-energy tensor entirely 
that of a perfect fluid). 

(a) Show that the equations of collapse [baryon conservation, (22.3); local energy conser¬ 
vation, (22.11a); Euler equation, (22.13); and Einstein field equations (ex. 14.16)] can be 
reduced to the following eight equations for the eight functions <?, f, r, n , p, U, m\ 


r = U 

(dynamic equation for r); 

(32.22a) 

(nr 2 )’ _ U’ 
nr 2 r' 

(dynamic equation for n); 

(32.22b) 

p except at a shock front, where adiabaticity 

p _j_ p “ n ’ breaks down (dynamic equation for p); 

(32.22c) 

r 2 p' m + Amr^p 

P + P r' r 2 

(dynamic equation for U)\ 

(32.22d) 

p = P(n, p) 

(equation of state); 

(32.22e) 

$' — —p'/(p + /?), $ = 0 at star’s surface 

(source equation for <?); 

(32.22f) 

m ' = 4rrr 2 pr', m = 0 at a = 0, 

(source equation for m); 

(32.22g) 

r = sign (r')(l + U 2 - 2 m/r ) 1/2 

(algebraic equation for F). 

(32.22h) 


(b) The preceding equations are in a form useful for numerical calculations. [For particular 
numerical solutions and for the handling of shocks, see May and White (1966).] For analytic 
work it is often useful to replace (32.22b) by 


n = /Y(4tjtV), 


(32.22 b') 
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and (32.22d) by 


m — — 4vr 2 pU. 


(32.22d') 


Derive these equations. 

(c) Explain why equations (32.22g) and (32.22d / ) justify the remarks made in Box 23.1 
about localizability of energy. 

Exercise 32.8. ANALYTIC SOLUTIONS FOR PRESSURE-FREE COLLAPSE 
[Tolman (1934b); Datt (1938)] 

Show that the general solution to equations (3222) in the case of zero pressure can be 
generated as follows. 

(a) Specify the mass inside shell a , m{a)\ by equation (32.22d'), with p = 0, it will not 
change with time t. 

(b) Assume that all the dust particles have rest masses p that depend upon radius, p(a); 
so 

p = pn. (32.23a) 

(c) Calculate r from the equation 

r = m'/p\ (3223b) 


it will be independent of 7. 

(d) Specify an initial distribution of circumference Inr as function of a , and solve the 
dynamic equation 


/3r\ 2 2 m(a) 

\3// r 


r\a) - 1 


(32.23c) 


to obtain the subsequent evolution of r(t , a). Notice that this equation has identically the 
same form as in Newtonian theory! 

(e) Calculate the remaining quantities of interest from the algebraic equations 


ds 2 = -di 2 + (r' da/T) 2 + r 2 dS2 2 , (32.23d) 

p = pn = m'/(4irr 2 r')> (32.23e) 

£ = 0, U=dr/d7. (32.23f) 


[Note: In this solution, successive “shells” may pass through each other, producing a surface 
of infinite density as they do (r' —0 where m' ^ 0), since there is no pressure built up 
to stop shell crossing. When this happens, the coordinate system becomes pathological (a 
no longer increases monotonically outward), but spacetime remains well-behaved. The surface 
of infinite density (1) produces negligible tidal forces on neighboring dust particles; and (2) 
like the surface layers of §21.13, it is an idealization that gets smeared down to finite density 
by finite pressure.] 


Exercise 32.9. COLLAPSE WITH UNIFORM DENSITY 

Recover the Friedmann-Schwarzschild solution for collapse with uniform density and zero 
pressure by specifying appropriate forms for m(a) and r(a) in the prescription of exercise 
32.8. In the interior of the star, give the dust particles nonzero rest masses, p = constant ^ 0; 
in the exterior give them zero rest masses, p = 0 (“imaginary dust particles” in vacuum). 
Reduce the resulting metric (32.23d) to that of Friedmann inside the star, and to that of 
Novikov for the Schwarzschild geometry outside the star [equations (31.12)]. 



860 


32. GRAVITATIONAL COLLAPSE 


§32.6. THE FATE OF A MAN WHO FALLS INTO 
THE SINGULARITY AT r = 0 


The effect of tidal forces on 
the body of a man falling 
into the r = 0 singularity: 


Stage 1: body resists 
deformation; stresses build up 


Consider the plight of an experimental astrophysicist who stands on the surface of 
a freely falling star as it collapses to R — 0. 

As the collapse proceeds toward R = 0, the various parts of the astrophysicist’s 
body experience different gravitational forces. His feet, which are on the surface 
of the star, are attracted toward the star’s center by an infinitely mounting gravita¬ 
tional force; while his head, which is farther away, is accelerated downward by a 
somewhat smaller, though ever rising force. The difference between the two acceler¬ 
ations (tidal force) mounts higher and higher as the collapse proceeds, finally be¬ 
coming infinite as R reaches zero. The astrophysicist’s body, which cannot withstand 
such extreme forces, suffers unlimited stretching between head and foot as R drops 
to zero. 

But this is not all. Simultaneous with this head-to-foot stretching, the astrophysicist 
is pulled by the gravitational field into regions of spacetime with ever-decreasing 
circumferential area, 4 r nr l . In order to accomplish this, tidal gravitational forces must 
compress the astrophysicist on all sides as they stretch him from head to foot. The 
circumferential compression is actually more extreme than the longitudinal stretch¬ 
ing; so the astrophysicist, in the limit R —0, is crushed to zero volume and 
indefinitely extended length. 

The above discussion can be put on a mathematical footing as follows. 

There are three stages in the killing of the astrophysicist: (1) the early stage, when 
his body successfully resists the tidal forces; (2) the intermediate stage, when it is 
gradually succumbing; and (3) the final stage, when it has been completely over¬ 
whelmed. 

During the early stage, one can analyze the tidal forces by means of the equation 
of geodesic deviation, evaluated in the astrophysicist’s orthonormal frame w T , a/ p , 
uj4> ( see §31.2). In this frame, the nonvanishing components of the Riemann 
curvature tensor are given by equations (31.6): 


R rprp = -2M/r 3 , R- tM = R-rfri = M/r 3 , 

= 2M/r 3 , R Wi = R~m = -M/r 3 . 


(32.24a) 


The equation of geodesic deviation says that two freely moving particles, momentarily 
at rest in the astrophysicist’s local inertial frame, and separated by the 3-vector 

f = Ve h 

must accelerate apart with a relative acceleration given by 


D 2 V/dr 2 = -Rhkr^ = -Rfrir^ 
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Using the components (32.24a) of the curvature tensor, one sees that 

D 2 i h /dT 2 = + (2M/r 3 )^, 

DH § /dT 2 = -(M/r 3 )^, (32.24b) 

D 2 ^/dT 2 = -(M/r 3 )^. 

To apply these equations to the astrophysicist's body, idealize it (for simplicity) 
as a homogeneous rectangular box of mass jll ~ 165 pounds ~ 75 kg, of length 
l ~ 70 inches ^ 1.8 m in the e~ direction, and of width and depth w zz 10 inches 
~ 0.2 m in the e§ and e$ directions. Then calculate the stresses that must be set 
up in this idealized body to prevent its particles from moving along diverging (and 
converging) geodesics. 

From the form of equations (32.24), it is evident that the principal directions of 
the stress will be e~, e$, and e^ (i.e., in the e-, e§ 9 e^ basis, the stress tensor will 
be diagonal). The longitudinal component of the stress, at the astrophysicist’s center 
of mass, can be evaluated as follows. A volume element of his body with mass d\i, 
located at a height h above the center of mass (distance h measured along e- 
direction) would accelerate with a = (.2M/r 3 )h away from the center of mass, if it 
were allowed to move freely. To prevent this acceleration, the astrophysicist’s muscles 
must exert a force 

dF = a dii = (2 M/r 3 )h dy. 


This force contributes to the stress across the horizontal plane (e^ A e^ plane) 
through the center of mass. The total force across that plane is the sum of the forces 
on all mass elements above it (which is also equal to the sum of the forces on the 
mass elements below it): 




(region above plane) 




1 jtiAM 
~4 r 3 ' 


The stress is this force divided by the cross-sectional area w 2 , with a minus sign 
because it is a tension rather than a pressure: 


ipi5 d y nes 

4 w 2 r 3 ~ ’ ( r /l 1^3 cm 2 


(32.25 a) 


The components of the stress in the and e^ directions at the center of mass 
are, similarly. 


ni = 7«=+-[^s+0.7X 10« 


M/Mq dynes 
(r/1 km) 3 cm 2 


(32.25 b) 


(Recall that one atmosphere of pressure is 1.01 x 10 6 dynes/cm 2 .) 
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Stage 2: body gives way; 
man dies 


Stage 3: body gets crushed 
and distended 


Review of spherical collapse 


The human body cannot withstand a tension or pressure of >100 atmospheres 
~ 10 s dynes/cm 2 without breaking. Consequently, an astrophysicist on a freely 
collapsing star of one solar mass will be killed by tidal forces when the star’s radius 
is R ~ 200 km > 2 M tz, 3 km. 

By the time the star is much smaller than its gravitational radius, the baryons 
of the astrophysicist’s body are moving along geodesics; his muscles and bones have 
completely given way. In this final stage of collapse, the timelike geodesics are curves 
along which the Schwarzschild “time”-coordinate, t, is almost constant [cf the nar¬ 
rowing down of the light cones near r — 0 in Figure 32.1,a; also equation (31.2) in 
the limit r < 2 M]. The astrophysicist’s feet touch the star’s surface at one value of 
t —say t — t f —while his head moves along the curve t — t h > t f . Consequently, the 
length of the astrophysicist’s body increases according to the formula 

^astroph = [&,(*)] 1/2 fr ~ tf = [2M/R)^t h ~ tf 

« *” V2 « (^collapse - -)' 1/3 . (3 2 - 26a ) 

Here r = [-f R \gJ 1/2 dr + constant] is proper time as it would be measured by 
the astrophysicist if he were still alive, and r collapse is the time at which he hits r = 0. 
The gravitational field also constrains the baryons of the astrophysicist’s body to 
fall along world lines of constant 0 and <£ during the final stages of collapse. Conse¬ 
quently, his cross-sectional area decreases according to the law 

tfastroph = lgee(K)g^W] 1/2 MA<t>ccR 2 cc ( TcoUapse - t) 4 ' 3 . (32.26b) 

By combining equations (32.26a,b), one sees that the volume of the astrophysicist’s 
body decreases, during the last few moments of collapse, according to the law 

^'astroph = -tastroplAstroph 0C ^ “ ^collapse - T>. (32.26 c) 

This crushing of matter to infinite density by infinitely large tidal gravitational 
forces can occur not only on the surface of the collapsing star, but also at any other 
point along the r = 0 singularity outside the surface of the star. Hence, any foolish 
rocketeer who ventures below the radius r — 2M of the external gravitational field 
is doomed to destruction. 

For further discussion of spacetime singularities, and of the possibility that quan¬ 
tum gravitational effects might force a reconsideration of the singularities predicted 
by classical gravitation theory, see Chapter 30, §34.6, and Chapter 44. 


§32.7. REALISTIC GRAVITATIONAL COLLAPSE— 

AN OVERVIEW 

Instability, implosion, horizon, and singularity; these are the key stages in the 
spherical collapse of any star. Instability: The star, having exhausted its nuclear fuel, 
and having contracted slowly inward, begins to squeeze its pressure-sustaining 
electrons or photons onto its atomic nuclei; this softens the equation of state, which 
induces an instability [see, e.g., §§10.15 and 11.4 of Zel’dovich and Novikov (1971) 
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for details]. Implosion: Within a fraction of a second the instability develops into 
a full-scale implosion; for realistic density distributions, the stellar core falls rapidly 
inward on itself, and the outer envelopes trail along behind [see, e.g., the numerical 
calculations of Colgate and White (1966), Arnett (1966,1967), May and White (1966), 
and Ivanova, Imshennik, and Nadezhin (1969)]. Horizon: In the idealized spherical 
case, the star’s surface falls through its gravitational radius (“horizon”; end of 
communication with the exterior; point of no return). From the star’s vantage point 
this happens after a finite, short lapse of proper time. But from an external vantage 
point the star requires infinite time to reach the horizon, though it becomes black 
exponentially rapidly in the process [e-folding time — M— \0~ 5 (M/M Q ) sec]. The 
result is a “black hole”, whose boundary is the horizon (gravitational radius), and 
whose interior can never communicate with the exterior. Singularity: From the star’s 
interior vantage point, within a short proper time interval Jr — M — 10“ 5 (M/A/ o ) sec 
after passing through the horizon, a singularity is reached (zero radius, infinite 
density, infinite tidal gravitational forces). 

Does this basic picture—instability, implosion, horizon, singularity—have any 
relevance for real stars? Might complications such as rotation, nonsphericity, mag¬ 
netic fields, and neutrino fluxes alter the qualitative picture? No, not for small initial 
perturbations from sphericity. Perturbation theory analyses described in Box 32.2 
and exercise 32.10 show that realistic, almost-spherically symmetric collapse, like 
idealized collapse, is characterized by instability, implosion, horizon; and Penrose 
(1965b; see §34.6) proves that some type of singularity then follows. 

Highly nonspherical collapse is more poorly understood, of course. Nevertheless, 
a number of detailed calculations and precise theorems point with some confidence 
to two conclusions: (1) horizons (probably) form when and only when a mass M gets 
compacted into a region whose circumference in EVERY direction is 6 < 4tt M (Box 
32.3); (2) the external gravitational field of a horizon (black hole), after all the “dust” 
and gravitational waves have cleared away, is almost certainly the Kerr-Newman 
generalization of the Schwarzschild geometry (Chapter 33). If so, then the external 
field is determined uniquely by the mass, charge, and angular momentum that went 
“down the hole.” (This nearly proved theorem carries the colloquial title “A black 
hole has no hair.”) 

The interior of the horizon, and the endpoint (if any) of the collapse are very 
poorly understood today. The various possibilities will be reviewed in Chapter 34. 
That a singularity occurs one can state with much certainty, thanks to theorems of 
Penrose, Hawking, and Geroch. But whether all, only some, or none of the collapsing 
matter and fields ultimately encounter the singularity one does not know. 


Exercise 32.10. PRICE'S THEOREM FOR A SCALAR FIELD 

[See Price (1971, 1972a), also Thorne (1972), 
for more details than are presented here.] 

A collapsing spherical star, with an arbitrary nonspherical ‘‘scalar charge distribution,” 
generates an external scalar field 0. The vacuum field equation for 0 is D0 = 0. a a = 0. 
Ignore the back-reaction of the field’s stress-energy on the geometry of spacetime. 

(continued on page 868 ) 


Summary of 1 972 
knowledge about realistic, 
nonspherical collapse: 


(1) horizon 

(2) black hole 

(3) singularity 


EXERCISES 
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Box 32.2 COLLAPSE WITH SMALL NON-SPHERICAL PERTURBATIONS 

[based on detailed calculations by Richard H. Price (1971, 1972a,b)]. 


A. Density Perturbations 

1. When star begins to collapse, it possesses a 
small nonspherical “lump” in its density distri¬ 
bution. 

2. As collapse proceeds, lump grows larger and 
larger [instability of collapse against small per¬ 
turbations—a phenomenon well known in 
Newtonian theory; see, e.g., Hunter (1967); Lin, 
Mestel, and Shu (1965)]. 

3. The growing lump radiates gravitational waves. 

4. Waves of short wavelength (A < M), emitted 
from near horizon (r — 2 M < M\ partly prop¬ 
agate to infinity and partly get backscattered 
by the “background” Schwarzschild curvature 
of spacetime. Backscattered waves propagate 
into horizon (surface of black hole; gravita¬ 
tional radius) formed by collapsing star. 

5. Waves of long wavelength (A > M ), emitted 
from near horizon (r — 2 M < M\ get fully 
backscattered by spacetime curvature; they 
never reach out beyond r ~~ 3 M\ they end up 
propagating “down the hole.” 

6. Is lump on star still there as star plunges 
through horizon, and does star thereby create 
a deformed (lumpy) horizon? Yes, according to 
calculations. 

7. But external observers can only learn about 
existence of “final lump” by examining defor¬ 
mation (quadrupole moment) in final gravita¬ 
tional field. That final deformation in field does 
not and cannot propagate outward with infinite 
speed (no instantaneous “action at a distance”). 
It propagates with speed of light, in form of 
gravitational waves with near-infinite wave¬ 
length (infinite redshift from edge of horizon to 
any external radius). Deformation in final field, 
like any other wave of long wavelength, gets 
fully backscattered by curvature of spacetime 
at r < 3M; it cannot reach external observers. 
External observers can never learn of existence 



Collapse depicted in ingoing 
Eddington-Finkelstein coordinates 
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r~ 

of final lump. Final external field is perfectly 
spherical , lump-free, Schwarzschild geometry! 

8. Even in region of backscatter (2 M < r < 3AT), 
final external field is lump-free. Backscattered 
waves, carrying information about existence of 
final lump, interfere destructively with outgoing 
waves carrying same information. Result is de¬ 
struction of all deformation in external field 
and in horizon! 

9. Final black hole is a Schwarzschild black hole! 


B. Perturbations in Angular Momentum 

1. When star begins to collapse, it possesses a 
small, nonzero intrinsic angular momentum 
(“spin”) 5. 

2. As collapse proceeds, S is conserved (except for 
a tiny, negligible change due to angular mo¬ 
mentum carried off by waves; that change is 
proportional to square of amplitude of waves, 

i.e., to square of amplitude of perturbations in 
star, i.e., to S 2 ). 

3. Consequently, external field always and every¬ 
where carries imprint of angular momentum S 
(on imprints, see Chapter 19). There is no need 
for that imprint to propagate outward from 
near horizon. Moreover, it could not so propa¬ 
gate even if it tried, because of the conservation 
law for S (absence of dipole gravitational 
waves; see §§36.1 and 36.10). 

4. Hence, the final external field is that of an 
undeformed, slowly rotating black hole: 

ds 2 = -(l -ML\ d[ 2 + dr 2 +r 2 dB 2 
\ r / 1 — 2 M/r 

Schwarzschild geometry 



C. Perturbations in Electromagnetic Field 

1. Star possesses a magnetic field generated by 
currents in its interior, and an electric field due 
to an arbitrary internal charge distribution; and 
electromagnetic radiation is emitted by its hot 
matter. For simplicity, S is assumed zero. 

2. Evolution of external electromagnetic field is 
similar to evolution of perturbations in external 
gravitational field. Distant observer can never 
learn “final” values of changeable quantities 
(magnetic dipole moment, electric dipole mo¬ 
ment, quadrupole moments, . . .). Final values 
try to propagate out from horizon, carried by 
electromagnetic waves of near-infinite wave¬ 
length. But they cannot get out: spacetime cur¬ 
vature reflects them back down the hole; and 
they superpose destructively with their outgoing 
counterparts, to produce zero net field. 

3. By contrast with all other quantities, which are 
changeable, the electric monopole moment 
(total flux of electric field; equal to 477 times- 
total electric charge) is conserved. It never 
changes from before star collapses, through the. 
collapse stage, into the quiescent black-hole 
stage. 

4. Hence, the final external electromagnetic field 
is a spherically symmetric coulomb field 

E — ( Q/r 2 )e -] as measured by static 
B = 0 j observer (r, 9 , <f>, constant); 

and the final spacetime geometry is that of 
Reissner and Nordstrom (charged black hole; 
see exercises 31.8 and 32.1): 



/4Ssin0\ . .... 

— I- ^—\(r sin 8 d§) dt. 

rotational imprint, see exercise 
26.1; also Chapter 19. 

Here the polar axis has been oriented along S. 



+ 


_ drf_ _ 

(1 - 2 M/r + Q 2 /r 2 ) 


+ r 2 d£2 2 . 

J 
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Box 32.2 (continued) 

D. Generalization; Price's Theorem 

1. Let the star generate a “zero-rest-mass, integer- 
spin field.” [“Zero rest mass” refers to the 
quantized particles associated with the classical 
field. Classically it means the field has a Cou¬ 
lomb-law (1/r) fall off at large distances. The 
spin also is a property of the quantized parti¬ 
cles; classically it is most easily visualized as 
describing the symmetries of a monochromatic 
plane wave under rotations about the direction 
of propagation; see §35.6. A scalar field has 
spin zero; an electromagnetic field has spin one; 
Einstein’s gravitational field has spin two; .... 
Of such fields, only gravitational (s = 2) and 
electromagnetic (s = 1) are known to exist in 
the real universe. See, e.g., Dirac (1936), Gird¬ 
ing (1945), Bargmann and Wigner (1948), Pen¬ 
rose (1965a), for further discussion.] 

2. Let the spin-s field be sufficiently weak that its 
stress-energy perturbs the star’s external, 
Schwarzschild geometry only very slightly. 

3. Resolve the external field into spherical har¬ 
monics (scalar spherical harmonics for s = 0; 
vector spherical harmonics for s = 1; tensor 
spherical harmonics for 5 > 2); and label the 
spherical harmonics by the usual integer X (X = 
0 for monopole; 1 = 1 for dipole; X = 2 for 
quadrupole; etc.). 

4. All multipole fields with X < 5 are conserved 
during the collapse (theorem from classical ra¬ 
diation theory). A scalar field (s = 0) conserves 


nothing. The electromagnetic field (s = 1) con¬ 
serves only its monopole parts (electric Cou¬ 
lomb field, and vanishing magnetic Coulomb 
field). The gravitational field (s = 2) conserves 
its monopole part (with imprint equal to mass), 
and its dipole parts (with imprints measuring 
the angular momentum, and the standard grav¬ 
itational dipole moment—which vanishes if 
coordinate system is centered on star). 

5. For X > s, and only for X > s, radiation is pos¬ 
sible (scalar waves can have any multipolarity; 
electromagnetic waves must be dipole and 
higher; gravitational waves must be quadrupole 
and higher; see §36.1). 

6. Price’s theorem states that, as the nearly spheri¬ 
cal star collapses to form a black hole, all things 
that can be radiated (all multipoles X > s) get 
radiated completely away—in part “off to in¬ 
finity”; in part “down the hole” (“what is per¬ 
mitted is compulsory”). The final field is char¬ 
acterized completely by its conserved quantities 
(multipole moments with X < 5 ). 

7. For proof of Price’s theorem in the case of a 
scalar field, see exercise 32.10. 


E. Generalization to Nonclassical Fields 

See Hartle (1971, 1972) and Teitelboim (1972b,c) 
for neutrino fields; Bekenstein (1972a,b) and 
Teitelboim (1972a) for pion fields. 
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Box 32.3 COLLAPSE IN ONE AND TWO DIMENSIONS 


A. The Question 

To produce a black hole (horizon from which 
nothing can emerge), must one compact matter 
strongly in all three spatial dimensions, to circum¬ 
ferences G < 47 tM( quasispherical compaction); or 
is it sufficient to compact only in one or two di¬ 
mensions? 


B. The Answer for One Dimension 

Consider, as an example readily generalized, the 
gravitational collapse of a spheroid of dust (zero 
pressure). Let the spheroid be highly Newtonian 
(r »> 2M) in its initial, momentary state of rest; 
and let it be slightly flattened (oblate). In Newto¬ 
nian theory, any homogeneous, nonrotating 
spheroid of dust remains homogeneous as it col¬ 
lapses; but its deformations grow [see, e.g., Lin, 
Mestel, and Shu (1965) for details]. Hence, the 
spheroid of interest implodes to form a pancake 
of infinite density but finite mass per unit surface 
area. The final kinetic energy of the dust particles 
is roughly equal to their final potential energy: 

V—«_ 

2 ((?/2») 

M =s mass of spheroid, 

6 = circumference of final pancake. 

Consequently, so long as <?/2w > 2 M, the collapse 
velocities remain much smaller than light, and the 
gravitational energy remains much smaller than 
the rest mass-energy. This means that for 6/In > 
2M, the Newtonian analysis is an excellent ap¬ 
proximation to general relativity all the way down 
to the pancake endpoint. Hence, no horizon can 
form, hardly any gravitational waves are emitted, 
and the whole story is exceedingly simple and fully 
Newtonian. However, since the pancake endpoint 
is not a singularity of spacetime (see the remarks 
at end of exercise 32.8), the evolution can proceed 
beyond it; and as (? contracts to <4wM, the evolu¬ 


tion will become very complicated and highly rel¬ 
ativistic (see the “collapse, pursuit, and plunge 
scenario” of Figure 24.3). 


C. The Answer for Two Dimensions 

Consider, as an example not so readily general¬ 
ized, the gravitational collapse of a homogeneous 
prolate spheroid of dust, initially highly Newto¬ 
nian. Such a spheroid collapses to form a thin 
“thread” or “spindle” [see Lin, Mestel, and Shu 
(1965)]. Assume that the spheroid is still Newto¬ 
nian when its threadlike state is reached. It 
then has a length l, a mass per unit length 
X = M/l <4: 1, and a rapidly contracting equato¬ 
rial radius R i. Subsequently, each segment of 
the thread collapses radially as though it were part 
of an infinite cylinder. [Ignore the instability of 
breakup into “beads”; see, e.g.. Hunter (1967), 
Chandrasekhar (1968).] The radial collapse veloc- 
ity approaches the speed of light and the gravita¬ 
tional energy approaches the rest mass-energy 
only when the thread has become exceedingly 
thin, R < ~ i exp (— 1/4A). At this stage, 

relativistic deviations from Newtonian collapse 
come into play. Thorne (1972) and Morgan and 
Thorne (1973) have analyzed the relativistic effects 
using an idealized infinite-cylinder model. The 
results are very different from either the spherical 
case or the pancake case. The collapsing cylinder 
emits a large flux of gravitational waves; but they 
are powerless to halt the collapse. The collapse 
proceeds inward to a thread-like singularity, without 
the creation of any horizon (no black hole!). 

D. Objection to the Answer, a Reply, 
and a Conjecture 

One can object that the collapses of both pancake 
and cylinder can be halted short of their endpoints, 
especially that of the pancake. As the thickness of 
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r 

Box 32.3 (continued) 

the pancake approaches zero, the vertical pull of 
gravity remains finite, but the pressure gradient 
caused by any finite pressure goes to infinity. 
Hence, pressure halts the collapse. Subsequently 
the rim of the pancake contracts toward the relativ¬ 
istic regime G/2tt < 2M. In the collapse of a 
cylinder according to Newtonian theory, with a 
pressure-density relation p p 7 , the gravitational 
acceleration a g and pressure-buoyancy accelera¬ 
tion a p vary as 

S = -2A /R, a p - p~\p/R ) oc p Y- i/r. 

Hence, for y > 1 (the most common realistic case) 
pressure halts the collapse, but for y < 1 it does 


not. Whether this is true also after the relativistic 
domain is reached, one does not yet know. 

Actually, the ability of pressure to halt the col¬ 
lapse is of no importance to the issue of black holes 
and horizons. The important thing is that in oblate 
collapse with final circumference 6 > 4ttM, and 
also in prolate collapse with final thread length 
1 > 2M, no horizons are created . This, coupled 
with the omnipresent horizons in nearly spherical 
collapse (Box 32.2) suggests the following conjec¬ 
ture [Thorne (1972)]: Black holes with horizons 
form when and only when a mass M gets compacted 
into a region whose circumference in EVERY direc¬ 
tion is G < 4'nM. (Like most conjectures, this one 
is sufficiently vague to leave room for many differ¬ 
ent mathematical formulations!) 






(a) Resolve the external field into scalar spherical harmonics, using Schwarzschild coor¬ 
dinates for the external Schwarzschild geometry: 


I r 

Show that the vacuum field equation reduces to 

_ _ /, 2M\/2M X(X+1)\ T 

“V* + *t,rV - ( 1 - —J\— + ^2 L* 

where r* is the “tortoise coordinate” of §25.5 and Figure 25.4: 

r* ~ r + 2M In (r/2M — 1). 


(32.27a) 


(32.27b) 


(32.27c) 


Notice that (32.27b) is a flat-space, one-dimensional wave equation with effective potential 




(32.27d) 


Part of this effective potential [X(X H- 1 )/r 2 ] is the “centrifugal barrier,” and part [2M/r] is 
due to the curvature of spacetime. Notice the similarity of this effective potential for scalar 
waves, to the effective potentials for particles and photons moving in the Schwarzschild 
geometry, 

(PVuoes = (1 - 2M/r)(l + I 2 //- 2 ), 

(5- 2 ) photons = (1 - 2 M/r)r~ 2 


(Boxes 25.6 and 25.7). The scalar-wave potential, like the photon potential, is positive for 
all r > 2 M. It rises, from 0 at r = 2 M, to a barrier summit; then falls back to 0 at r = oo. 
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(b) Show that there exist no physically acceptable , static scalar-wave perturbations of a 
Schwarzschild black hole. [More precisely, show that all static solutions to equation (32.27b) 
become infinite at either the horizon (r = 2 M, r* = — oo) or at radial infinity.] This suggests 
that somehow the black hole formed by collapse must divest itself of the star’s external scalar 
field before it can settle down into a quiescent state. 

(c) The general solution to the wave equation (32.27b) can be written in terms of a Fourier 
transform. For waves that begin near the horizon, propagate outward, and are partially 
transmitted and partially reflected (“rightward-propagating waves”), show that the general 
solution is 


* x (/, r*) = f A(k)R l k (r*)e~ m dk, (32.28a) 

where 

<PRl/dr * 2 = [- k 2 + V en (r)]Rl (32.28b) 

Al - e ikr * + as r* — -oo, 

R{ = rgV fcr * as r* —^ oo. (32.28c) 

Show that the “reflection and transmission coefficients for rightward-propagating waves,” 
and Tp, have the following asymptotic forms for \k\ < 1/M (short wave number; long 
wavelength): 

- _i + alMik, 


'produces complete reflection and complete 1 
destructive interference in limit k —0; 
see Box 32.2 for detailed discussion of 
consequences _ J 

where a and fi are constants of order unity. Give a similar analysis for waves that impinge 
on a Schwarzschild black hole from outside (“leftward-propagating waves”). 

(d) Show that, as the star collapses into the horizon, the world line of its surface in (r, r*) 
coordinates is 


TT = (21 _ 1)!! +1 ( 32 28d) 


'no transmission 1 

in limit k —0; see 
t Box 32,2 _, 


r* = R*(t) ~-t- R 0 * exp ( — t/2M) + const., (32.29a) 

where R 0 * is related to the magnitude a of the surface’s 4-acceleration (a > 0 for outward 
4-acceleration) by 


(32.29b) 

Thus, the world line of the surface appears to become null near the horizon (/ + r* == V = 
constant); of course, this is due to pathology of the coordinate system there. Show, further, 
that the scalar field on the star’s surface (V — constant) must vary as 

= Q l0 + Qne- U/4 ", U = t- r*. (32.29c) 

when the star is approaching the horizon (/ —*- oo, r* —>- — oo, U —*- oo), in order that 
the rate of change of 'k t be finite as measured on the star’s surface. Notice that 0( O is the 
“final value” of the scalar field on the star’s surface. It can be regarded as an outgoing wave 
with zero wave number (infinite wavelength); and, consequently, it gets completely and 


R* = (8M/e)|l + 1 6Ma[Ma + ^M 2 a 2 + jj 
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destructively reflected by the effective potential [see equation (32.28d): also Box 32.2]. Conclu¬ 
sion: All multipoles of the scalar field die out at finite r* as t —oo. (Price’s theorem for 
a scalar field.) For a more detailed analysis, including the rates at which the multipoles die 
out. see Price (1971, 1972a) or Thorne (1972). 

Exercise 32.11. NEWMAN-PENROSE "CONSTANTS" 

[See Press and Bardeen (1971), Bardeen and Press (1972), 
and Piir (1971) for more details than are presented here.] 

Wheeler (1955) showed that Maxwell’s equations for an X-pole electromagnetic field residing 
in the Schwarzschild geometry can be reduced to the wave equation 


T T /l 

-*i,« + *i.rV =(1 ~ — 


2M\ia + 1 ) 


r 


*i 


(32.30) 


[electromagnetic analogue of (32.27b)]. After this equation has been solved, the components 
of the electromagnetic field can be obtained by applying certain differential operators to 

*1 (unrjfiA). 

(a) Show that the general solution to the electromagnetic wave equation (32.30) for dipole 
(X = 1) fields, with outgoing-wave boundary conditions at r* —-f oo, has the form 


*i=/o(&)* 


A(U) , MO) 


( 32 : 31 §) 


where 


fi fo’ fz — 


U = t — r* is “retarded time”, and 




(n + l)(w - 2) 
In 


•/„-!+ (w-2)M/ b _ 2 . (32.31b) 


When spacetime is flat (M — 0), this solution becomes 

= f x '(V) + A(D)/r. (32.3 IN) 

[The 1/r fall-off of the radiation field//((/) has been factored out of see the scalar-wave 
function (32.27a).] The terms + • • • > which are absent in fiat spacetime, are 

attributable to backscatter of the outgoing waves by the curvature of spacetime. They are 
sometimes called the “tail” of the waves. 

(b) Show that the general static dipole field has the form (32.31a) with 


(/o)stattc = °; (/i)stattc = D = dipole moment; 
UzWic - jMD; ... . 


(32.32) 


(c) Consider a star (not a black hole!) with a dipole field that is initially static. At time 
t = 0, let the star suddenly change its dipole moment to a new static value D f . Equations 
(32.31b) demand that f 2 be conserved [“Newman-Penrose {1965) constant”]. Hence, f 2 will 
always exhibit a value, %MD, corresponding to the old dipole moment; it can never change 
to \MD’. This is a manifestation of the tail of the waves that are generated by the sudden 
change in dipole moment. To understand this tail effect more clearly, and to discover an 
important flaw in the above result, evaluate the solution (32.31) for retarded time U > 0, 
using the assumptions 

(1) field has static form (32.32) for U < 0, 

(32.33) 
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Put the answer in the form 



r 2 


+ s 


2M(D' - D)(-l) n + 1 (n + 1 )U n ~ 2 

(2 r) n 


+ O 


(M 2 M 2 U\ 

\ r 3 r 4 /* 


(32.34) 


(The terms neglected are small compared to those kept for all U/r t so long as r > M.) 
Evidently, so long as the series converges the Newman-Penrose “constant” (coefficient of 
l/r 2 ) remembers the old D value and is conserved, as claimed above. Show, however, that 
the series diverges for U > 2r—i.e., it diverges inside a sphere that moves outward with 
asymptotically § the speed of light. Thus, the Newman-Penrose “ constant ” is well-defined and 
conserved only outside the “\-speed-of-light cone.” 

(d) Sum the series in (32.34) to obtain a solution valid even for U > 2r: 


3 MD' 2Mfjy - D) (U + 3 r) (M 2 

r 2 r 2 r (f7+2r) 2 + W 3 

new static “tail term” 

solution 


= the series (32.34) for U < 2r (domain of convergence of that series) 


= 3 MD' + M 2 \ 

r 2 r 2 \ Ur' r 3 / 


for U > r > M. 


(32.35) 


From this result conclude that at fixed r and late times the electromagnetic field becomes 
asymptotically static , and the Newman-Penrose “ constant ” assumes the new value \MD' 
appropriate to the new dipole moment . 
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33 


BLACK HOLES 


A luminous star, of the same density as the Earth, and whose diameter should be 
two hundred and fifty times larger than that of the Sun, would not, m 
consequence of its attraction, allow any of its rays to arrive at us; it is therefore 
possible that the largest luminous bodies in the universe may, through this cause, 

be invisible. 

P. S. LAPLACE (1798) 


V 


A dialog explaining why 
black holes deserve their 
name 


§33.1. WHY ' BLACK HOLE"? 

Sagredus. What is all this talk about “black holes”? When an external observer 
watches a star collapse, he sees it implode with ever-increasing speed, until 
the relativistic stage is reached. Then it appears to slow down and become 
“frozen,” just outside its horizon (gravitational radius). However long the 
observer waits, he never sees the star proceed further. How can one reasonably 
give the name “black hole” to such a frozen object, which never disappears 
from sight? 

Salvatius. Let us take the name “black hole” apart. Consider first the blackness. 
Surely nothing can be blacker than a black hole. The very redshift that makes 
the collapsing star appear to freeze also makes it darken and become black. 
In the continuum approximation, where one ignores the discreteness of 
photons, the intensity of the radiation received by distant observers decreases 
exponentially as time passes, L oc exp ( — 1/3 \/3M), with an exceedingly short 
e-folding time 

r = 3 y/3M = (2.6 X lO' 5 sec ){M/M 0 ). 

Within a fraction of a second, the star is essentially black. Discreteness of 
photons makes it even blacker. The number of photons emitted before the 
star crosses its horizon is finite, so the exponential decay cannot continue 

For a more detailed exposition of the foundations of “black-hole physics,” see DeWitt and DeWitt 

(1973). 


§33.1. WHY THE NAME "BLACK HOLE"? 


873 


forever. Eventually—only 10" 3 (A//A/ o ) seconds after the star begins to dim 
(see exercise 32.2)—the last photon that will ever get out reaches the distant 
observers. Thereafter nothing emerges. The star is not merely “essentially 
black”; it is “absolutely black” 

Sagredus. Agreed. But it is the word “hole” that concerns me, not “black.” How 
can one possibly regard the name “hole” as appropriate for an object that 
hovers forever just outside its horizon. True, absence of light makes the object 
invisible. But couldn’t one always see it by shining a flashlight onto its surface? 
And couldn’t one always fly down to its surface in a rocket ship and scoop 
up a few of the star’s baryons? After all, as seen from outside the baryons 
at its surface will never, never, never manage to fall into the horizon! 

Salvatius. Your argument sounds persuasive. To test its validity, examine the collapse 
of a spherically symmetric system, using the ingoing Eddington-Finkelstein 
diagram of Figure 33.1. Let a family of external observers shine their flash¬ 
lights onto the star’s surface, as you have suggested. Let the surface of the 
star be carefully silvered so it reflects back all light that reaches it. Initially 
(low down in the spacetime diagram of Figure 33.1) the ingoing light beams 


v 



Figure 33.1. 

Spherical gravitational collapse of a star to form a black hole, as viewed in ingoing 
Eddington-Finkelstein coordinates. The “surface of last influence,” is an ingoing null 
surface that intersects the horizon in coincidence with the surface of the collapsing star. 
After an external observer, moving forward in time, has passed through the surface of 
last influence, he cannot interact with and influence the star before it plunges through 
the horizon. Thus, one can think of the surface of last influence as the “birthpoint” of 
the black hole. Before passing through this surface, the external observer can say his 
flashlight is probing the shape of a collapsing star; afterwards, he can regard his signals 
as probes of a black hole. For further discussion, see text. 
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reach the star’s surface and are reflected back to the flashlights with no 
difficulty. But there is a critical point—an ingoing radial null surface X— 
beyond which reflection is impossible. Photons emitted inward along X reach 
the star just as it is passing through its horizon. After reflection these photons 
.fly “outward” along the horizon, remaining forever at r = 2M. Other photons, 
emitted inward after the flashlight has passed through X, reach the surface 
of the star and are reflected only after the star is inside its horizon. Such 
photons can never return to the shining flashlights. Once inside the horizon, 
they can never escape. Thus, the total number of photons returned is finite 
and is subject to the same blackness decay law as is the intrinsic luminosity 
of the star. Moreover, if the observers do not turn on their flashlights until 
after they pass through the null surface X , they can never receive back any reflec¬ 
ted photons! Evidently, flashlights are of no help in seeing the “frozen star.” 

Sagredus. I cannot escape the logic of your argument. Nevertheless, seeing is not 
the only means of interacting with the frozen star. I have already suggested 
swooping down in a rocket ship and stealing a few baryons from its surface. 
Similarly, one might let matter fall radially inward onto the frozen star. When 
the matter hits the star’s surface, its great kinetic energy of infall will be 
converted into heat and into outpouring radiation. 

Salvatius. Thus it might seem at first sight. But examine again Figure 33.1. No 
swooping rocket ship and no infalling matter can move inward faster than 
a light ray. Thus, if the decision to swoop is made after the ship passes through 
the surface X , the rocket ship has no possibility of reaching the star before 
it plunges through the horizon; the rocket and pilot cannot touch the star, 
sweep up baryons, and return to tell their tale. Similarly, infalling matter 
to the future of X can never hit the star’s surface before passing through the 
horizon. The surface X is, in effect, a “surface of last influence.” Once anybody 
or anything has passed through X, he or it has no possibility whatever of 
influencing or interacting with the star in any way before it plunges through 
the horizon. Thus , from a “causal” or “interaction” standpoint, the collapsing 
star becomes a hole in space at the surface X. This hole is not black at first. 
Radiation from the collapsing star still emerges after X because of finite 
light-propagation times, just as radiation still reaches us today from the hot 
big-bang explosion of the universe. But if an observer at radius r > 2 M waits 
for a time 2 r after passing through X (time for X to reach horizon, plus time 
for rays emitted at R ~~ 3 M to get back to observer), then he will see the 
newly formed hole begin to turn black; and within a time At ~~ (10 -3 
seconds)(A//A/ 0 ) thereafter, it will be completely black. 

Sagredus. You have convinced me. For all practical purposes the phrase “black hole” 
is an excellent description. The alternative phrases “frozen star” and “col¬ 
lapsed star,” which I find in the pre-1969 physics literature, emphasize an 
optical-illusion aspect of the phenomenon. Attention must be directed away 
from the star that created the black hole, because beyond the surface of last 
influence one has no means to interact with that star. The star is irrelevant 
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to the subsequent physics and astrophysics. Only the horizon and its external 
spacetime geometry are relevant for the future. Let us agree to call that 
horizon the “surface of a black hole,” and its external geometry the “gravita¬ 
tional field of the black hole.” 

Salvatius. Agreed. 


§33.2. THE GRAVITATIONAL AND ELECTROMAGNETIC 
FIELDS OF A BLACK HOLE 


The collapse of an electrically neutral star endowed with spherical symmetry pro¬ 
duces a spherical black hole with external gravitational field described by the 
Schwarzschild line element 


ds 2 = -(1 - 2 M/r)dt 2 + 


dr 2 

1 - 2 M/r 


+ r 2 (dd 2 -f sin 2 6 d<j> 2 ). 


(33.1) 


The surface of the black hole, i.e., the horizon, is located at r — 2M — (gravitational 
radius). Only the region on and outside the black hole’s surface, r > 2M, is relevant 
to external observers. Events inside the horizon can never influence the exterior. 

The gravitational collapse of a realistic star (nonspherical, collapse with small but 
nonzero net charge of one sign or the other) produces a black hole somewhat different 
from the simple Schwarzschild hole. For collapse with small charge and small 
asymmetries, perturbation-theory calculations (Box 32.2) predict a final black hole 
with external field determined entirely by the mass M, charge Q , and intrinsic angular 
momentum S of the collapsing star. For fully relativistic collapse, with large asym¬ 
metries and possibly a large charge, the final black hole (if one forms) is also 
characterized uniquely by M, Q , and S. This is the conclusion that strongly suggests 
itself in 1972 from a set of powerful theorems described in Box 33.1. 

Why M, Q , and S should be the complete governors of the final external field 
of the black hole, one can understand heuristically as follows. Of all quantities 
intrinsic to any isolated source of gravity and electromagnetism, only M, Q , and 
S possess (and are defined in terms of) unique , conserved imprints in the distant 
external fields of the source (conserved Gaussian flux integrals; see Box 19.1 and 
§20.2). When a star collapses to form a black hole, its distant external fields are 
forced vy maintain unchanged the imprints of M, Q, and S. In effect, M, Q, and 
5 provide anchors or constraints on the forms of the fields. Initially other constraints 
are produced by the distributions of mass, momentum, stress, charge, and current 
inside the star. But ultimately the star plunges through a horizon, cutting itself off 
causally from the external universe. (The nonpropagation of long-wavelength waves 
through curved spacetime plays a key role in this cutoff; see Box 32.2.) Subsequently, 
the only anchors remaining for the external fields are the conserved imprints of M, 
Q, and S. Consequently, the external fields quickly settle down into unique shapes 
corresponding to the given M , Q , and S. Of course, the settling down involves 
dynamic changes of the fields and an associated outflow of gravitational and electro- 


The structure of a black hole 
is determined uniquely by its 
mass M, charge .Q, and 
intrinsic angular momentum, S 


Heuristic explanation of the 
M-Q-S uniqueness 
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Box 33.1 A BLACK HOLE HAS NO "HAIR" 


The following theorems come close to proving that 
the external gravitational and electromagnetic fields 
of a stationary black hole (a black hole that has 
settled down into its “final” state) are determined 
uniquely by the hole's mass M, charge Q , and intrin¬ 
sic angular momentum S —i.e., the black hole can 
have no “hair” (no other independent characteris¬ 
tics). For a detailed review, see Carter (1973). 


I. Stephen Hawking (1971b, 1972a): A station¬ 
ary black hole must have a horizon with 
spherical topology; and it must either be static 
(zero angular momentum), or axially sym¬ 
metric, or both. 

II. Werner Israel (1967a, 1968): Any static black 
hole with event horizon of spherical topology 
has external fields determined uniquely by its 
mass M and charge Q; moreover, those exter¬ 
nal fields are the Schwarzschild solution if 
<2 = 0, and the Reissner-Nordstrom solution 
(exercises 31.8 and 32.1) if Q 0 (both special 
cases of Kerr-Newman; see §33.2). 

III. Brandon Carter (1970): “All uncharged, sta¬ 
tionary, axially symmetric black holes with 
event horizons of spherical topology fall into 
disjoint families not deformable into each 
other. The black holes in each family have 
external gravitational fields determined 
uniquely by two parameters: the mass M and 
the angular momentum S .” (Note: the “Kerr 
solutions”—i.e., “Kerr-Newman” with Q — 0 
—form one such family; it is very likely 
that there are no others, but this has not been 
proved as of December 1972. It is also likely 
that Carter’s theorem can be extended to the 
case with charge; but this has also not yet 
been done.) 


Conclusions made by combining all three the¬ 
orems: 

(a) All stationary black holes are axially sym¬ 
metric. 

(b) All static (nonrotating) black holes are 
characterized uniquely by M and Q , and 
h<jve the Reissner-Nordstrom form. 

(c) All uncharged, rotating black holes fall 
into distinct and disjoint families, with 
each black hole in a given family charac¬ 
terized uniquely by M and S. The Kerr 
solutions form one such family. There 
may well be no other family. 

V. Remarks and Caveats: 

(a) The above statements of the theorems are 
all somewhat heuristic. Each theorem 
makes several highly technical assump¬ 
tions, not stated here, about the global 
properties of spacetime. These assump¬ 
tions seem physically reasonable and in¬ 
nocuous, but they might not be. 

(b) Progress in black-hole physics is so rapid 
that, by the time this book is published, 
there may well exist theorems more pow¬ 
erful than the above, which really prove 
that “a black hole has no hair.” 

(c) For insight into the techniques of “global 
geometry” used in proving the above the¬ 
orems and others like them, see Chapter 
34; for greater detail see the forthcoming 
book by Hawking and Ellis (1973). 

(d) For analyses which show that a black hole 
cannot exert any weak-interaction forces 
caused by the leptons which have gone 
down it, see Hartle (1971, 1972) and Tei- 
telboim (1972b,c). For similar analyses 
which show absence of strong-interaction 
forces from baryons that have gone down 
the hole, see Bekenstein (1972a,b) and 
Teitelboim (1972a). 
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magnetic waves. And, of course, the outflowing waves carry off mass and angular 
momentum (but not charge), thereby leaving M and S changed. And, of course, 
the external fields must then readjust themselves to the new M and S. But the process 
will quickly converge, producing a black hole with specific final values of M, Q , 
and S and with external fields determined uniquely by those values. 

The problem of calculating the external fields for given Af, Q , and S and then- 
given imprints, is analogous to the problem of Plateau—to calculate the shape of 
a soap film anchored to a wire of given shape.* One calculates the shape of the 
soap film by seeking a surface of minimum area spanning the bent wire. The 
condition of minimum area leads to a differential equation describing the soap film, 
which must be solved subject to the constraint imposed by the shape of the wire. 

To calculate the external fields of a black hole, one can extremize the “action 
integra 1”/($ 4- £)^—gd 4 x for interacting gravitational and electromagnetic fields 
(see Chapter 21) subject to the anchored-down imprints of Af, Q, and S at radial 
infinity, and subject to the existence of a physically nonsingular horizon (no infinite 
curvature at horizon!). Extremizing the action is equivalent to solving the coupled 
Einstein-Max well field equations subject to the constraints imprinted by M, Q , and 
S, and the existence of the horizon. The derivation of the solution and the proof 
of its uniqueness are much too complex to be given here. (See references cited in 
Box 33.1.) However, the solution turns out to be the “Kerr-Newman geometry” and 
its associated electromagnetic field.f 

Written in the t, r, #, <f> coordinates of Boyer and Lindquist (1967) (generalization 
of Schwarzschild coordinates), the Kerr-Newman geometry has the form 


Variational principle for 
black-hole structure 


Details of black-hole structure: 


ds 2 = —^ [dt — a sin 2 # oty] 2 4- s * n ,- 
P 2 P“ 

+ ~ dr 2 + p 2 dO 2 , 


[(r 2 -f a 2 ) d(j> — a dt] 2 


(33.2) 


(1) metric ("Kerr-Newman 
geometry") 


where 

A = r 2 - 2Mr + a 2 + 0 2 , (33.3a) 

p 2 = r 2 + a 2 cos 2 #, (33.3b) 

a = S/M= angular momentum per unit mass. (33.4) 

The corresponding electromagnetic field tensor, written as a 2-form (recall: dx a A 
dx& = dx a <8> dx& — dx& ® dx a ) is 


F — Qp 4 (r 2 — a 2 cos 2 #) dr A [dt — a sin 2 # d§\ 

4- 2 Qp~ 4 ar cos # sin # dO A [(r 2 4- a 2 ) d§ — a dt]. 


(33 5) ^ electromagnetic field 


*On the problem of Plateau see, e.g., Courant (1937), Darboux (1941), or p. 157 of Lipman Bers 
(1952). 

fThe uncharged {Q = 0) version was first found as a solution to Einstein’s vacuum field equations 
by Kerr (1963). The charged generalization was first found as a solution to the Einstein-Maxwell field 
equations by Newman, Couch, Chinnapared, Exton, Prakash, and Torrence (1965). Only later was the 
connection to black holes discovered; see Box 33.1. 
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Expressions (33.2) for the metric and (33.5) for the electromagnetic field are 
sufficiently long to be somewhat frightening. Therefore, it is helpful to develop some 
qualitative insight into them and into their implications before attempting detailed 
computations with them. Boxes 33.2, 33.3, and 33.4 develop qualitative insight by 
presenting, without derivation, a summary of the key features of the Kerr-Newman 
geometry and a summary of the physics and astrophysics of black holes. The re¬ 
mainder of this chapter is a Track-2 justification and derivation of some, but not 
all, of the results cited in Boxes 33.2-33.4. 

{continued on page 891 ) 

Box 33.2 KERR-NEWMAN GEOMETRY AND ELECTROMAGNETIC FIELD 

I. Equations for metric and electromagnetic field 

A. Parameters appearing in equations: 

M — mass, Q = charge, a = S/M = angular momentum per unit mass, all 
as measured by their standard imprints on the distant fields. 

B. Constraint on parameters: 

The Kerr-Newman geometry has a horizon, and therefore describes a black 
hole, if and only if M 2 > Q 2 -f a 2 . It seems likely that in any collapsing body 
which violates this constraint, centrifugal forces and/or electrostatic repulsion 
will halt the collapse before a size ~~ M is reached; see equation (33.56). 

C. Limiting cases: 

Q — 0, Kerr (1963) geometry; 

S = 0, Reissner-Nordstrom geometry and electromagnetic field 

(exercises 31.8 and 32.1); 

Q zn S = 0, Schwarzschild geometry; 

M 2 = Q 2 + a 2 “Extreme Kerr-Newman geometry.” 

D. Boyer-Lindquist (1967) coordinates (/, r, 0, $—generalization of Schwarzs¬ 
child coordinates; black hole rotates in <p direction): 

ds 2 = — (A/p 2 )[dt — a sin 2 0 d<j>] 2 4- (sin 2 0/p 2 )[(r 2 -1- a 2 ) dtp — a dt] 2 
+ (p 2 /A)dr 2 + p 2 d9 2 ; 

A = r 2 - 2Mr 4 a 2 + Q 2 , p 2 = r 2 + a 2 cos 2 /?. 

F — Qp~ 4 (r 2 — a 2 cos 2 0) dr A [dt — a sin 2 0 d(p] 

+ 2Qp~ 4 ar cos 0 sin 8 d6 A [(r 2 4- a 2 ) d<j> — a dt] 

E. Kerr coordinates [ V 9 r, 8, <f>—generalization of ingoing Eddington-Finkelstein 

coordinates; ( V ., 8, = constant is an ingoing, “radial,” null geodesic; black 

hole rotates in <J> direction]: 

Relationship to Boyer-Lindquist: 


( 2 ) 

( 3 ) 
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dV = dt + (r 2 + a 2 )(dr/A ), 
d<t> = d<f> + a(dr/A). 

ds 2 — — [1 — p~ 2 (2Mr - Q 2 )] dV 2 + 2 dr dV + p 2 d6 2 

+ P _2 [(/* 2 + a 2 ) 2 - Aci 2 sin 2 #] sin 2 # d<j> 2 — 2a sin 2 # d<j> dr 
- 2ap~\2Mr - Q 2 ) sin 2 6 d$dV. 


( 4 ) 

( 5 ) 


F = Qp 4 [(r 2 — a 2 cos 2 #) dr A dV — 2a 2 r cos 6 sin 6 dO A dV (6) 

— a sin 2 0(r 2 — a 2 cos 2 #) dr A d<j) + 2ar(r 2 -f a 2 ) cos # sin 0 dd A d§\. 


II. Properties of spacetime geometry 

A. Symmetries (§33.4): 

The metric coefficients in Boyer-Lindquist 
coordinates are independent of t and <£>, 
and in Kerr coordinates are independent 
of V and <j>. Thus the spacetime geometry 
is “time-independent” (stationary) and ax¬ 
ially symmetric. The “Killing vectors” 
(§25.2) associated with these two sym¬ 
metries are (d/dt)^ = (d/d V) r , 0 ,Z and 
(d/d<j>) t r 0 = 

B. Dragging of inertial frames and static limit 
(§33.4): 

1. The “dragging of inertial frames” by 
the black hole’s angular momentum 
produces a precession of gyroscopes 
relative to distant stars. By this preces¬ 
sion one defines and measures the an¬ 
gular momentum of the black hole (see 
§§19.2 and 19.3). 

2. The dragging becomes more and more 
extreme the nearer one approaches the 
horizon of the black hole. Before the 
horizon is reached, at a surface de- 

_scribed by 

r = r 0 (6) 

= M + \/M 2 - Q 2 - a 2 cos 2 0, (7) 

the dragging becomes so extreme that 
no observer can possibly remain at rest 
there (i.e., be “static”) relative to the 
distant stars. At and inside this surface 


(called the “ static limit”), all observers 
with fixed r and # must orbit the black 
hole in the same direction in which the 
hole rotates: 

Q = dfy/dt 

a sin # — VA 

> --- 

(r 2 -f a 2 ) sin # — \[~Aa sin 2 # 

(> 0 for a = S/M > 0 and r < r 0 ). 

No matter how hard an observer, at 
fixed (r, #) inside the static limit, blasts 
his rocket engines, he can never halt his 
angular motion relative to the distant 
stars. 

3. The mathematical foundation for the 
above statement is this: world lines of 
the form (r, #, <j>) = constant [tangent 
vector oc d/dt = “Killing vector in time 
direction”] change from being timelike 
outside the static limit to being space¬ 
like inside it. Therefore, on and inside 
the static limit, no observer can remain 
at rest. 

C. Horizon (§33.4): 

1. The horizon is located at 

r = r + = M + VM 2 ~ Q 2 ~ a*. (8) 

2. As with the Schwarzschild horizon of 
a nonrotating black hole, so also here, 
particles and photons can fall inward 
through the horizon; but no particle or 
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Box 33.2 {continued) 



photon can emerge outward through it. 

3. The horizon is “generated” by outgoing 
null geodesics (outgoing photon world 
lines). 

D. Ergosphere (§33.4): 

1. The “ergosphere” is the region of space- 
time between the horizon and the 
static limit. It plays a fundamental role 
in the physics of black holes (Box 33.3; 
§33.7). 

2. The static limit and the horizon touch 
at the point where they are cut by 
the axis of rotation of the black hole 
(i 9 = 0, 7 r); they are well-separated else¬ 
where with the static limit outside the 
horizon, unless a — 0 (no rotation). 
When a = 0, the static limit and hori¬ 
zon coincide; there is no dragging of 
inertial frames; there is no ergosphere. 



Qualitative representation of horizon, ergosphere, and static 
limit [adapted from Ruffini and Wheeler (1971b)]. 

E. Singularity in Boyer-Lindquist coordi¬ 
nates: 

1. For a nonrotating black hole, the 
Schwarzschild coordinates become sin¬ 
gular at the horizon. One manifestation 

v_ 


of the singularity is the infinite amount 
of coordinate time required for any 
particle or photon to fall inward 
through the horizon, t —oo as 
r —2 M. One way to remove the sin¬ 
gularity (Eddington-Finkelstein way) is 
to replace t by a null coordinate 
V = / -f r -f 2Mln \r/2M — 1| 
attached to infalling photons [so 
(d/dr)ve<t> vector tangent to photon 
world lines]. 

2. For a rotating black hole, the Boyer- 
Lindquist coordinates, being generali¬ 
zations of the Schwarzschild coordi¬ 
nates, are also singular at the horizon. 
It requires an infinite coordinate time 
for any particle or photon to fall in¬ 
ward through the horizon, t —oo as 
r —>- r + . But that is not all. The drag¬ 
ging of inertial frames forces particles 
and photons near the horizon to orbit 
the black hole with i2~d<j>/dt > 0. 
Consequently, for a particle falling 
through the horizon (r —r + ), just as 
t —h oo, so also <j> —>- oo (infinite 
twisting of world lines around horizon). 

3. To remove the coordinate singularity, 
one must perform an infinite compres¬ 
sion of coordinate time, and an infinite 
untwisting in the neighborhood of the 
horizon. Kerr coordinates achieve this 
by replacing t with a null coordinate V, 
and <j> with an untwisted angular coor¬ 
dinate (j>: 

dV = dt + (r 2 + a 2 )(dr/A j, 

d<j> = d<j> 4- a(dr/A). 

Both of the new coordinates are at¬ 
tached to the world lines of a particular 
family of infalling photons; (3/3/O^j 
is the field of vectors tangent to the 
world lines of this family of photons 
(ingoing principal null congruence; 
§33.6). 
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F. Spacetime diagram: 

1. A spacetime diagram in Kerr coordi¬ 
nates looks much like an Eddington- 
Finkelstein diagram for the Schwarzs- 
child geometry. In both cases, one plots 
the surfaces of constant V not as hori¬ 
zontal planes, but as “backward light 
cones” (“45-degree surfaces”), because 
they are generated by the world lines 
of ingoing photons. Equivalently, one 
plots surfaces of constant 7= V — r as 
horizontal planes. 

2. The key differences between a Kerr 
diagram and an Eddington-Finkelstein 
diagram are: (a) Because the Kerr- 
Newman geometry is not spherical, a 
Kerr diagram with one rotational de¬ 
gree of freedom suppressed loses infor¬ 
mation about the geometry. Kerr dia¬ 
grams are usually made for the 
equatorial “plane,” 6 = 77 / 2 . (b) Just as 
the horizon pulls the light cones inward, 
so the dragging of inertial frames tilts 
the light cones in the direction of in¬ 
creasing for a > 0 and r = constant, 
(c) The ingoing edge of a light cone 
(dr/dV = — 00 ) does not tilt toward in¬ 
creasing the transformation from 
Boyer-Lindquist coordinates to Kerr 
coordinates untwists the tilt with de¬ 
creasing r, which would otherwise be 
produced by “frame dragging.” 

3. The shapes of the light cones reveal the 
special features of the static limit and 
horizon. At the static limit, a vertical 
world line [r, 0, constant; (d/dV) r6 j 
= (3/3Or,« 9,0 = tangent vector] lies on 
the light cone. At the horizon the light 
cones tilts fully inward, except for a 
single line of tangency to the horizon. 
Notice that the line of tangency has 
d$/dV — a/(r + 2 + a 2 ) ^ 0. Equiva¬ 
lently, the outgoing null geodesics, 
which generate the horizon, twist about 
it (“barber-pole-twist”)—yet another 
manifestation of the dragging of inertial 
frames. 


Kerr diagram for equatorial slice 
(6 = tt/2) through the spacetime of an 
“extreme Kerr” black hole (Q = 0, a = A/). 


V 



View from above showing the shapes of 
the light cones as a function of radius 
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Box 33.2 (continued) 

4. The Kerr diagram, like the Eddington- 
Finkelstein diagram/ describes infall 
through the horizon in a faithful, non¬ 
singular way. 

5. [The term “Kerr diagram” is a misno¬ 
mer. Kerr has not published such dia¬ 
grams himself, though nowadays others 
construct such diagrams using his coor¬ 
dinate system. Penrose is the originator 
and greatest exploiter of such diagrams 
(see, e.g., Penrose, 1969). But several 
other types of diagrams bear Penrose’s 
name, so it would be confusing to name 
them all after him.] 

G. Maximal analytic extension of Kerr-New- 

man geometry: 

1. When one abstracts the Schwarzschild 
geometry away from all sources (Chap¬ 
ter 31), one discovers that it describes 
an expanding and recontracting bridge, 
connecting two different universes. But 
in the context of black holes, only half 
of the Schwarzschild geometry (regions 
I and II) is relevant. The other half 
(regions III and IV) gets fully replaced 
by the interior of the star that collapsed 
to form the black hole. Because only a 


Surface of 
collapsing star 



part of the Schwarzschild geometry 
comes into play, ingoing Eddington- 
Finkelstein coordinates—which de¬ 
scribe I and II well, but III and IV 
badly—are well-suited to black-hole 
physics. 



2. Similarly, when one abstracts the Kerr- 
Newman geometry away from all sour¬ 
ces, one discovers that it describes a 
much larger, and more complex space- 
time manifold than one might ever 
have suspected. This “maximum ana¬ 
lytic extension” of the Kerr-Newman 
geometry has been analyzed in de¬ 
tail by Boyer and Lindquist (1967) 
and by Carter (1966a, 1968a). But it is 
totally irrelevant to the subject of black 
holes, for two reasons. First, as with 
Schwarzschild, the star that collapsed to 
form the black hole replaces most of the 
inward extension of the Kerr-Newman 
manifold. Second, even outside the star, 
the Kerr-Newman geometry does not 
properly represent the true geometry at 
early times. At early times the star has 
not got far down the road to collapse. 
Gravitational moments of the star arise 
from mountains or prominences or tur¬ 
bulence or other particularities that 
have not yet gone into the meat grinder. 
The geometry departs from flatness (1) 
by a term that varies for large distances 
as mass divided by distance, and (2) by 
another term that varies as angular mo¬ 
mentum divided by the square of the 
distance and multiplied by a spherical 
harmonic of order one, but also (3) by 
higher-order terms proportional to 
higher-order mass moments multiplied 
by higher spherical harmonics. These 
higher-order terms normally will devi¬ 
ate at early times from the correspond¬ 
ing terms in the mathematical analysis 
of the Kerr-Newman geometry— 
though the deviations will die out as 
time passes. For a system endowed with 
spherical symmetry, no such higher- 
order terms do occur or can occur. 
Therefore the geometry outside is 
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Schwarzschild in character at all stages 
of the collapse. However, when the sys¬ 
tem lacks spherical symmetry, the ge¬ 
ometry outside initially departs from 
Kerr-Newman character. Only well after 
the collapse occurs (asymptotic future), 
and in the region at and outside the hori¬ 
zon, is the Kerr-Newman geometry a 
faithful descriptor of a black hole . This 
region is described in a nonsingular 
manner by Kerr coordinates and Ken- 
diagrams; and it is the only region that 
this book will explore. 

H. Test-particle orbits 

See §§33.5-33.8 and Box 33.5. 


III. Properties of electromagnetic field (§33.3): 


A. Far from the black hole, where spacetime 
is nearly flat, in the usual spherical 
orthonormal frame (w t = dt , cu T = dr , 
cu § = rdO , cu* = rsinO d$), the electric 
and magnetic fields have dominant com¬ 
ponents 



. Qa . 

cos#, B& = — T s\n 0. 
r 6 




These reveal that 

Q = charge of black hole, 

9Z = Qa — magnetic dipole moment of 
black hole. 

B. Notice that the gyromagnetic ratio, y~ 
(magnetic moment)/(angular momentum), 
is equal to Q/M — (charge/mass), just as 
for an electron! 

C. Notice that the value of the magnetic mo¬ 
ment, like all other features of the black 
hole, is determined uniquely by the hole’s 
mass, charge, and angular momentum: 
9H ~ QS/M. This illustrates the theorem 
(Box 33.1) that a black hole has no “hair.” 

D. Other electric and magnetic moments are 
nonzero, but are determined uniquely by 
M , S , and Q. 

E. Near the black hole, the curvature of 
spacetime deforms the electric and mag¬ 
netic fields produced by the charged, rota¬ 
ting black hole. For a mathematical de¬ 
scription of this deformed field, see Cohen 
and Wald (1971); for a diagrammatic rep¬ 
resentation, Hanni and Ruffini (1973). 


— 

Box 33.3 THE ASTROPHYSICS OF BLACK HOLES 


Black holes in nature should participate in astro- 
physical processes that are as varied as those for 
stars. By searching for observable phenomena as¬ 
sociated with these processes, astronomers have a 
good chance of discovering the first black hole 
sometime during the 1970’s. This box lists some 
possible astrophysical processes, and a few rele¬ 
vant references. 



I. Mechanisms of Formation 

A. “Direct, in isolation”: A massive star (M > 
3Af©) collapses, almost spherically, pro¬ 
ducing a collapsed neutron-star core that 
is too massive to support itself against 
gravity. Gravity pulls the core on inward, 
producing a horizon and black hole. [May 
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Box 33.3 (continued) 

and White (1966, 1967); Chapter 32 of this 
book.] 

B. “Indirect, in isolation”: “Collapse, pursuit, 
and plunge scenario” depicted in Figure 
24.3 [Ruffini and Wheeler (1971b).] 

C. “In the thick of things”: Stars collected 
into a dense cluster (e.g., the nucleus of 
a galaxy) exchange energy. Some acquire 
energy and move out into a halo. Others 
lose energy and make a more compact 
cluster. This process of segregation con¬ 
tinues. The cluster becomes so compact 
that collisions ensue and gas is driven off. 
The gas moves toward the center of the 
gravitational potential well. Out of it new 
stars form. The process continues. Eventu¬ 
ally star-star collisions may become suffi¬ 
ciently energetic and inelastic that the cen¬ 
ters of the colliding stars coalesce. In this 
way supermassive objects may be built up 
and may evolve. Ultimately (1) many 
“small” stars may collapse to form “small” 
black holes (M ~~ M 0 ); (2) one or more 
supermassive stars may collapse to form 
huge black holes 10 4 A/ o to 10 9 A/ o ); 
(3) the entire conglomerate of stars and gas 
and holes may become so dense that it 
collapses to form a single gigantic hole. 
[Sanders (1970), Spitzer (1971), Lynden- 
Bell (1967, 1969), Colgate (1967), §§24.5, 
24.6, 25.7 of this book.] 

D. “Primordially”: Perturbations in the initial 
density distribution of the expanding uni¬ 
verse may produce collapse, resulting in 
“primordial black holes.” Those holes 
would subsequently grow by accretion of 
radiation and matter. By today all such 
holes might have grown into enormous 
objects [M ~~ 10 17 M 0 ; ZePdovich and 
Novikov (1966)]; but some of them might 
have avoided such growth and might be 
as small as 10 -5 grams [Hawking (1971a)]. 



II. How many black holes are there in our galaxy 
today? 

Peebles (1972) has given an excellent review 
of this issue and of prospects for finding black 
holes in the near future. He says “a good 
fraction of the mass of the disc of our galaxy 
was deposited [long ago] in stars capable of 
collapsing to black holes.... The indication is 
that the galaxy’s disk may contain on the 
order of 10 9 black holes.” 

III. “Live” black holes versus “dead” black holes 

A. A Schwarzschild black hole is “dead” in 
the sense that one can never extract from 
it any of its mass-energy. One aspect of 
this “deadness”—the fact that a Schwarzs¬ 
child black hole is stable against small 
perturbations—is essential (1) to the iden¬ 
tification of a black hole with the ultimate 
“ground state” of a large mass, and (2) to 
any assertion that general relativity theory 
predicts the possible existence of black 
holes. [For a proof of stability see Vish- 
veshwara (1970). The problem was formu¬ 
lated, and most of the necessary tech¬ 
niques developed, by Regge and Wheeler 
(1957), with essential contributions also by 
Zerilli (1970a).] Thus a small pulse of gravi¬ 
tational (or other) radiation impinging on 
a Schwarzschild black hole does not initi¬ 
ate a transition of the black hole into a 
very different object or state. 

B. A Kerr-Newman black hole—which is ro¬ 
tating or charged or both—is not dead. 
The rotational and electromagnetic contri¬ 
butions to the mass-energy can be extracted. 
(See §§33.7 and 33.8 for mathematical de¬ 
tails.) Thus, such black holes are “live”; 
they can inject energy into their surround¬ 
ings. By a suitable arrangement of external 
apparatus, one can trigger an exponen¬ 
tially growing energy release [Press and 
Teukolsky (1972).] But for a perturbed 
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black hole in isolation, the release is al¬ 
ways “controlled” and damped; i.e.. Ken- 
black holes are stable in any classical con¬ 
text [Press and Teukolsky (1973)]. 

C. Most objects (massive stars; galactic nu¬ 
clei; ...) that can collapse to form black 
holes have so much angular momentum 
that the holes they produce should be 
“very live” (a nearly equal to M; S nearly 
equal to M 2 ). [Bardeen (1970a).] 

D. By contrast, it is quite probable (but far 
from certain) that no black hole in the 
universe has substantial charge—i.e., that 
all black holes have Q 4.M. A black hole 
with Q — M (say, Q y 0 for concreteness) 
would exert attractive electrostatic forces 
on electrons, and repulsive electrostatic 
forces on protons, that are larger than the 
hole’s gravitational pull by the factor 


(electrostatic force) 
(gravitational force) 




Here e is the electron charge and ju, is the 
electron (or proton) mass. Such huge 
differential forces are likely to pull in 
enough charge from outside the hole to 
neutralize it. 

E. But one has learned from the “unipolar 
induction process” for neutron stars [Gold- 
reich and Julian (1968)] that charge neu¬ 
tralization can sometimes be circum¬ 
vented. Whether any black-hole process 
can possibly prevent neutralization one 
does not know in 1972. 

IV. Interaction of a black hole with its environ¬ 
ment 

A. Gravitational pull: A black hole exerts a 
gravitational pull on surrounding matter 
and stars. The pull is indistinguishable, at 
radii r > M, from the pull of a star with 
the same mass. 

B. Accretion and emission of x-rays and y- 
rays: Gas surrounding a black hole gets 


pulled inward and is heated by adiabatic 
compression, by shock waves, by turbu¬ 
lence, by viscosity, etc. Before it reaches 
the horizon, the gas may become so hot 
that it emits a large flux of x-rays and 
perhaps even y-rays. Thus, accreting mat¬ 
ter can convert a black hole into a glowing 
“white” body [for a review of the litera¬ 
ture, see Novikov and Thome (1973)]. 
Accretion from a nonrotating gas cloud 
tends to decrease the angular momentum 
of a black hole [preferential accretion of 
particles with “negative” angular momen¬ 
tum; Doroshkevich (1966), Godfrey 
(1970a)]. But the gas surrounding a hole 
is likely to be rotating in the same direc¬ 
tion as the hole itself, and to maintain 
S ~~ M 2 [more precisely, S zz 0.998A/ 2 ; 
Thome (1973b)]. 

C. A lump of matter (an “asteroid” or a 
“planet” or a star) falling into a black hole 
should emit a burst of gravitational waves 
as it falls. The total energy radiated is 
E ~~ 0.01 J u( J u/M), where fi is the mass of 
the object. [Zerilli (1970b); Davis, Ruffini, 
Press, and Price (1971); Figure 36.2 of this 
book.] 

D. An object in a stable orbit around a black 
hole should spiral slowly inward because 
of loss of energy through gravitational ra¬ 
diation, until it reaches the most tightly 
bound, stable circular orbit. It should then 
fall quickly into the hole, emitting a “last- 
gasp burst” of waves. The total energy 
radiated during the slow inward spiral is 
equal to the binding energy of the last 
stable circular orbit: 

^radiated M ^Iast orbit 

f 0.0572 /a for Schwarzschild hole, 

(0.4235/1 for Kerr hole with 

S ~ M 2 , Q — 0. 

Here /a is the rest mass of the captured 
object. [Box 33.5.] The total energy in 
the last-gasp burst is E — 0.01/a(/a/A/) if 
li <M. [Fig. 36.2.] 
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E. When matter falls down a black hole, it 
can excite the hole’s external spacetime 
geometry into vibration. The vibrations 
are gradually converted into gravitational 
waves, some of which escape, others go 
down the hole. [Press (1971, Goebel (1972).] 
These vibrations are analogous to an 
“incipient gravitational geon” [Wheeler 
(1962); Christodoulou (1971)]—except that 
for a vibrating black hole the background 
Kerr geometry holds the vibration energy 
together (prevents it from propagating 
away immediately), whereas in a geon it 
is curvature produced by the “vibration 
energy” itself that prevents disruption. 

F. By a non-Newtonian, induction-zone (i.e., 
nonradiative) gravitational interaction, a 
black hole gradually transfers its angular 
momentum to any non-axially-symmetric, 
nearby distribution of matter or fields. 
[Hawking (1972a); Ipser (1971), Press 
(1972), Hawking and Hartle (1972).] 

G. A star or planet falling into a large black 
hole will get torn apart by tidal gravita¬ 
tional forces. If the tearing occurs near but 
outside the horizon, it may eject a blob of 
stellar matter that goes out with relativis¬ 
tic velocity (“tube-of-toothpaste effect”). 
Moreover, the outgoing jet may extract a 
substantial amount of rotational energy 
from the hole’s ergosphere—i.e., the hole 
might throw it off with a rest mass plus 
kinetic energy in excess of the rest mass 
of the original infalling object. [Wheeler 
(1971 d); §§33.7 and 33.8.] 

H. The magnetic field fines of a charged black 
hole may be anchored to surrounding 
plasma, may get wound up as the hole 
rotates, and may shake, twitch, and excite 
the plasma. 



V. Collisions between black holes 

A. Two black holes can collide and coalesce; 
but there is no way to blast a black hole 
apart into several black holes [Hawking 
(1972a); exercise 34.4]. 

B. When two black holes collide and coalesce, 
the surface area of the final black hole 
must exceed the sum of the surface areas 
of the two initial black holes (“second 
law of black-hole dynamics”; Hawking 
(1971 a,b); Box 33.4; §34.5). This constraint 
places an upper limit on the amount of 
gravitational radiation emitted in the colli¬ 
sion. For example, if all three holes are of 
the Schwarzschild variety and the two ini¬ 
tial holes have equal masses M/2, then 

4ir(2M flnaI )2 > 4 tt[2(M/2)]2 + 4w[2(M/2)?, 
A/ tinal > M/V2, 
so the energy radiated is 

Radiated < M - M/ y/l = 0.293M. 

VI. Where and how to search for a black hole [For 

a detailed review, see Peebles (1971)]: 

A. When it forms, by the burst or bursts of 
gravitational radiation given off during 
formation [Figure 24.3]. 

B. In a binary star system: black-hole compo¬ 
nent optically invisible, but may emit 
x-rays and y-rays due to accretion; visible 
component shows telltale Doppler shifts 
[Hoyle, Fowler, Burbidge, and Burbidge 
(1964); Zel’dovich and Guseynov (1965); 
Trimble and Thome (1969); Pringle and 
Rees (1972); Shakura and Sunyaev 
(1973)]. The velocity of the visible compo¬ 
nent and the period give information on 
the mass of the invisible component. If 
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mass of this invisible component is four 
solar masses or more, it cannot be an ordi¬ 
nary star, because an ordinary star of that 
mass would have (4) 3 = 64 times the lu¬ 
minosity of the sun. Neither can it be a 
white dwarf or a neutron star because ei¬ 
ther object, so heavy, would instantly col¬ 
lapse to a black hole. Therefore, it is attrac¬ 
tive—though not necessarily compelling 
[see Trimble and Thome (1969)]—to iden¬ 
tify the invisible object as a black hole. 

C. [But one must not expect to see any no¬ 
ticeable gravitational lens action from a 
black hole in a binary system: if it taxed 
the abilities of astronomers for decades to 
see the black disc of Mercury, 4,800 km 
in diameter, swim across the great face of 
the sun, little hope there is to see a black 
hole with an effective radius of only 
km, enormously more remote, occult a 
companion star. Significant lens action re¬ 
quires that the lens (black hole) be sepa¬ 
rated by a normal interstellar distance 
from the star it focuses; whence the impact 
parameter of the focused rays is more than 
a stellar radius, so the lens ^action is not 
more than that of a normal star. Moreover, 

___ 



even with 10 9 black holes in the galaxy, 
only one per year would pass directly be¬ 
tween the Earth and a more distant star, 
and produce significant lens action 
(Refsdal, 1964). Chance of watching the 
right spot on the sky at the right time with 
a sufficiently strong telescope: nil!] 

D. At the center of a globular cluster, where 
a black hole may settle down, attract nor¬ 
mal stars to its vicinity, and thereby pro¬ 
duce a cusp in the distribution of light 
from the cluster. [Cameron and Truran 
(1971), Peebles (1971).] 

E. In the nucleus of a galaxy, including even 
the Milky Way, where a single huge black 
hole (M— 10 4 to 10 8 M o ) might sit as an 
end-product of earlier activity of the ga¬ 
lactic nucleus. Such a hole will emit gravi¬ 
tational waves, light, and radio waves as 
it accretes matter. Much of the light may 
be converted into infrared radiation by 
surrounding dust. The black hole may also 
produce jets and other nuclear activity. 
[Lynden-Bell (1969), Lynden-Bell and 
Rees (1971), Wheeler (1971d), Peebles 
(1971).] 


r \ 

Box 33.4 THE LAWS OF BLACK-HOLE DYNAMICS 

The black-hole processes described in Box 33.3 are governed by the standard laws 
of physics: general relativity, plus Maxwell electrodynamics, plus hydrodynamic, 
quantum mechanical, and other laws for the physics of matter and radiation. From 
these standard laws of physics, one can derive certain “rules” or “constraints,” which 
all black-hole processes must satisfy. Those rules have a power, elegance, and 
simplicity that rival and resemble the power, elegance, and simplicity of the laws 
of thermodynamics. Therefore, they have been given the analogous name “the laws 
of black-hole dynamics” (Israel 1971). This box states two of the laws of black-hole 
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Box 33.4 (continued) 

dynamics and some of their ramifications. Two additional laws, not discussed here, 
have been formulated by Bardeen. Carter, and Hawking (1973). 

I. The First and Second Laws of Black-Hole Dynamics. 

A. The first law . 

1. Like the first law of thermodynamics, the first law of black-hole dynamics 
is the standard law of conservation of total energy, supplemented by the 
laws of conservation of total momentum, angular momentum, and charge. 
For detailed discussions of these conservation laws, see Box 19.1 and 
Chapter 20. 

2. Specialized to the case where matter falls down a black hole and gravita¬ 
tional waves pour out, the first law takes the form depicted and discussed 
near the end of Box 19.1. 

3. Specialized to the case of infalling electric charge, the first law says that 
the total charge Q of a black hole, as measured by the electric flux emerging 
from it, changes by an amount equal to the total charge that falls down 
the hole, 

*7that falls in' 


4. Specialized to the case where two black holes collide and coalesce (example 
given in Box 33.3), the first law says: (a) Let P x and P 2 be the 4-momenta 
of the two black holes as measured gravitationally, when they are so 
well-separated that they have negligible influence on each other. (P x and 
P 2 are 4-vectors in the surrounding asymptotically flat spacetime.) Simi¬ 
larly, let J 1 and J 2 be their total angular-momentum tensors (not intrinsic 
angular-momentum vectors!) relative to some arbitrarily chosen origin of 
coordinates, *P 0 , in the surrounding asymptotically flat spacetime (J x and 
J 2 contain orbital angular momentum, as well as intrinsic angular mo¬ 
mentum; see Box 5.6.). (b) Let P 3 and J 3 be the similar total 4-momentum 
and angular momentum of the final black hole, (c) Let P r and J r be the 
total 4-momentum and angular momentum radiated as gravitational waves 
during the collision and coalescence. Then 

^3 — + P 2 ~ ^r» “b 

[Note: to calculate the mass and intrinsic angular momentum of the final 
black hole from a knowledge of P 3 and J 3 , follow the prescription of Box 
5.6. In that prescription, the world line of the final black hole is that world 
line, in the distant asymptotically Lorentz coordinates, on which the hole’s 
distant spherical field is centered. 
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B. The second law [expounded and applied by Hawking (1971b, 1972a)]. 
When anything falls down a black hole, or when several black holes collide 
and coalesce or collide and scatter, or in any other process whatsoever involv¬ 
ing black holes, the sum of the surface areas (or squares of “irreducible 
masses”—see equation 3 below) of all black holes involved can never decrease . 
(See §34.5 for proof) This is the second law of black-hole dynamics. 


II. Reversible and Irreversible Transformations; Irreducible Mass 

[Christodoulou (1970); Christodoulou and Ruffini (1971)—results derived inde¬ 
pendently of and simultaneously with Hawking’s discovery of the second law.] 

A. Consider a single Kerr-Newman black hole interacting with surrounding 
matter and fields. Its surface area, at any moment of time, is given in terms 
of its momentary mass M, charge Q , and intrinsic angular momentum per 
unit mass a = S/M by 

A = 47 r[r + 2 + a 2 ] = 4tt[(M + y/W- Q 2 - a 2 ) 2 + a 2 ] (1) 

(exercise 33.12). Interaction with matter and fields may change M, Q y and 
a in various ways; M can even be decreased—i.e., energy can be extracted 
from the black hole! [Penrose (1969); §33.7.] But whatever may be the 
changes, they can never reduce the surface area A . Moreover, if any change 
in A/, Q , and a ever increases the surface area, no future process can ever 
reduce it back to its initial value. 

B. Thus, one can classify black-hole processes into two groups. 

1. Reversible transformations change M, Q, or a or any set thereof, while 
leaving the surface area fixed. They can be reversed, bringing the black 

— Jiole back to its original state. 

2. Irreversible transformations change M, Q, or a or any set thereof, and 
increase the surface area in the process. Such a transformation can never 
be reversed. The black hole can never be brought back to its original state 
after an irreversible transformation. 

C. Examples of reversible transformations and of irreversible transformations 
induced by infalling particles are presented in §§33.7 and 33.8. 

D. The reversible extraction of charge and angular momentum from a black hole 
T3ecrease in Q and a holding A fixed) necessarily reduces the black hole’s 
mass (energy extraction!). By the time all charge and angular momentum 
have been removed, the mass has dropped to a final “irreducible value” of 


M iT = ([A/Utr ) 1 ' 2 = ( 


mass of Schwarzschild > 
black hole of surface area Aj 
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Box 33.4 (continued) 


E. Expressed in terms of this final, irreducible mass, the initial mass-energy of 
the black hole (with charge Q and intrinsic angular momentum S) is 

— 

[This formula, derived by Christodoulou and Ruffini, may be obtained by 
combining equations (1), (2), and S — Ma\. 

F. Thus, one can regard the total mass-energy of a black hole as made up of an 
irreducible mass , an electromagnetic mass-energy ; and a rotational energy. But 
one must resist the temptation to think of these contributions as adding 
linearly. On the contrary, they combine in a way [equation (3)] analogous 
to the way rest mass and linear momentum combine to give energy, E 2 = 
m 2 + p 2 . 

G. Contours of constant M/M ir are depicted below in the “charge-angular 
momentum plane.” Black holes can exist only in the interior of the region 
depicted (Q 2 + a 2 < A/ 2 ). [Diagram adapted from Christodoulou (1971).] 


* 

s 

4 2 

I 2 



-2 
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H. Since a black hole’s irreducible mass is proportional to the square root of 
its surface area, one can restate the second law of black-hole dynamics as 
follows: 





In black-hole processes the sum of the squares of the irreducible masses of 
all black holes involved can never decrease . 


V 


§33.3. MASS, ANGULAR MOMENTUM, CHARGE, 

AND MAGNETIC MOMENT 

It is instructive to verify that the constants M, Q y and a , which appear in equations 
(33.2)—(33.5) for the Kerr-Newman geometry and electromagnetic field, are actually 
the black hole’s mass, charge, and angular momentum per unit mass, as claimed 
above. 

Mass and angular momentum are defined by their imprints on the spacetime 
geometry far from the black hole. Therefore, to calculate the mass and angular 
momentum, one can expand the line element (33.2) in powers of 1/r and examine 
the leading terms: 

** = - [ 1 — V + o (£)] <** - [*7*+ o (£)] * * 

/Tyj ( 33 « 

(Ml [dr 2 + r 2 (d$ 2 + sin 2 0 <fy 2 )]. 


+ 1 + 0 



-\ 

The rest of this chapter is 
Track 2. To be prepared for it, 
one needs to have covered the 
Track-2 part of Chapter 32 
(gravitational collapse). In 
reading it, one will be helped 
greatly by Chapter 25 (orbits in 
Schwarzschild geometry). The 
rest of this chapter is needed 
as preparation for Chapter 34 
(singularities and global 
methods). 

_; > 

The metric far outside a 
black hole: imprints of mass 
and angular momentum 


The examination is facilitated by transforming to asymptotically Lorentz coordi¬ 
nates— x = r sin 0 cos <j>, y = r sin 6 sin <j>, z = r cos 6: 


ds 2 = - o(fY dt 2 - + [xdy-ydx] 


+ 1 + 0(7) l dx2 + d y 2 + dz2 l 


(33.6') 


Direct comparison with the “standard form” [equation (19.13)] of the metric far from 
a stationary rotating source reveals that (1) the parameter Mis, indeed, the mass 
of the black hole; and (2) the intrinsic angular momentum vector of the black hole 
is 


S = (aM) d/dz = 



unit vector pointing along polar axis 
of Boyer-Lindquist coordinates 


)■ 


(33.7) 


The charge is defined for the black hole, as for any source, by a Gaussian flux 
integral of its electric field over a closed surface surrounding the hole. The electric 
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The electromagnetic field far 
outside a black hole: 

(1) electric field 


(2) magnetic field 


(3) magnetic dipole moment 


Nonspherical shape of hole's 
geometry . 


Killing vectors for the 
Kerr-Newman geometry 


field in the asymptotic rest frame of the black hole has as its orthonormal components 

Er = E r = F Tt = 0/r* + 0(l/r% 

E h = EJr = F et /r = 0(\/r% (33.8) 

= EJr sin 0 — F et /r sin 9 = 0. 


Hence, the electric field is purely radial with a Gaussian flux integral of 4 nQ, which 
reveals Q to be the black hole’s charge. 

A similar calculation of the dominant components of the magnetic field reveals 


B • = F r , = . = 2 cos 0 + O (3-Y 

r r 2 sin 9 r 3 \r 4 /’ 

B» = sin 6 + o(-J-), 


^ r sin 0 


= F H = 


r° 

F^ 


(33.9) 


= 0 . 


This is a dipole magnetic field, and from it one immediately reads off the value 

91 = Qa — ( Q/M)S — (charge/mass) X (angular momentum) (33.10) 

^“gyromagnetic ratio”] 

for the magnetic moment of the black hole. 

Just as the rotation of the black hole produces a magnetic field, so it also produces 
nonspherical deformations in the gravitational field of the black hole [see Hernandez 
(1967) for quantitative discussion]. But those deformations, like the magnetic mo¬ 
ment, are not freely specifiable. They are determined uniquely by the mass, charge, 
and angular momentum of the black hole. 


§33.4. SYMMETRIES AND FRAME DRAGGING 


The metric components (33.2) of a Kerr-Newman black hole are independent of 
the Boyer-Lindquist time coordinate t and angular coordinate <f>. This means (see 
§25.2) that 


£ {t) = (d/dOr, 0,0 an d f( 0 ) = r,e 1) 

are Killing vectors associated with the stationarity (time-translation invariance) and 
axial symmetry of the black hole. The scalar products of these Killing vectors with 
themselves and each other are 
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2 Mr - Q 2 


( A — a 2 sin 2 # \ / 

' (0 ' MO — gtt — — ( ^2 / = — \ ^ ~ 

r . r _ „ _ a sin2e (A- r 2 - a 2 ) (2 Mr - Q 2 )a sin 2 0 

'(0 '(0) <?£0 ^2 ^2 


^"( 0 ) ^"( 0 ) ^>00 


[(r 2 + a 2 ) 2 — zla 2 sin 2 0] sin 2 0 


), (33.12a) 

(33.12b) 
(33.12c) 


Since Killing vectors are geometric properties of spacetime, with existence inde¬ 
pendent of any and all coordinate systems, their scalar products also have coor¬ 
dinate-free meaning. It so happens (not by chance, but by careful choice of coordi¬ 
nates!) that the Boyer-Lindquist metric components g tt9 g t<t>9 and g^ are equal to these 
coordinate-independent scalar products. Thus g tt , g t<>9 and g^ can be thought of 
as three scalar fields which embody information about the symmetries of spacetime. 
By contrast, the metric coefficients g^ = p 2 /A and g ee = p 2 carry no information 
at all about the symmetries.* They depend, for their existence and values, on the 
specific Boyer-Lindquist choice of coordinates. 

Any observer who moves along a world line of constant (r, 6) with uniform angular 
velocity sees an unchanging spacetime geometry in his neighborhood. Hence, such 
an observer can be thought of as “ stationary” relative to the local geometry. If and 
only if his angular velocity is zero, that is, if and only if he moves along a world 
line of constant (r, 0, <J>), will he also be “static” relative to the black hole’s asymptotic 
Lorentz frame (i.e., relative to the “distant stars”). 

The precise definition of “angular velocity relative to the asymptotic rest frame” 
—or simply “angular velocity”—is 


q d<f> dfy/dj u$ 

dt dt/dr u l 


(33.13a) 


Stationary observers 


Static observers 


Angular velocity and 
4-velocity of a stationary 
observer 


(see exercise 33.2). In terms of 12, the Killing vectors, and the scalar products of 
Killing vectors, the 4-velocity of a stationary observer is 


u = u'(d/dt + S 73/3*) = f - fy 

I <(t) + J/ M0)l 


_ ftt) + _ 

(~8tt — ^®gt<t> ~ ^) 1/2 


(33.13b) 


A stationary observer is static if and only if 12 vanishes. 

The stationary observers at given /*, 0 cannot have any and every angular velocity. 
Only those values of 12 are allowed for which the 4-velocity u lies inside the future 
light cone—i.e., for which 


(f(0 + — gtt + ^®gt<t> + & 2 g<t><;> < 0. 


*This is not quite true. Kerr-Newman spacetime possesses, in addition to its two Killing vectors, also 
a “Killing tensor” which is closely linked to the Boyer-Lindquist coordinates r and 6. See Walker and 
Penrose (1970); also §33.5. 
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Frame dragging, static limit, Thus, the angular velocities of stationary observers are constrained by 
and ergosphere 

£ min < a < £ ma *> (33.14) 

where 


^min = w - Vw 2 - g tt /g^, 
l^max = w + V« 2 - gtt/g<M’ 


to zzz — ([2 . + [2 

— 2 v min ' n 


*) = - 


g# (2Mr - Q-)a 


g* (r 2 + a 2 ) 2 — Aa 2 sin 2 0 ’ 


(33.15a) 

(33.15b) 

(33.16) 


and it is assumed that S/M = a > 0. The following features of these limits are 
noteworthy. (1) Far from the black hole, one has = —1 and rJ? max = +1, 

corresponding to the standard limits imposed by the speed of light in flat spacetime. 
(2) With decreasing radius, i2 min increases (“dragging of inertial frames”). Finally, 
when g tt reaches zero, i.e., at 

r = r o (0) = M+ VM 2 - Q 2 - a 2 cos 2 /9, (33.17) 


*^min becomes zero. At and inside this surface, all stationary observers must orbit 
the black hole with positive angular velocity. Thus, static observers exist outside and 
only outside r = r 0 (8). For this reason r = r 0 (8) is called the “static limit”; see Box 
33.2. (3) As one moves through the static limit into the “ergosphere,” one sees the 
allowed range of angular velocities become ever more positive (ever more “frame 
dragging”). At the same time, one sees the allowed range narrow down, until finally, 
at the horizon _- 


r = ~ M + \/M 2 - Q 2 - a 2 , 


(33.18) 


the limits G min and i2 max coalesce (to 2 = g tt /g ^). Thus, at the horizon there are no 
stationary observers. All timelike world lines point inward. There is no escape from 
the black hole’s “pull.” 

Further features of stationary observers and “frame dragging” are explored in 
the exercises. 


EXERCISES Exercise 33.1. KERR DESCRIPTION OF KILLING VECTORS 

(a) Use the transformation law from Boyer-Lindquist coordinates to Kerr coordinates 
[equation (4) of Box 33.2] to show that 

( it) = Wdt) r ^ = (d/dV) Tt$ j, 
f<0) == (d/d<p>) tr$ = (d/d<t>)v,r,0' 

Verify explicitly by examining metric components that 

8 vv — &t* 8 v $ ~ £00 ~ £00* 

in accordance with equations (33.19a,b). 


(33.19a) 

(33.19b) 

(33.19c) 
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(b) Show that for a stationary observer (world line of constant r , 0), the angular velocity 
expressed in terms of Kerr coordinates is 

~ d<$>/dt = d$/dV — u^/u l \ 

so that the entire discussion of stationary observers in terms of Kerr coordinates is identical 
to the discussion in terms of Boyer-Lindquist coordinates. Differences between the coordinate 
systems show up only when one moves along world lines of changing r. Reconcile this 
fact with the fact that both coordinate systems use the same coordinates (r, 9) but different 
time and aximuthal coordinates (/, <p versus P f 5). 

Exercise 33.2. OBSERVATIONS OF ANGULAR VELOCITY 

An observer, far from a black hole and at rest in the hole’s asymptotic Lorentz frame, watches 
(with his eyes) as a particle moves along a stationary (nongeodesic) orbit near the black 
hole. Let i2 = d$/dt be the particle’s angular velocity, as defined and discussed above. The 
distant observer uses his stopwatch to measure the time required for the particle to make 
one complete circuit around the black hole (one complete circuit relative to the distant 
observer himself; i.e., relative to the hole’s asymptotic Lorentz frame). 

(a) Show that the circuit time measured is 'l t n/Q. Thus, Q can be regarded as the particle's 
“angular velocity as measured from infinity. ” 

(b) Let the observer moving with the particle measure its circuit time relative to the 
asymptotic Lorentz frame, using his eyes and a stopwatch he carries. Show that his answer 
for the circuit time must be 


At = (33.20) 

y / 

u “redshift factor”] 

Exercise 33.3. LOCALLY NONROTATING OBSERVERS 
(Bardeen 1970b) 

(a) Place a rigid, circular mirror (“ring mirror”) at fixed (r, 9) around a black hole. Let an 
observer at (r, 9) with angular velocity i? emit a flash of light. Some of the photons will 
get caught by the mirror and will skim along its surface, circumnavigating the black hole 
in the positive-^ direction. Others will get caught and will skim along in the negative-^ 
direction. Show that the observer will receive back the photons from both directions simulta¬ 
neously only if his angular velocity is 

Q = «(/*, 9) — expression (33.16). 

Thus in this case, and only in this case, can the observer regard the +$ and — $ directions 
as equivalent in terms of local geometry. Put differently, in this case and only in this case 
is the observer “nonrotating relative to the local spacetime geometiy.” Thus, it is appropriate 
to use the name “locally nonrotating observer ” for an observer who moves with the angular 
velocity i? = w(r, 9). 

(b) Associated with the axial symmetry of a black hole is a conserved quantity, 
P<*> = P * f ( 0 ), for geodesic motion. This quantity for any particle—whether it is moving along 
a geodesic or not—is called the “component of angular momentum along the black hole’s 
spin axis,” or simply the particle’s “angular momentum.” (See §33.5 below.) Show that of 
all stationary observers at fixed (r, 0 ), only the “locally nonrotating observer” has zero angular 
momentum. [Note: Bardeen, Press, and Teukolsky (1972) have shown that the “locally 
nonrotating observer” can be a powerful tool in the analysis of physical processes near a 
black hole.] 
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Exercise 33.4. ORTHONORMAL FRAMES OF LOCALLY 
NONROTATING OBSERVERS 

(a) Let spacetime be filled with world lines of locally nonrotating observers, and let each 
such observer carry an orthonormal frame with himself. Show that the spatial orientations 
of these frames can be so chosen that their basis 1-forms are 

mi = l&i “ « 2 S<*l 1/2rf, > = (g< t ,t) u 2 (d<> ~ “«*). 

ut r — (p/A 1/2 ) dr , cu* = p d6 . 

More specifically, show that these 1-forms are orthonormal and that the dual basis has 

3/8? = u = 4-velocity of locally nonrotating observer. (33.22) 

Show that u = —a/* is a rotation-free field of 1-forms [dot 1 A w l = 0; exercise 4.4]. 

(b) One sometimes meets the mistaken notion that a “locally nonrotating observer” is 
in some sense locally inertial. To destroy this false impression, verify that: (i) such an observer 
has nonzero 4-acceleration, 

a = r ni e } = I ▼ In \g tt - «*g (33.23) 


(ii) if such an observer carries gyroscopes with himself, applying the necessary accelerations 
at the gyroscope centers of mass, he sees the gyroscopes precess relative to his orthonormal 
frame (33.21) with angular velocity 


tf(„recess> _ + 

2 I g u ~ » 2 gJ l/2 



] 

-^- e 4 


(33.24) 


[Hints: See exercise 19.2, equation (13.69), and associated discussions. The calculation of 
the connection coefficients is performed most easily using the methods of differential forms; 
see §14.6.] 


Exercise 33.5. LOCAL LIGHT CONES 

Calculate the shapes of the light cones depicted in the Kerr diagram for an uncharged (Q — 0) 
Kerr black hole (part II.F of Box 33.2). In particular, introduce a new time coordinate 

7EE V-r (33.25) 

for which the slices of constant? are horizontal surfaces in the Kerr diagram. Then the Kerr 
diagram plots? vertically, r radially, and $ azimuthally, while holding 9 = 77/2 (“equatorial 
slice through black hole”). 

(a) Show that the light cone emanating from given /, r, $ has the form 


(#) 

( 2 M/r _ 

1 1 rHdi/dtf 

'dP 

f 1 + 2 M/r ^ 

(1 + 2 M/r) 2 1 + 2 M/r 


(b) Show that the light cone slices through the surface of constant radius along the curves 

dr/dT = 0, dWdT = J2 mm and fl m «, (33.26b) 

where and i? max are given by expressions (33.15a,b) (minimum and maximum allowed 
angular velocities for stationary observers). 

(c) Show that at the static limit, r - r 0 (tt/ 2), the light cone is tangent to a curve of constant 
r, 0, 
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(d) Show that the light cone slices the surface of constant 5 along the curves 


cfy dr 1 - 2 M/r 

— = 0, — = — land-. 

dt dt 1 + 2 M/r 


(33.26c) 


(e) Show that the light cone is tangent to the horizon. 

(f) Make pictures of the shapes of the light cone as a function of radius. 

(g) Describe qualitatively how the light cone must look near the horizon in Boyer-Lindquist 
coordinates. (Note: it will look “crazy” because the coordinates are singular at the horizon.) 


§33.5. EQUATIONS OF MOTION FOR TEST PARTICLES 
[Carter (1968a)] 

Let a test particle with electric charge e and rest mass /it move in the external fields 
of a black hole. Were there no charge down the black hole, the test particle would 
move along a geodesic (zero 4-acceleration). But the charge produces an electromag¬ 
netic field, which in turn produces a Lorentz force on the particle: pa = eF* u. (Here 
u is the particle’s 4-velocity, and a = ▼„!/ is its 4-acceleration.) 

The geodesic equation, a — 0, for the uncharged case is equivalent to Hamilton’s 
equations 


dx^/dX = dX/dp^ dpJdX = -dX/dx*, (33.27 a) 

where X is an affine parameter so normalized that 

d/dX — p — 4-momentum, (33.27b) 

and where 

X = “super-Hamiltonian” = y g^p^p (33.27c) 

(see exercise 25.2). Similarly (see exercise 33.6) the Lorentz-force equation, fia = 
eF* u , for the charged case is equivalent to Hamilton’s equations written in terms 
of position x M and “generalized momentum” 7 ^: 

dx*/dX = dX/top dmjd\ = -dX/dx*. (33.28a) 

The form of the superhamiltonian X, in terms of the metric coefficients at the Superhamiltonian for a 

particle’s location, g^(x a ) y and the particle’s charge e and generalized momentum charged test particle in any 
m j s electromagnetic field in 

**’ curved spacetime 

- eA J(”. - eA f ). (33.28b) 


[See §7.3 of Goldstein (1959) for the analogous superhamiltonian in flat spacetime.] 
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Vector potential for a charged 
black hole 


"Constants of motion" for a 
charged test particle moving 
around a charged black hole: 


(1) "energy at infinity" E 

(2) "axial component of 
angular momentum" L z 


The first of Hamilton’s equations for this superhamiltonian reduces to 


pn = (4-momentum) = dx^/dX — — eA A (33.29a) 

(value of ^ in terms of p*, e , and A*); the second, when combined with the first, 
reduces to the Lorentz-force equation 


dp*/dX + r* a pp a pfi = eF^’p v . 

[ p v , not u v because~j_j 

X — t/ju 


(33.29b) 


For a Kerr-Newman black hole, the vector potential in Boyer-Lindquist coordi¬ 
nates can be put in the form 



Or 

-t -(dt — a sin 2 # c/<£), 

P 2 


(33.30) 


as one verifies by checking that 

dA = - A aJ} ) dx a A dx<* 


reduces to the Faraday 2-form of equation (33.5). 

There is good reason for going through all this formalism, rather than tackling 
head-on the Lorentz-force equation in its most elementary coordinate version, 


d 2 x a 
dX 2 


+ r 


a dx$ dx y 

^ dX dX 


— eF a 


dx& 

* dX ‘ 


The Hamiltonian formalism enables one to discover immediately two constants of 
the motion; the elementary Lorentz-force equation does not. The key point is that 
the components A M of A [equation (33.30)] and the components g fiV of the metric 
[inverse of g^ v of equation (33.2); see (33.35)] are independent of t and <j> 
(stationarity and axial symmetry of both the electromagnetic field and the spacetime 
geometry). Consequently, the superhamiltonian is also independent of t and <j>; and 
therefore Hamilton’s equation 


dm J dX = —3 DC/dx a 

guarantees that 7r t and are constants of the motion . 

Far from the black hole, where the vector potential vanishes and the metric 
becomes 

ds 2 = -dt 2 + dr 2 + r\dB 2 + sin 2 # d<j> 2 ), 
the constants of the motion become 

ir t =p t = ~P l = -energy, 

_ __ £ _ /projection of angular momentum\ 

m<t> ^ \ along black hole’s rotation axis / 
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Thus it is appropriate to adopt the names and notation 

E ~ (“energy at infinity”) = —nr t = —(p t + eA t ), (33.31a) 

( “axial component of angular\ 
momentum”, or simply =ir 0 = /> 0 + eA^ (33.31b) 

“angular momentum” / 

for the constants of the motion — 7r £ and 
A third constant of the motion is the particle’s rest mass 

P = Ipl = (-g^PaPp) 1 ' 2 - (33.31c) 

In general, four constants of the motion are needed to determine uniquely the 
orbit of a particle through four-dimensional spacetime. If the black hole were to 
possess an additional symmetry—e.g., if it were spherical, rather than merely axially 
symmetric—then automatically there would be a fourth constant of the motion. But 
in general, black holes are not spherical; so test-particle motion around a black hole 
possesses only three obvious constants. It is rather remarkable, then, that a constant 
turns out to exist. It was discovered by Carter (1968a), using Hamilton-Jacobi 
methods. As of 1973, nobody has given a cogent geometric explanation of why this 
fourth constant should exist—although hints of an explanation may be found in 
Carter (1968c) and Walker and Penrose (1970). 

Carter’s “fourth constant” of the motion, as derived in exercise 33.7, is 

2 =p e 2 + cos 2 0[a 2 (n 2 - E 2 ) + sir r 2 6L 2 ]. (33.3Id) 

The constant of the motion 

X = 2 + (L 8 - aE) 2 , (33.3le) 

obtained by combining 2, L z , and E, is often used in place of 2. Whereas 2 can be 
negative, % is always nonnegative: 

% = p e 2 -f ( L z — aE sin 2 0) 2 /sin 2 0 + a 2 \i 2 cos 2 6 
> 0 everywhere 

= 0 only for case of photon (ju = 0) moving along polar axis (6 = 0, 77). 

The contravariant components of the test particle’s 4-momentum, p a = dx a /d \, 
are readily expressed in terms of the constants E , L z , /a, 2 , by combining equations 
(33.31) with the metric coefficients (33.2) and the components of the vector potential 
(33.30). The result is 


p 2 d9/d\ = VO, (33.32a) 

p 2 dr/dX = V~R, (33.32b) 

p 2 d<t>/d\ = -(aE - L z /sin 2 8) + (a/A)P, (33.32c) 

p 2 dt/d\ = —a(aEsm 2 0 — L z ) + ( r 2 + a 2 )^l _1 / > . (33.32d) 


(3) rest mass ft 


(4) "£” 


Equations of motion for 
charged test particles 
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Here p 2 = r 2 + a 2 cos 2 9 as defined in equation (33.3b), and the functions 6>, R, P 
are defined by 


0 = i2 - cos 2 0 [tf 2 (/i 2 - P 2 ) + L 2 /sin 2 0], (33.33a) 

P = E(r 2 -f # 2 ) — L z a — e(>, (33.33b) 

P = P 2 - 4/x 2 r 2 + (L z - a£) 2 + g]. (33.33c) 

When working in Kerr coordinates (to avoid the coordinate singularity at the 
horizon), one must replace equations (33.32c) and (33.32d) by 

p 2 dV/d\ — —a(aE sin 2 0 — L z ) -f (r 2 -f a 2 )A~\\fR + P), (33.32c') 

p 2 d$/dk = ~(aE - L z /sin 2 0) + aA~\^/R + P). (33.32d') 

[These follow from (33.32) and the transformation between the two coordinate 
systems—see equations (4) of Box 33.2.] In the above equations, the signs of \fR 
and V& can be chosen independently; but once chosen, they must be used consis¬ 
tently everywhere. 

Applications of these equations of motion will play a key role in the rest of this 
chapter. 


Exercise 33.6. SUPERHAM ILTONI AN FOR CHARGED-PARTICLE MOTION 

Show that Hamilton’s equations (33.28a) for the Hamiltonian (33.28b) reduce to equation 
(33.29a) for the value of the generalized momentum, and to the Lorentz force equation 
(33.29b). [Hint: Use the relation (g a ^g^ y ) M = 0.] 

Exercise 33.7. HAMILTON-JACOBI DERIVATION OF EQUATIONS OF MOTION 
[Based on Carter (1968a)] 

Derive the first-order equations of motion (33.32) for a charged particle moving in the external 
fields of a Kerr-Newman black hole. Use the Hamilton-Jacobi method [Boxes 25.3 and 25.4 
of this book; also Chapter 9 of Goldstein (1959)], as follows. 

(a) Throughout the superhamiltonian 9C of equation (33.28b), replace the generalized 
momentum n a by the gradient dS/dx a of the Hamilton-Jacobi function. 

(b) Write down the Hamilton-Jacobi equation [generalization of equation (2) of Box 25.4] 
in the form 


- f - r h &] - ^-)(3 - *4 < 33 - 34a > 

(c) Show that the metric components g Q $ for a Kerr-Newman black hole in Boyer-Lind- 
quist coordinates are given by 

35 ■= - 5 * [<' ! + r, * • [£ ■ 4 0 if' 


(33.35) 
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(d) Use these metric components and the components (33.30) of the vector potential to 
bring the Hamilton-Jacobi equation (33.33) into the concrete form 


05 

8A 



1 1 
2 p 2 sin 2 0 


as 


as 


+ * 2 )^+a^- 


dt 


d<t> 


rM +asin2 ^i 2 

Lad. a/J 


eQrJ 

f 1A 
2 P 2 



±_L(dS\ 2 
2 P 2 \ dd )' 


(33.34b) 


(e) Solve this Hamilton-Jacobi equation by separation of variables. [Hint: Because the 
equation has no explicit dependence on A, or t , the solution must take the form 


5 Et + LJ + S T (r) + S e (0), 


(33.36a) 


where the values of the “integration constants” follow from dS/d\ = -X\ dS/dt = ir ( , 
dS/d<j> = Insert this assumed form into (33.35) and solve for S r (r) and S„(d) to obtain 

S T = f A' 1 VR dr, S 9 = f V& dO, (33.36b) 


where R(r) and 9(0) are the functions defined in equation (33.33). Notice that the constant 
2 arises naturally as a “separation-of-variables constant” in this procedure. It was in this 
way that Carter originally discovered £, following Misner’s suggestion that he seek analogies 
to a constant in Newtonian dipole fields (Corben and Stehle, 1960, p. 209).] 

(f)-By successively .setting -dS/d\j2_+ (L z - aE ) 2 ], dS/dp 2 , dS/dE, and dS/dL z to zero, 
obtain the following equations describing the test-particle orbits: 


r d dO _ r r dr 

J Vd~J Vr' 


> / 
-=/ 


1 a 2 cos 2 0 


r r r 2 

S+ f ~jR dr ' 


Ve J Vr 

-a(aE s\n-0 - L z ) 


\[Q 

— (aE sin 2 0 — L z ) 
sin 2 0 VO 


+ 


/ 


r (r 2 + a 2 )P 

aVr 


•+! 


aP 

aVr 


dr. 


dr, 


(33.37a) 

(33.37b) 

(33.37c) 

(33.37d) 


(g) By differentiating these equations and combining them, obtain the equations of motion 
(33.32) cited in the text. 

(h) Derive equations (33.31) for E, L z , p, and jg by setting dS/dx a = n a = p a + eA a . 


§33.6. PRINCIPAL NULL CONGRUENCES 

Two special families of photon trajectories “mold themselves into” the Kerr-Newman 
geometry in an especially harmonious way. They are called the “principal null 
congruences” of the geometry. (“Congruence” is an elegant word that means “space- 


Principal null congruences for 
the spacetime geometry of a 
black hole 
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Significance of the principal 
null congruences 


filling family of curves.”) These congruences are the solutions to the test-particle 
equations of motion (33.32) with 


fx = 0 (zero rest mass; photon). 

(33.38a) 

e — 0 (zero charge on photon), 

(33.38b) 

/ a permissible value for L z \ 


aE sin 2 # only because d9/dX turns 1, 

(33.38c) 

\out to be zero / 


& = —{L z — aE) 2 = —a 2 E 2 cos 4 #. 

(33.38d) 


For these values of the constants of motion, the equations of motion (33.32) reduce 
to 


k 6 = d9/dX = 0, (33.39a) 

k T = dr/d\ = ±lE (“ + ” for outgoing photons, (33.39b) 

for ingoing), 

k* = d<j>/d\ = aE/A, (33.39c) 

k l = dt/dX = (r 2 + a 2 )E/A. (33.39d>_ 

In what sense are these photon trajectories more interesting than others? (1) They 
mold themselves to the spacetime curvature in such a way that, if C afiy8 is the Weyl 
conformal tensor (§13.5), and *C a p y8 = ^ a ^^ v C^ v \ 8 is its dual, then 

= °. * C a wk t] kW = 0. (33.40) 

[This relationship implies that the Kerr-Newman geometry is of “Petrov-Pirani type 


D” and that these photon trajectories are “doubly degenerate, principal null con¬ 
gruences.” For details of the meanings and implications of these terms see, e.g., §8 
of Sachs (1964), or Ehlers and Kundt (1962), or the original papers by Petrov (1954, 
1969) and Pirani (1957).] (2) By suitable changes of coordinates (exercise 33.8), one 
can bring the Kerr-Newman metric into the form 

ds 2 = (n afi + 2Hk a kp) dx« dx**, (33.41) 

where H is a scalar field and k a are the components of the wave vector for one 
of the principal null congruences (either one; but not both!). [This was the property 
of the Kerr-Newman metric that led to its original discovery (Kerr, 1963). For further 
detail on metrics of this form, see Kerr and Schild (1965).] (3) In Kerr coordinates 
(Box 33.2), the ingoing principal null congruence is 

r — — EX, 9 — const, $ = const, V — const. (33.42a) 

["arbitrary normalization” 

— factor; can be removed 
by redefinition of X 
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These ingoing photon world lines are the generators of the conical surface V = const, 
in the Kerr diagram of Box 33.2. (4) The only kind of particle that can remain forever 
at the horizon is a photon with world line in the outgoing principal null congruence 
(exercise 33.9). Such photon world lines are “generators” of the horizon (dotted 
curves with a “barber-pole twist” in Kerr diagram of Box 332). They have angular 
velocity 

p _ d§ _ d<j> _ a _ a 

~ ~dt ~ ~df ~ r + 2 + a 2 “ 2M 2 - 0 2 + 2M(M 2 - a 2 - Q 2 ) 1/2 ‘ (3342b ) 


Exercise 33.8. KERR-SCHILD COORDINATES 

(a) Show that in Kerr coordinates the ingoing null congruence (33.39) has the form (33.42a). 
Also show that the covariant components of the wave vector—after changing to a new affine 
parameter X new = A old £—are 


kr D> = 0, = 0, H”* = a sin 2 6, = -1. (33.43) 

(b) Introduce new coordinates 7, jc, y, z, defined by 

x 4- iy = (r + ia)e*+ sin 9 , z = r cos 6, 7 = V - r; (33.44a) 

and show that in this “Kerr-Schild coordinate system” the metric takes the form 

ds 2 = (Vap + 2 Hk^kffi dx a dx&, (33.44b) 


where 


• Mr-\Q 2 

r 2 4- a\z/r ) 2 ’ 

k (M dx a _ r(x + 7 rfv) - a(.x 4' - / dx) zdz ^ 

a r 2 -f a 2 r 


(33.44c) 

(33.44d) 


For the transformation to analogous coordinates in which 


ds 2 = (jla/3 + 2 Hk^k^)dx a dx9. 
see, e.g., Boyer and Lindquist (1967). 


Exercise 33.9. NULL GENERATORS OF HORIZON 

(a) Show that in Kerr coordinates the outgoing principle null congruence is described by 
the tangent vector 


d6_ 

d\ 


= 0, 


dr_ 

d\ 


= E, 


^ = 2 

4 ’ 


^ 2(4*+ (33.45, 


(b) These components of the wave vector become singular at the horizon (A = 0), not 
because of a singularity in the coordinate system—the coordinates are well-behaved!—but 
because of poor normalization of the affine parameter. For each outgoing geodesic, let A 0 


EXERCISES 




904 


33. BLACK HOLES 


be a constant, defined as the value of A at the event where the geodesic slices the hypersurface 
V — 0. Then renormalize the affine parameter for each geodesic 


(£/A))^old- 


(33.46) 


Show that the resulting wave vectors 


dO 

d\ 


= 0 , 


dr 

d\~ A °' 


d$ 

d\ 


= 2a 


A ' 


dV A n 

- = 2 ^ + d>)-± 


(33.45') 


are well-behaved as one approaches the horizon; and show that the geodesics on the horizon 
have the form 


9 — const., r — r + = const., £ = 2cA, V — 2(r + 2 + tf 2 )A. (33.47) 

(c) Show that these are the only test-particle trajectories that remain forever on the horizon. 
[Hint: Examine the light cone.] 


§33.7. STORAGE AND REMOVAL OF ENERGY 
FROM BLACK HOLES [Penrose (1969)] 


When a small object falls 
down a large hole: 

(1) energy radiated is 

negligible compared to 
object's rest mass 


When an object falls into a black hole, it changes the hole’s mass, charge, and 
intrinsic angular momentum (first law of black-hole dynamics; Box 33.4). If the 
infalling object is large, its fall produces much gravitational and electromagnetic 
radiation. To calculate the radiation emitted, and the energy and angular momentum 
it carries away—which are prerequisites to any calculation of the final state of the 
black hole—is an enormously difficult task. But if the object is very small (size of 
object < size of horizon; mass of object < mass of hole), and has sufficiently small 
charge, the radiation it emits in each circuit around the hole is negligible. For 
example, for gravitational radiation 


(energy emitted per circuit) (rest mass of object) 
(rest mass of object) (mass of hole) 


(33.48) 


(2) hole's mass, charge, and 
angular momentum 
change by AM = E, 

AQ = e. AS = L z 


[see §36.5; also Bardeen, Press, and Teukolsky (1972)]. Because the energy emitted 
is negligible, radiation reaction is also negligible, and the object moves very nearly 
along a test-particle trajectory. In this case, application of the first law of black-hole 
dynamics is simple and straightforward. 

Consider, initially, a small object that falls directly into the black hole from far 
away. According to the first law, it produces the following changes in the mass, 
charge, and angular momentum of the black hole: 


AS = A 


AM = E = (“energy at infinity” of infalling object), 

AQ — e ~ (charge of infalling object), 

1^1 _ ^ _ /component of object’s angular momentum 
* Von black hole’s rotation axis 


(33.49a) 

(33.49b) 

). (33.49c) 
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The infalling object will also change the direction of S. In the black hole’s original 
asymptotic Lorentz frame, its initial angular momentum vector points in the z-direc- 
tion, 


(^initial ~ £ (^initial — = 

Consequently, only the z-component of angular momentum of the infalling object 
can produce any significant change in the magnitude of S. But the x- and ^/-com¬ 
ponents, L x and L v , can change the direction of S. If the object has negligible speed 
atirrlmity, then it produces the changes (exercise 33.10): 

AS X = L x = - (sin $ J VC - (cot 0 X cos <#>J L z , (33.49d) 

AS V = L v = (cos^JVC -Iwt'C sin<f >JL X , (33.49e) 

A(S X 2 + S 2 ) 1/2 = \f2 = (L 2 - L 2 ) l/2 . (33.49f) 

Here a subscript “oo” means the value of a quantity at a point on the orbit far 
from the black hole (at “infinity”). 

Consider, next, a more complicated process, first conceived of by Penrose (1969): 
(1) Shoot a small object A into the black hole from outside with energy-at-infinity 
E a , charge e A , and axial component of angular momentum L zA . (2) When the object 
is deep down near the horizon, let it explode into two parts, B and C, each of which 
subsequently moves along a new test-particle trajectory, with new constants of the 
motion e B and e c , E B and E c , L xB and L zC . (3) So design the explosion that object 
B falls down the hole and gets captured, but object C escapes back to radial infinity. 
What will be the changes in mass, charge, and angular momentum of the black 
hole? According to the first law of black-hole dynamics, 

( total energy that distant observers see) 
fall inward past themselves minus ) 
total energy that they see reemerge / 

— ^a ~ E c . 

Similarly, AQ = e A — e c and AS = L zA - L zC . Not unexpectedly, these changes can 
be written more simply in terms of the constants of motion for object B, which went 
down the hole. View the explosion “A — B + C” in a local Lorentz frame down 
near the hole, which is centered on the explosive event. As viewed in that frame, 
the explosion must satisfy the special relativistic laws of physics (equivalence princi¬ 
ple!). In particular, it must obey charge conservation 

e A = e B + e c 


and conservation of total 4-momentum 

(P.4 immediately before explosion = (Pb + Pc limmediately after explosion' 


(33.50a) 
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Changes in M, Q, S for any 
nonradiative black-hole 
process 


Extraction of energy from a 
black hole by processes in 
the ergosphere 


Moreover, conservation of 4-momentum p and charge e implies also conservation 
of generalized momentum n =p — eA, 

n A — Pa ~ e A A =Pb+ Pc~ ( e B + e c) A = n B + 

and hence also conservation of the components of generalized momentum along 
the vectors d/dt and d/d<j> 9 


E a — 7T tA — — 77 tc — E b + E c , (33.50b) 

Am — < 77 0 S "h = L zB + L zC , (33.50c) 

(conservation of “energy-at-infinity” and “axial component of angular momentum” 
in explosion). Combining these conservation laws with the expressions 

AM= E a - E c , AQ = e A ~ e c , and AS = L zA - L zC , 

one obtains 

AM = E b , AQ = AS = L zB . (33.51) 

This result restated in words: the changes in mass, charge, and angular momentum 
are equal to the “energy-at-infinity,” charge, and “axial component of angular 
momentum” that object B carries inward across the horizon, even though B may 
have ended up on a test-particle orbit that does not extend back to radial infinity! 

Straightforward extensions of the above thought experiment produce this general¬ 
ization: In any complicated black-hole process that involves infalling , colliding, and 
exploding pieces of matter that emit negligible gravitational radiation, the total changes 
in mass, charge, and angular momentum of the black hole are 


( sum of values of energy-at-infinity, E , ^ 

for all objects which cross the horizon—with 
E evaluated for each object at event of crossing 



similar sum, of charges, e , for\ 
all objects crossing horizon /’ 

similar sum of axial components of angular 
momentum, L z9 for all objects crossing horizon 


)■ 


(33.52a) 

(33.52b) 

(33.52c) 


This result is not at all surprising. It is precisely what one might expect from the 
most naive of viewpoints. Not so expected, however, is the following consequence 
[Penrose (1969)]: By injecting matter into a black hole in a carefully chosen way, one 
can decrease the total mass-energy of the black hole — i.e., one can extract energy from 
the hole . 

For uncharged infalling objects, the key to energy extraction is the ergosphere 
[hence its name, coined by Ruffini and Wheeler (1971a) from the Greek word “epyor” 
for “work”]. Outside the ergosphere, the Killing vector £ {t) = d/dt is timelike, as 
is the 4-momentum p of every test particle; and therefore E = — p • f (t) is necessarily 
positive. But inside the ergosphere (between the horizon and the static limit), f (t) 
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is spacelike, so for certain choices of timelike momentum vector (certain orbits of 
uncharged test particles), E = — p • f (t) is negative, whereas for others it is positive. 
The orbits of negative E are confined entirely to the ergosphere. Thus, to inject an 
uncharged object with negative E into the black hole—and thereby to extract energy 
from the hole—one must always change its E from positive to negative and therefore 
also change its orbit, after it penetrates into the ergosphere. Of course, this is not 
difficult in principle—and perhaps not even in practice; see Figure 33.2. 

For a charged object, electromagnetic forces alter the region where there exist 
orhitsof negative energy-at-infinity. If the charges of object and hole have opposite 
sign, then the hole’s electromagnetic field pulls inward on the object, giving it more 
kinetic energy when near the hole than one would otherwise expect. Thus, —p • ( {t) 
becomes an overestimate of E , 

E = -(P - eA ) • f (0 = -P • f«> + eQr/p 2 ; (33.53) 

L[< o if eQ < o] 

-and orbits with E < 0 exist in a region somewhat larger than the ergosphere. If, 
on the other handr^and Q have the same sign, then orbits with E < 0 are confined 
to a region smaller than the ergosphere. For given values e , Q , and rest mass jit, 
the region where there exist orbits with E < 0 is called the “effective ergosphere. ” 


Exercise 33.10. ANGULAR MOMENTUM VECTOR FOR INFALLING PARTICLE 

Derive equations (33.49d,e,f) for the components L x and L y of the orbital angular momentum 
of a particle falling into a black hole. Assume negligible initial speed, E 2 — \ u. 2 ~ 0. 


§33.8. REVERSIBLE AND IRREVERSIBLE TRANSFORMATIONS 
[Christodoulou (1970), Christodoulou and Ruffini (1971)] 

Take a black hole of given mass M , charge Q, and angular momentum S. By injection 
of small objects, make a variety of changes in M, Q , and S. Can one pick an arbitrary 
desired change, AM, AQ, and AS, and devise a process that achieves it? Or are there 
limitations? 

The second law of black-hole dynamics (nondecreasing surface area of black hole; 
Box 33.4; proof in §34.5 of next chapter) provides a strict limitation. 

Then can all values within that limitation be achieved—and can that limitation 
be discovered by a direct examination of test-particle orbits? 

The answer is yes; and, in fact, the limitation was discovered by Christodoulou 
(1970) and Christodoulou and Ruffini (1971) from an examination of test-particle 
orbits, independently of and simultaneously with Hawking’s (1971) discovery of the 
second law of black-hole dynamics. 


The "effective ergosphere" 
for charged-particle processes 


EXERCISE 
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Figure 33.2. 

An advanced civilization has constructed a rigid framework around a black hole, and has built a huge 
city on that framework. Each day trucks carry one million tons of garbage out of the city to the garbage 
dump. At the dump the garbage is shoveled into shuttle vehicles which are then, one after another, 
dropped toward the center of the black hole. Dragging of inertial frames whips each shuttle vehicle 
into a circling, inward-spiraling orbit near the horizon. When it reaches a certain “ejection point,** the 
vehicle ejects its load of garbage into an orbit of negative energy-at-infinity, £ garbage < 0. As the garbage 
flies down the hole, changing the hole’s total mass-energy by AM = E^ e ejected < 0, the shuttle vehicle 
recoils from the ejection and goes flying back out with more energy-at-infinity than it took down 

E — F — F 

vehicle out ^ vehicle + garbage down garbage ejected 

S F 

^ vehicle + garbage down* 

The vehicle deposits its huge kinetic energy in a giant flywheel adjacent to the garbage dump; and the 
flywheel turns a generator, producing electricity for the city, while the shuttle vehicle goes back for another 
load of garbage. The total electrical energy generated with each round trip of the shuttle vehicle is 

(Energy per trip) = £ vehicle out - (rest mass of vehicle) 

^vehicle + garbage down ^garbage ejected) (fCSt mass of Vehicle) 

= (rest mass of vehicle + rest mass of garbage - AM) - (rest mass of vehicle) 

= (rest mass of garbage) + (amount, — AM , by which hole’s mass decreases). 

Thus, not only can the inhabitants of the city use the black hole to convert the entire rest mass of then- 
garbage into kinetic energy of the vehicle, and thence into electrical power, but they can also convert 
some of the mass of the black hole into electrical power! 
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To derive the limitation of nondecreasing surface area from properties of test- 
particle orbits, one must examine what values of energy-at-infinity, E, are allowed 
at a given location (r, 0) outside a black hole. Equations (33.32a,b), when combined, 
yield the value of E in terms of a test particle’s location (r, 6), rest mass ju, charge 
e, axial component of angular momentum L z , and momenta p T = dr/d\,p' 
in the r and 6 directions: 

aE 2 - 2pE + y = 0; £ = ft + Vft 2 - a Y , 

a 

where 

a = (r 2 + a 2 ) 2 — Aa 2 sin 2 # > 0 everywhere outside horizon, 
ft = (L z a + eQr)(r 2 + a 2 ) - L z a A, 

J = (L x a + eQr) 2 - A(L Z / sin 9) 2 - p 2 Ap 2 - p 4 [(p r ) 2 + A{p e ) 2 }. 

(One must take the positive square root, + Vft 2 - ay, rather than the negative 
square root; positive square root corresponds to 4-momentum pointing toward 
future; while negative square root corresponds to past-pointing 4-momentum; see 
Figure 33.3.) 

Several features of the energy equation (33.54) are noteworthy. (1) For orbits in 
the equatorial “plane,” 9 — kt/ 2 and p e = 0, the energy equation yields an effective 
potential for radial motion (Box 33.5). (2) Orbits of negative E must have /? < 0 
and y > 0—which can be achieved only if L z a < 0 and/or eQ < 0. Thus, one cannot 
decrease the mass of a black hole without simultaneously decreasing the magnitude 
of its charge or angular momentum or both. (3) For an orbit at given (r, 9), with given 
e and L z , £ is a minimum if p T = p e = p = 0. Put differently, the rest mass and the 
r- and 9-components of momentum always contribute positively to E. 

By injecting an object into a black hole, produce small changes 

SM = E, 8Q = e, 8S = L z , 

in its mass, charge, and angular momentum. For given changes in Q and S, what 
range of changes in M is possible? Clearly 8M can be made as large as one wishes 
by making the rest mass p sufficiently large. But there will be a lower limit on SM. 
That limit corresponds to the minimum value of E for given e and L z . The orbit 
of minimum E crosses the horizon (otherwise no changes in M, Q, S would occur!), 
so one can evaluate E there. At the horizon, as anywhere, a minimum for E is 
achieved if p = p T — p 6 = 0. Inserting these values and r = r + (so A = 0) into 
equations (33.54), one finds 


= d9/dX 

(33.54a) 

(33.54b) 

(33.54c) 

(33.54d) 


L z a + eQr + 
r + 2 + a 2 


Properties of test-particle 
orbits: 


(1) £ as function of ju, e, L z , 
r, 6, p r 


(2) effective potential 

(3) negative £ requires 

L z a < 0 and/or eQ < 0 


Changes in black-hole 
properties due to injection of 
particles: 


(33.55) 
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Figure 33.3. 

Energy-a t-infinity E allowed for a particle of angular momentum L z and rest mass /a, which is (1) in 
the “equatorial plane” 8 = it/ 2, (2) at radius r — 3A//2, (3) of an uncharged (£) = 0) extreme-Kerr 
(S = A/ 2 ) black hole. E is here plotted against L s . “Seas” of “positive and negative root” states are 
shown. The positive root states have energies-at-infinity given by equations (33.54) 

P 4- VP 2 — ay 

E — - 

a 

and have 4-momentum vectors pointing into the future light cone. The negative root states (states of 
Dirac’s “negative energy sea”) have energies at infinity given by 

P - \ f P 2 - ay 

E =-, 

a 

and have 4-momentum vectors pointing into the past light cone. In the gap between the “seas” no orbits 
exist (forbidden region). The gap vanishes at the horizon r = M (infinite redshift of local energy gap, 
2 /a, gives zero gap in energy-at-infinity). [Figure adapted from Christodoulou (1971).] 


corresponding to changes in the black-hole properties of 

(1) limit on 8M for given SQ a 8S -f- r Q 8Q /absolute minimum value of\ 

and55 - o " — Q --■ . ). (33.56) 

r + 2 4 - a- \8M for given 8S and 8Q ) J 

Notice an important consequence [Bardeen (1970a)]: if the black hole is initially 
of the “extreme Kerr-Newman” variety, with M 2 = a 2 + Q 2 , so that one might fear 
a change which makes M 2 < a 2 + Q 2 and thereby destroys the horizon, one’s fears 
are unfounded. Equation (33.56) then demands (since r + — M and S = Ma) 


(2) preservation of the 
horizon 


M8M> aSa+ Q 8Q; 

so M 2 remains greater than or equal to a 2 + Q 2 , and the horizon is preserved. 
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Box 33.5 ORBITS OF TEST PARTICLE IN “EQUATORIAL PLANE" 

OF KERR-NEWMAN BLACK HOLE 

Radial motion is governed by energy equation (33.54) with 6 - p e = 0: 

i _ ft + V/8 2 — ay 0 + ar*(p T ) 2 


aE 2 - 2pE + Y 0 - r\p')* = 0; E = 


0 ) 


a « A Yo are functions of r and of constants of motion, 

a = (r 2 + a 2 ) 2 - Aa 2 > 0, (2a) 

ft = (E z a + eQr)(r 2 + a 2 ) — L z aA, (2 b) 

Yo = + eQr) 2 - AL 2 - ^r 2 A; (2c ) 

p T = (radial momentum) is 

p T = dr/dk. ( 3 ( 

Thus, equation (1) is an ordinary differential equation for dr/d\. 

Qualitative features of the radial motion can be read off an effective-potential 
diagram. The effective potential V(r) is the minimum allowed value of £ at radius r. 

V(r) = — — ^ft~ ~ a Yo 
a 

As in the Schwarzschild case (Figure 25.2), the allowed regions for a particle of 

energy-at-mfimty £ are the regions with V(r) < £; and the turning points (a- = 
ar/dX = 0) occur where V(r) = E. r 

Stable circular orbits occur at the minima of V{r). By examining V(r) closely, one 
finds that for uncharged black holes the innermost stable circular orbit (most tightly 

Whe n ekr r ( b 1971b)] the CharaCteristicS here tabuIated f tabl e adapted from Ruffini and 


Extreme Kerr 


Charac¬ 

teristic 

Newtonian 
{Figure 25.2 ) 

Schwarzschild 
{a = Q = 0) 

(Figure 25.2 ) 

{3 = M~, Q — 0) 

(see figure) 

[Bardeen [1970a)] 
if La > 0 if L,a < 0 

r/M 

0 

6 

1 

9 

E/\i 

— 00 

2V2/3 

I/V3 

5/(3 V3) 

(n - £)//x = 
“fractional 
binding” 

*f oo 

0.0572 

0.4226 

0.0377 


0 

2V3 

2 / v / 3 

22/(3 V3) 
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Box 33.5 ( continued) 




For a charged extreme Kerr-Newman black hole ( M 2 = Q 2 + a 2 , Q ^ 0 and a^O) 
stable circular orbits with 100 per cent binding (E = 0) are achieved in the limit 

7 - ” -00 ’ and 

[Christodoulou and Ruffini (1971)]. 

The effective potential for an uncharged, extreme Kerr black hole (a = M) is 
shown in the figure [figure adapted from Ruffini and Wheeler (1971b)]. For detailed 
diagrams of orbits in the equatorial plane, see de Felice (1968). For many interesting 
properties of orbits that are not confined to the equatorial plane, see Wilkins (1972). 
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The general limit (33.56) on the change in mass can be rewritten in an alternative 
form [Christodoulou (1970), Christodoulou and Ruffini (1971)]: 

SM ir > 0, (33.57) 

where 


Af ir = y y/rf + a 2 - y[(M + \/M 2 - Q 2 - a 2 ) 2 + a 2 ] 1/2 (33.58)'' (3) irreducible mass 

is the “irreducible mass” of the black hole. Equation ( 33.57) states that no black-hole 
transformation produced by the injection of small lumps of matter can ever reduce 
the irreducible mass of a black hole. This result is actually a special case of the second 
law of black-hole dynamics, since the surface area of a black hole is 

A = 1677- M lr 2 (33.59) 

(Exercise 33.12). 

Equation (33.58) can be combined with a = S/M and inverted to yield 


= U + SLY + _£L_. (33e0) 

\ 4 Mj 4 ’ 

rotational con- ] 

_tribution to mass] 


Irreducible con¬ 
tribution to mass. 


JL 


electromagnetic con- 1 
.tribution to mass 


A black-hole transformation that holds fixed the irreducible mass is reversible; (4) reversible and irreversible 
one that increases it is irreversible. The derivation of equation (33.56) revealed that transformations 
the only injection processes that actually achieve the minimum possible value for 
<5A/(and thus make $M iv ~ 0) are those with /x = p T = p e = 0 at the horizon, r = r + . 

Restated in words: To produce a reversible transformation by injecting an object 
into a black hole, one must (1) give the object a rest mass /x extremely small compared 
to its charge e or axial component of angular momentum L z , 


li/e < 1 and/or n 2 /L z 2 < 1; 


and (2) set the object down “extremely gently” ( p r = p e - 0), extremely close to 
the horizon (r = r + ). This does not sound too difficult until one recalls that objects 
with p T - p 8 - 0 at the horizon must be moving outward with the speed of light, 
and that the nearer one approaches the horizon as one sets down the object, the 
greater one’s danger of “slipping” and getting swallowed! 

Clearly, any actual injection process will depart somewhat from irreversibility. 
Reversibility is an idealized limit, approachable but not attainable. 
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EXERCISES Exercise 33.11. IRREDUCIBLE MASS IS IRREDUCIBLE 

Show that condition (33.56) is equivalent to 8M ir > 0. 

Exercise 33.12. SURFACE AREA OF A BLACK HOLE 

Show that the surface area of the horizon of the Kerr-Newman geometry [area of surface 
r = r + and t = const (Boyer-Lindquist coordinates) or K = const (Kerr coordinates)] is 

16*A4 2 . 


Exercise 33.13. ANGULAR VELOCITY OF A BLACK HOLE 

A general theorem [Hartle (1970) for relativistic case; Ostriker and Gunn (1969) for nonrela- 
tivistic case] says that, if one injects angular momentum into a rotating star while holding 
fixed all other contributions to its total mass-energy (contributions from entropy and from 
baryonic rest mass), then the injection produces a change in total mass-energy given by 

. /angular velocity of star\ . 

5(mass-energy) = I ... ) 5(an2ular momentum). (33.61) 

Vat point of injection / v 1 

By analogy, if one injects an angular momentum 8S into a rotating black hole while holding 
fixed all other contributions to its total mass-energy (contributions from irreducible mass 
and from charge), one identifies the coefficient Q h in the equation 

8M = Q h 8S 


as the angular velocity of the hole; 


S? h = (dM/dS) QtMi . 

(a) Show that the angular velocity of a black hole is equal to 


(33.62) 


Q 


h 


a 

r + 2 + a 2 ' 


(33.63) 


Notice that this is precisely the angular velocity of photons that live forever on the horizon 
[equation (33.42b); “barber-pole twist” of null generators of horizon]. 

(b) Show that any object falling into a black hole acquires an angular velocity (relative 
to Boyer-Lindquist coordinates) of i? = d§/dt — Q h in the late stages, as it approaches the 
horizon. (Recall that the horizon is a singularity of the Boyer-Lindquist coordinates. This 
is the reason that every object, regardless of its L z , E, e , ju, i2, can approach and does approach 
Q = Q h .) 


Exercise 33.14. SEPARATION OF VARIABLES FOR WAVE EQUATIONS 

This chapter has studied extensively the motion of small objects in the external fields of 
black holes. Of almost equal importance, but not so well-understood yet because of its 
complexity, is the evolution of weak electromagnetic and gravitational perturbations 
(“waves”) in the Kerr-Newman geometry. Just as one had no a priori reason to expect a 
“fourth constant” for test-particle motion in the Kerr-Newman geometry, so one had no 
reason to expect separability for Maxwell’s equations, or for the wave equations describing 
gravitational perturbations—or even for the scalar wave equation ~ — i£ a a = 0. Thus 
it came as a great surprise when Carter (1968c) proved separability for the scalar wave 
equation, and later when Teukolsky (1972, 1973) separated both Maxwell’s equations and 
the wave equations for gravitational perturbations. 
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Show that separation of variables for the scalar-wave equation in the (uncharged) Ken- 
geometry yields solutions of the form 

^ = (r 2 + a 2 )~ 1/2 u tm (r)S ml (—icoa y cos 6)e iim<t> ~ (33.64a) 

where m and i are integers with 0 < \m\ < X; S mt is a spheroidal harmonic [see Meixner 
and Scharfke (1954)]; and u lm satisfies the differential equation 

— d 2 u/dr* 2 + Vu = 0. (33.64b) 

In order to put the equation in this form, define a Regge-Wheeler (1957) “tortoise”-type 
radial coordinate r* by 


dr* = A x (r 2 + a 2 ) dr , 

and find an effective potential V(r*) given by 

V = - (« - 7 + l( m - “«) 2 + 2](r 2 + a 2 )' 2 A 

+ 2 (Mr - a 2 )(r 2 + a 2 )- 3 A + 3 a\r 2 + a 2 )-* A 2 . 


(33.64c) 


(33.64d) 


In this radial equation 2. is a constant (analog of Carter’s constant for particle motion), 
given in terms of m and X by 


g = \ — \ = [eigenfunction of spheroidal harmonic; 

Lsee Meixner and Scharfke (1954) 


(33.64e) 


[These details of the separated solution were derived by Brill et al (1972). For studies of 
the interaction between fields and Kerr black holes—studies performed using the above 
solution, and using analogous solutions to the electromagnetic and gravitational wave 
equations—see Bardeen, Press, and Teukolsky (1972), Misner (1972b), Teukolsky (1972), Ipser 
(1971), Press and Teukolsky (1973), and Chrzanowski and Misner (1973).] 



r 


CHAPTER 


34 


GLOBAL TECHNIQUES, HORIZONS, 
AND SINGULARITY THEOREMS 
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This chapter is entirely 
Track 2. §22.5 (geometric 
optics) and the Track-2 portions 
of Chapters 32 and 33 
(collapse and black holes) are 
necessary preparation for it. It 
is not needed as preparation 
for any later chapter. 


Local techniques of analyzing 
spacetime physics contrasted 
with global techniques 


§34.1. GLOBAL TECHNIQUES VERSUS LOCAL TECHNIQUES 

Until the 1960 s, computations in gravitation theory used local techniques almost 
exclusively: the Einstein field equation describes how the stress-energy tensor T at 
a given event generates curvature G at that same event (local physics). When reduced 
to differential equations for the metric coefficients, G = SttT relates g a09 dg^/dx^, 
and at each given event to T yS at that same event (local equation). 

The solution of these differential equations is effected, on a computer or in any 
initial-value-type analysis, by integrating forward in time from event to event to 
event (local integration). The nongravitational laws of physics are obtained by 
invoking the equivalence principle in a local Lorentz frame at each individual event 
in spacetime. To build up an understanding of the global structure of spacetime, 
one performs local computations near each event, and then patches the local results 
together to form a global picture. Why this great reliance on local analyses? Because 
the laws of gravitation physics take on particularly simple forms when stated locally. 

That gravitation physics is also subject to powerful and simple global laws, physi¬ 
cists did not realize until the mid 1960’s. But since 1963, studies of black holes and 
of singularities have revealed global laws and global properties of spacetime that 
rival in their simplicity and elegance even the (local) equivalence principle. An 
example is the second law of black-hole dynamics: “In an isolated system, the sum 
of the surface areas of all black holes can never decrease.” As a result, there has 
developed a powerful body of knowledge and techniques for analyzing directly the 
global properties of spacetime. 

To give a full treatment of global techniques would require many chapters. 
Fortunately, a full treatment is being published, almost simultaneously with this 
book, by Hawking and Ellis (1973). Because Hawking and Ellis are much better 
qualified to write on this subject than are we (Misner, Thome, and Wheeler), we 
have chosen to not write a “competitive” treatment. Instead, we give in this chapter 
only a brief taste of the subject—enough of a taste to make the reader acquainted 
with the types of techniques involved and several of the most important results, but 
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not enough to give him a working knowledge of the subject. The topics we have 
chosen to treat are those that contact most closely the rest of this book: properties 
of “infinity” in an asymptotically flat spacetime (§34.2); causality and horizons 
(§§34.3 and 34.4); a proof of the second law of black-hole dynamics (§34.5); and 
theorems about the evolution of singularities in spacetime (§34.6). For greater detail 
on global techniques, one can consult not only the book of Hawking and Ellis (1973), 
but also review articles by Geroch (1971), by Penrose (1968a, 1972), and by Hawking 
(1973), the thesis of Godfrey (1970b), and the more specialized papers cited in the 
body of this chapter. 

—§34t^-^H^NITY'' in asymptotically flat spacetimes 

When performing calculations in asymptotically flat spacetime, one often must 
examine the asymptotic forms of fields (e.g., the metric, or the curvature tensor, or 
the electromagnetic field) “at infinity.” For example, the mass and angular momen¬ 
tum of an isolated system are determined by the asymptotic form of the metric 
(Chapter 19). It is rarely sufficient to examine asymptotic forms near “spatial infin¬ 
ity.” For example, if one wishes to learn how much mass was carried away by 
gravitational and electromagnetic waves during a supernova explosion, one must 
examine the asymptotic form of the metric not just at “spatial infinity,” but at 
“future null infinity” (see Figure 34.1). 

Penrose (1964, 1965a) has developed a powerful body of mathematical technique 
for studying asymptotic properties of spacetime near “infinity.” The key to his 
technique is a “conformal transformation” of spacetime, which brings “infinity” in 
to a finite radius and thereby converts asymptotic calculations into calculations at 
“finite points.” Penrose’s technique also provides rigorous definitions of several types 
of “infinity” that one encounters in asymptotically flat spacetimes. 

The details of Penrose’s technique are not of importance to the rest of this chapter. 
However, this chapter will refer frequently to the various types of “infinity” defined 
by Penrose. In heuristic terms, they are as follows (see Figure 34.2a). 

/ + = “future timelike infinity”: 

the region t —+ oo at finite radius r 
(region toward which timelike lines extend). 

/” = “past timelike infinity”: 

the region t —— oo at finite radius r 
(region from which timelike lines come). 

1° = “spacelike infinity”: 

the region r —oo at finite time t 
(region toward which spacelike slices extend). 

= “future null infinity”: 

the region t + r —*- oo at finite t — r 
(region toward which outgoing null lines extend). 

<f~ ~ “past null infinity”: 

the region t — r —— oo at finite t + r 
(region from which ingoing null lines come). 

Note: <f is a script /, and is sometimes given the name “scri.” 


References on global 
techniques 


Motivation for studying 
properties of spacetime 
infinity 


Specific regions of infinity: 

/+ /°, r, 




Coordinate diagrams for 
exhibiting structure of infinity 



Figure 34.1. 

Measurement of the mass-energy radiated as gravitational and electromagnetic 
waves by a supernova explosion in asymptotically flat spacetime. The mass-energy 
radiated equals the mass (A/ before ) of the presupernova star, minus the mass (after) 
of the neutron star and nebula after the explosion: 

^radiated ^before ““ ^after’ 

To measure one can examine the asymptotic form (in suitable coordinates) 

of £ 00 at spatial infinity 

« 1 i ^ ^before , ^ ^ \ 

goo = — 1 H-f O (—J as r —oo, t — constant. 


But to measure in the same way, one must wait, at any fixed r, until the 

radiation has flowed entirely past that point: 


goo ~ — 1 + 


Rafter 



as r 


oo with t — r 


( constant value sufficiently large 
to be inside the burst of waves, 


Put differently, to measure M alb&r one must examine the asymptotic form of g 00 
not at “spatial infinity,” but rather at “future null infinity.” 


It is often useful, in visualizing the asymptotic structure of spacetime, to introduce 
coordinates that attribute finite coordinate values to infinity. For example, in flat 
spacetime one can transform from the usual spherical coordinates t, r, 6 , <J>, with 

ds 2 = ~dt 2 + dr 2 + r 2 (d6 2 + sin 2 0 </<J> 2 ), (34.1) 


to new spherical coordinates \p, £, 6 , <f>, with 
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/ / + 




Figure 34.2. 

Flat, “Minkowski” spacetime as depicted (a) in the usual spherical coordinates t , r, 0, <j> of a global 
Lorentz frame, and (b) in the spherical coordinates of equations (34.2). The five regions of infinity—/+, 
7~» 7°, y + , are shown in each coordinate diagram. In both coordinate systems, radial null lines 
make angles of 45 with the vertical axis, and nonradial null lines make angles less than 45° [see equations 
(34.1) and (34.2c)]. See exercise 34.1 for further detail. 


r + r= tanyW/+ £), (34.2a) 

t - r = tan y (^ - £), (34.2b) 

„ -dxb 2 + d£ 2 

ds 2 = -—--- — -+ r 2 {dd 2 + sin 2 0 d$ 2 ). (34.2c) 

4 cos 2 yO|/ + i) cos 2 y (4> - i) 

The resulting tp, £ coordinate diagram (Figure 34.2b) depicts /+, I~, 1°, 3 + , 3~ more 
clearly than does the usual t, r, coordinate diagram. 

As another example, replace the Kruskal-Szekeres coordinates v, u, 6, <J> for 
Schwarzschild spacetime by new coordinates \p, £, 6, <p: 


v + u = tany(i^ + |), 
v - u = tan-kty - £), 


(1 - r/2M)e r/2M = v 2 - u 2 = tan^-ty + £) tan-k* - £), 


ds 2 = 


32 M 3 e~ r/2M (—d\f/ 2 + d£ 2 ) 


4 cos 2 -(xp + £) cos 2 —-(^ - i) 


+ r 2 (d6 2 + sin 2 # d<p 2 ). 


(34.3a) 
(34.3b) 
(34.3 c) 
(34.3d) 


The resulting coordinate diagram (Figure 34.3) depicts clearly the causal connections 
between the horizons, the singularities, and the various regions of infinity. 
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* 


/+ r — 0 singularity /+ 



Figure 34.3. 

Schwarzschild spacetime as depicted in the £, 0, 6 
coordinates of equations (34.3). This coordinate diagram 
should be compared with the Kruskal-Szekeres coordi¬ 
nate diagram (Figure 31.3). In both diagrams, radial null 
geodesics are 45° lines. Each of the asymptotically flat 
regions (one on each side of the ‘‘wormhole” of Figure 
31.5a) has its own set of infinities 7 + , 7“, 7°, and 
See exercise 34.2 for justification of this diagram. 


EXERCISES Exercise 34.1. FLAT SPACETIME IN £, 9, $ COORDINATES 

(a) Derive equation (34.2c) from (34.1) and (34.2a,b). 

(b) Show that the regions 7 + , /“, 7°, i + , and of flat spacetime are located at 


7 + : 



= 0 , 




7~: 


-IT 

4 = 0, 




7°: 

+ = 

o,4 

— 77, 






4 = 

n, -77 < 

*- 

4 

< 77, 


4>- 

4 = 

— 77, —77 

< * 

+ 

4 < 


(34.4) 


[see equations (34.2)]. These are the regions depicted in Figure 34.2. 

(c) Show that in flat spacetime, in a £ coordinate diagram (Figure 34.2), radial null 
lines make angles of 45° with the vertical axis, and nonradial null lines make angles of less 
than 45 0 . 


Exercise 34.2. SCHWARZSCHILD SPACETIME IN if,, £, 9. <j> COORDINATES 

(a) Derive equations (34.3c,d) from (34.3a,b) and the Kruskal-Szekeres equations (31.14). 

(b) Use equations (34.3) to justify the precise form of the coordinate diagram in Fig¬ 
ure 34.3. 

Exercise 34.3. REISSNER-NORDSTROM SPACETIME 

(a) Show that there exists a coordinate system in which the Reissner-Nordstrom geometry 
with 0 < \Q\ < M (exercises 31.8 and 32.1) has the form 


ds 2 = F 2 ( — d\p 2 + dt 2 ) + r 2 (d6 2 + sin 2 # d$ 2 ), 
F = F(i//, £), r - r(& £), 


(34.5) 


and in which the horizons and infinities are as shown in Figure 34.4. [Note: This is a very 
difficult exercise unless one has in hand the solution to exercise 31.8(d). For solution, see 
Carter (1966b).] 

(b) Use Figure 34.4 to deduce that the Reissner-Nordstrom geometry describes a “worm- 
hole” or bridge, connecting two asymptotically flat spacetimes, which: (i) expands to a state 
of maximum circumference; (ii) recontracts toward a state of minimum circumference, and 
in the process disconnects its outer regions from the two 7°’s (spatial infinity) and reconnects 
them to a pair of r — 0 singularities; (iii) bounces; (iv) reexpands, and in the process 
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Figure 34.4. 

Reissner-Nordstrom spacetime 


ds 2 


--(■-“♦a 


r 

dr 2 


dt 2 


1 - 2 M/r + Q 2 /r 2 


+ r 2 (d8 2 + sm 2 8 d<$> 2 ) 


with 0 < \Q\ < M, as depicted in a new (^, £, 
8, <£) coordinate system where the line element 
has the form 


ds 2 = F2(~^ 2 + dt 2 ) + r\d8 2 + sin 2 8 d<f> 2 ). 

(see exercise 34.3.) This coordinate diagram re¬ 
peals the global structure of the geometry, in¬ 
cluding its singularities at r = 0, its horizons at 
r = r + = M + VA /2 _ Q 2 

(which limit communication with and i“), the 
null surfaces at 

r = = M - y/M 2 - Q 2 

(which limit communication with the singulari¬ 
ties), and the various asymptotically flat infinities, 
/ + , I~, 7°, Y + , and From this diagram one 
can read off the “causal structure” of the geome¬ 
try—i.e., the abilities of various regions to com¬ 
municate with each other. For detailed discussion 
of the geometry, see Graves and Brill (1960) 
and Carter (1966b). For discussions of collaps¬ 
ing charged stars, for which this geometry is 
the external gravitational field, see Novikov 
(1966a,b), de la Cruz and Israel (1967), and Bar¬ 
deen (1968). 


disconnects its “outer regions” from the two singularities and reconnects them to a pair of 
7° s in two new asymptotically flat universes; (v) slows its expansion to a halt; 
(vi) recontracts toward a state of minimum circumference, and in the process disconnects 
its outer regions from the two 7°’s and reconnects them to a new pair of r = 0 singularities; 
etc. ad infinitum. 
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Note: < (H y if i i; 7 , 

f < eP <3 
but i? £ J + (f) 


Note: «S is a spacelike slice which 

extends from /° in to r = 2M, 
but does not include r = 2M. 
J + {S) does not include the 
leftmost horizon; but y + ( t S) 
is the leftmost horizon. 


(a) (b) 

Figure 34,5. 

Spacetime diagrams illustrating various causal relationships. Diagram (a) is a hypo¬ 
thetical spacetime; diagram (b) is Schwarzschild spacetime (see Figure 34.3). In 
both diagrams, null lines have slopes of 45°. 


§34.3. CAUSALITY AND HORIZONS 


Restriction of discussion to 
asymptotically flat, 
time-oriented manifolds 


Turn now to global* techniques for analyzing black holes. The goals of the discussion 
will be (1) to define the concept of horizon (this section), (2) to deduce global 
geometric properties of horizons (next section), and (3) to prove the second law of 
black-hole dynamics (following section). The entire discussion will be confined to 
spacetime manifolds that (1) contain at least one asymptotically flat region (“the 
external universe”; region “outside black holes”), and (2) are “time-oriented.” By 
“ time-oriented ” one means that at each event in spacetime a distinct choice has been 
made as to which light cone is the future cone and which is the past, and moreover 
that this choice is continuous from event to event throughout spacetime. 

The discussion begins with definitions of a variety of causal relationships between 
events and regions of spacetime (see Figure 34.5). 


Definitions of several 
causality concepts 


Definition: 9 < 2 or equivalently 2 > 9 (“the event 9 precedes the event 2”; 

“the event 2 follows the event 9") means that there exists at least 
one smooth, future-directed timelike curve that extends from 9 to 2. 

Definition: A causal curve (?(\) is any smooth curve that is nowhere spacelike—i.e., 
that is timelike or null or “zero” [<?(\) = some fixed 9, for all X] or 
some admixture thereof. 

Definition: 9 < 2 or equivalently 2 > 9 (“the event 9 causally precedes the event 

‘Global, but not fully global; the “universe” of §§34.3-34.5 is asymptotically flat; no account is taken 
here of possible closure or collapse of the universe or of their consequences. 
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2"; the event 2 causally follows the event 9”) means that there exists 
at least one future-directed causal curve that extends from 9 to 2. 

Definition: J~(9), called the causal past of 9, is the set of all events that causally 
precede 9— i.e., J~(9) = {2\2 < 9). 

Definition: J + (9), called the causal future of 9, is the set of all events that causally 
follow 9— i.e., J + (9 ) = { 2\2 > 9}. 

Definition: If 5 is a region of spacetime—e.g., a segment of a spacelike hyper- 
surface—then J~(S ) ( the causal past of S) is the set of all events that 
causally precede at least one event in 5—i.e., 

J~(S) = {2\2 < 9 for at least one 9 £ 5}. 

Definition: Similarly, J + (S ) (the causal future of S) is the set of all events that 
causally follow at least one event in 5—i.e., 

•7 + (<S) = {2\2 > 9 for at least one 9 £ S). 

Definition: j + (S) is the boundary of J + (S), 
j~(S) is the boundary of 

Definition: One defines the future of 9, I + (9); the past of 9, I~(9 ); the future 
of 5, 1 + (S); the past of S, I~(S); the boundary of the future of S, 7 + (<S); 
and the boundary of the past of S, 7~(S) in precisely the same manner 
as above, except that the phrase “causally precede” is replaced by 
“precede,” and “causally follow” is replaced by “follow.” Example: 

7 + (<S) = {212 > 9 for at least one 9 £ S}. 

Not all these definitions are needed in the following discussion; but the literature 
on global methods uses these concepts so extensively that the reader should be 
familiar with them. 

Focus attention on a specific spacetime manifold, and in that manifold select out 
a specific asymptotically flat region. [In the external field of a star, there is but one 
asymptotically flat region. In the vacuum Schwarzschild geometry without source 
(Figure 34.3), there are two. In the Reissner-Nordstrom geometry without source 
(Figure 34.4), there are infinitely many different asymptotically flat regions.] The 
selected asymptotically flat region (“external universe”) has one future timelike 
infinity /+, one past timelike infinity I~, one spacelike infinity 7°, one future null 
infinity 7 + , and one past null infinity S~. It may also possess black holes, which 
form by stellar collapse, and which collide, coalesce, accrete matter, and generally 
wreak havoc in their immediate vicinities. The surfaces of all black holes (“future 
horizons”) separate the external universe, which can send signals out to 7 + , from 
the black-hole interiors, which cannot. One thus has the definition: 

Definition: The totality (or “ union ”) of all future horizons (surfaces of all black Definition: surfaces of black 
holes) is the region i.e., it is the boundary of the domain holes; future 

7 - (.7 + ) that can send future-directed causal curves out to future null horizons J W ) 
infinity. 
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Figure 34.6. 

Black holes in an asymptotically flat spacetime (schematic spacetime diagram). 
J (i + ) is the “external universe”—i.e., the region which can send causal curves 
to future null infinity. the greyish region, is the boundary of the “external 

universe”—i.e., it is the union of all future horizons. At the “time” of spacelike 
slice <3 1} there are no black holes in the universe. Between S , and S 2 two stars 
collapse to form black holes. The two closed 2-surfaces, in which S 2 intersects j~{3+) 
are the horizons of those black holes at “time” S 2 . Between S 2 and S 3 , the two 
original black holes collide and coalesce, while a third black hole is being formed 
by stellar collapse. 


[Similarly, one can define the totality of all past horizons to be But past 

horizons are of little interest for astrophysics. Whereas gravitational collapse pro¬ 
duces future horizons in a quite natural manner, past horizons must be primordial 
in origin—i.e., they must be postulated as initial structure in the origin of the universe 
[Novikov (1964), Ne’eman (1965)]. There is no good reason to believe that the 
universe began with or should have begun with such strange initial structure.] 
Any given spacelike slice S through spacetime will intersect j~(d + ) in a number 
of disjoint, closed, two-dimensional surfaces. Each such 2-surface is the horizon of 
a single black hole at the “moment of time” 5. See Figure 34.6. 


§34.4. GLOBAL STRUCTURE OF HORIZONS 


Penrose's theorem on the 
structure of J~(4 + ) (future 
horizons) 


The union of all future horizons, j has an especially simple global geometric 
structure, as follows. 

THEOREM [Penrose (1968a)]: j~(d + ) is generated by null geodesics that have no 
future end points . Stated more precisely (see Figure 34.7): 

(1) Definition: The “ generators ” of )~{d + ) are null geodesics which 
(at least for some finite lapse of affine parameter) lie in 
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7 

Figure 34.7. 

The future horizon produced by the spherical gravitational collapse of a 

star. This horizon illustrates the global geometric structure of j~(4 + ) as spelled out 
in Penrose’s theorem (§34.4 of text). In this special case, there is only one caustic. 
In general there will be many. 




(2) Theorem: When followed into the past, a generator may (but 

does not have to!) leave Each event at which a generator 

leaves is called a “ caustic ” of When a generator leaves, 

it goes into 

(3) Once a generator, being followed into the future, enters 

from J~(4 + ) at a caustic, it can never thereafter leave / - (/ + ), 
nor can it ever intersect another generator. [Generators can 
intersect only at the “caustics,” where they enter /-(/+).] 

(4) Through each noncaustic event of / _ (/ + ) there passes one and 
(aside from normalization of affine parameter) only one genera- 

This theorem is proved in Box 34.1, 

For a Schwarzschild black hole, the generators of j~(H + ) are the world lines of 
radially outgoing photons at the gravitational radius [r — 2 M, 6 and <j> constant, 
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u= + V ; dotted line on horizon in Figure 32.1(c)], For a Kerr-Newman black hole, 
the generators of are the “barber-pole-twist” null geodesics of Box 

33.2(F)—i.e,, they are those members of the outgoing principal null congruence that 
lie on the horizon, r = (§33.6; exercise 33.9), But the theorem is more general. 

It refers to any black hole—dynamic or static; accreting matter, or coalescing with 
a neighboring black hole, or existing alone in isolation—in any time-oriented, 
asymptotically flat spacetime. 

(i continued on page 931) 

Box 34.1 HORIZONS ARE GENERATED BY NONTERMINATING 
NULL GEODESICS (Penrose 1968a) 

A. Lemma: If (1) ^(A) is a causal, future-directed curve 
from event if to event 2, (2) (? 2 (\) is a causal, fu¬ 
ture-directed curve from event 2 to event and 
(3) 9 if {if is not in the past of $), then 6 X and 
(? 2 are null geodesics, and their tangent vectors coin¬ 
cide (aside from normalization) at event 2 . 

Proof*: 

1. Suppose that were not a null geodesic, or c? 2 
were not a null geodesic, or both. Then some¬ 
where along (? 1 U S 2 there would be a timelike 
segment, or a nongeodesic null segment, or both, 
a. If 6 t U (? 2 contained a timelike segment, then 
a slight deformationf of 6 > 1 U C 2 would pro¬ 
duce a smooth curve (P 3 from if to if which 
is everywhere timelikej—contradicting the as¬ 
sumption if if. 


*The proof utilizes some elementary concepts of point-set topology: 
see, e.g., Wallace (1963) or Kelley (1955). 

f One can always deform any curve in any spacetime manifold by 
a small amount in any direction one wishes, without running into 
singularities or into other boundaries of the manifold. This is possible 
because a manifold by definition is open. In physical terms, spacetime 
is open because each event in spacetime must possess a local Lorentz 
neighborhood which also lies in spacetime. 

t One can convince oneself of this, and of similar claims made later 
in the proof, by arguments using local Lorentz frames. In the literature 
on global geometry, claims such as this are rarely substantiated— 
though each author is always convinced that he could do so if forced 
to by a skeptic. Unfortunately, to substantiate such claims with rigor¬ 
ous arguments would lengthen and complicate the discussion enor¬ 
mously and would tend to obscure the simplicity of the underlying 
ideas. 
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b. If S 1 U e 2 contained a nongeodesic null seg¬ 
ment 6 reaching from event d to event 9, 
then, when compared to neighboring curves 
between d and 9, G would not have stationary 
length. This means that some curves from d 
to 9 would have larger squared length—i.e., 
would be spacelike—while others would have 
smaller squared length—i.e., would be time¬ 
like. Thus, a slight deformation of 6 would 
produce a timelike segment from d to 9. Then 
a further deformation of G x U G 2 , as described 
in (a) above, would produce a smooth timelike 
curve from 9 to 9, contradicting 9 <$ GL. 
Thus, the supposition is wrong; i.e., both G x and 
G 2 must be null geodesics. 





2 . 



Suppose that the tangent vectors of G x and G 2 
did not agree at their join point, 2. Then one 
could “round off the comer” at 2, producing a 
timelike segment there. One could then further 
deform G x U as in (la) above, to produce a 
smooth timelike curve from 9 to contradict¬ 
ing 9 9. Thus, the supposition is wrong; i.e., 

the tangent vectors must agree at 2. q.e.d. 


a 
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Box 34.1 (continued) 

B. Lemma: If 8 £ J~(A + ) and s8 £ /“(.*+), then 8 4 

Proof: Assume 8 4 

1. Then there exists a timelike curve from 8 to (fl. 

2. A slight deformation of that curve which keeps 
it still timelike will make it link an arbitrary event 
2 in some sufficiently small neighborhood 2l[8] 
to an arbitrary event (ft in some small &[$]. 

3. Pick (91 to lie in Then join the timelike 

curve from 2 to onto a causal curve from $ 
to i + . The resulting curve, when smoothed in a 
neighborhood of the join, becomes a causal curve 
from any arbitrary 2 £ %\8] to 

4. The existence of such curves implies that 9i[8] C 

J (^ + ), and hence that 8 $: —in contra¬ 

diction to the original hypotheses. 

Conclusion: 8 4 $8. Q.e.d. 





C, Lemma: Let d’(A) be a causal curve that intersects 
at some event Then when followed into 

the past from ®,<?(A) forever lies in J~(A + ) U /-(/+). 

Proof: 

L Pick an arbitrary event 8 on <?(A), with 8 <$. 

2. Construct an arbitrarily small neighborhood 

%[<?). 

3. A small deformation of between 8 and 
produces a timelike curve tf) from some event 
9 £ %[8) to S. 

4. Since ® a slight deformation of 

keeping it still timelike, produces a curve & from 
9 to some event 2 £ & can then be pro¬ 

longed, remaining causal, until it reaches A + . The 
result is a causal curve from ^ to / + . Hence, 
9 £ 

5. But 9 was in an arbitrarily small neighborhood 

9l\8). Hence, 8 must also be in J~(f + ) or else 
in its boundary, q.e.d. 



.j 



§34.4. GLOBAL STRUCTURE OF HORIZONS 


D. Theorem [Penrose (1968a)]: j~(3 + ) is generated by 
null geodesics which have no future endpoints. [See 
text of §34.4 for more detailed statement of theo¬ 
rem.] 

Proof: 

1. Pick an arbitrary event 9 in j~(3 + ). Prove as 

follows that through 9 there passes a future-di¬ 
rected null geodesic which lies in j~(3 + ): 

a. Construct an arbitrary neighborhood 9L[9\. [If 

happens somewhere to encounter a 
singularity of spacetime, then %[9] must be 
chosen small enough to keep the singularity 
outside it.] 

b. In %[9] n J~(3 + ), construct a sequence of 
events {9^} which converges to the event 9. 

c. For each /, construct a causal curve 6 i extend¬ 
ing from 9 i to 3 + . 

d. Let 2 { be the intersection of with f %[9], the 
boundary of %\9\ Since l*[!p] is a compact set, 
the sequence 2 { must have a limit point, 2. 

e. Because there exist causal curves from events 
9i arbitrarily near 9 to events 2 { arbitrarily 
near 2, there must be a causal curve from 9 
to 2. Call that curve 6. 

f. Since 2 is a limit point of a sequence of events 
in J~(3 + ), 2 either lies in J~(3 + ), or else lies 
on its boundary J~(3 + ), or both. Suppose 
2 £ J~{3 + ). 

i. Then some small neighborhood 2L[2] is 
contained entirely in J~{3 + ). 

ii. Construct a causal curve from 9 to 3 + by 
going from 9 to 2 along the causal curve 
6, then from 2 along a timelike curve to 
some event 9 £ %\2\, and then from 9 to 
3 + along a causal curve—and by smooth¬ 
ing at the join points 2 and 9. 

iii. Since this curve from 9 to 3 + has a time¬ 
like segment, it can be deformed smoothly, 
while being kept causal, so that it reaches 
any desired event 5 in some small neigh¬ 
borhood 3JL '[9]. But this means that 
9l'[9] C J~(3 + ), hence that 9 £ J~(3 + )— 
which contradicts the original definition 
of 9. 

Conclusion: 2 £ j~(3 + ). 
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Box 34.1 (continued) 

g. By Lemma - B, since 9 £ j~(d+) and 

~ £ J~(d + ), then 9 2. But G is a future- 

directed causal curve from 9 to 2. Conse¬ 
quently, by Lemma A, G is a null geodesic. 

h. Since the curve G intersects at 2, be¬ 

tween 9 and 2 it must everywhere lie in 
/“(y + ) U J~(d + ) [Lemma C]. Apply the rea¬ 
soning of (f) above, with 2 replaced by an 
arbitrary point on G between 9 and 2. 
Thereby conclude that, everywhere between 9 
and 2 , G lies in j~(d + ). 

Summary: Through every event 9 £ J~(d + ) there 
passes a null geodesic 6 which , when followed into 
the future from 9, lies in J~(d + ). This null geodesic 
is called a “ generator ” of j~(d + ). 

2. Follow the generator G from 9 to 2 and then 
onward still further. Can it ever leave j~(d + )l No! 
For suppose it did leave, at some event 9' £ 

j-{^y 

a. Repeat the entire construction of step 1, with 
9' replacing 9, to conclude that there is a null 
geodesic G’ C j~(d + ) extending into the 
causal future from 9 9 to some event 2'. 

b. By Lemma B, since 9 £ J~(d + ) and 2' £ 

9 4 2\ 

c. Then by Lemma A the null geodesic G from 
9 to 9' and the null geodesic G’ from 9* to 
2’ have tangents that coincide at 9 9 (aside 
from normalization). Thus, with a renormal¬ 
ization of affine parameter, G ' becomes the 
prolongation of G— which means that G does 
not leave J~(d + ) at 9\ 

Conclusion: Once a generator , being followed into 
the future, enters j~(d + ), it can never thereafter 
leave J~(d + ). 

3. Figure 34.7 provides an example of how a null 
geodesic, being following into the future, can 
enter J~(d + ) and become a generator. Lemma C 
guarantees that, when a null geodesic enters 

it enters from 


-\ 
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4. As indicated by the example of Fig. 34.7, at a 
“caustic” [entry point of generators onto 
generators can cross each other. Follow a genera- 
- '-tor Gto the causal future. from its entry point onto 
J~(4 + ). Can it ever again cross another generator? 
No. For suppose that at an event 9 the generator 
G were to cross another generator 9. 

a. To the causal future of 9, both generators 

always lie in Thus, events d and 9 of 

the picture are in 7“(y + ). 

b. Since 9 is to the causal future of the caustic 
where G enters 7~(/ + ), there exists an event 
2 £ j~(d+) n G to the causal past of 9. 

c. Since 2 £ j~(4 + ) and & £ /%#), 2 ^ # 
[Lemma B]. 

d. Lemma A, applied to the curves G from 2 to 
9, and 9) from 9 to then guarantees that 
the tangent vectors u e and coincide at 9 
(aside from normalization), and that therefore 
(aside from normalization) the geodesics G and 
9) are identical. This contradicts the supposition 
that G and 9) are different generators which 
cross at 9. 

Conclusion: Once a generator has entered /"(/+), 
it can never thereafter cross any other generator . 




V_ J 


§34.5. PROOF OF SECOND LAW OF BLACK-HOLE DYNAMICS 
[Hawking (1971b, 1972a, 1973)] 

All the tools are now in hand for a proof of the second law of black-hole dynamics. 

Consider the union of all future horizons, J~(4 + ), in an asymptotically flat space- Proof of second law of 
time, as depicted in Figure 34.8. Divide up the null-geodesic generators of )~{9 + ) black-hole dynamics: 
into a large number of infinitesimal bundles, and give each bundle an identifying 
number, K. As one moves from past toward future along + ), one occasionally 
sees new bundles of generators created in “caustics” of the 3-surface j~(4 + ). The 
caustic sources of new generators are created by such processes as the infall of matter 
through the horizon (example: bundle #42 in Figure 34.8), and the collision and 
coalescence of two black holes (example: bundle #29). But each bundle, once 
created, can never be destroyed (no termination of null generators as one moves 
from past toward future). 
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second law of black-hole dynamics. See text for 
details of the proof, and see Figure 34.6 for 
physical interpretation of the diagram. 


Focus attention on a specific bundle of generators—bundle # K. At a specific event 
9 along that bundle, let various observers, moving with various velocities, measure 
its (two-dimensional) cross-sectional area d K (fP). As shown in Figure 22.1, exercise 
22.13, and exercise 22.14: (1) the cross-sectional area Ct K (9) is independent of the 
velocity of the observer who measures it—i.e., d K (9) depends only on location 9 
along the bundle; and (2) <X K changes from event to event along the bundle in a 
manner governed by the “focusing theorem” 


Proof assumes nonnegative 
energy density 


£ 2 # 1/2 if the energy density T 55 , as measured 

— 2 <0 by all observers along the bundle, is ( 34 . 6 ) 

K nonnegative. 

Here A* is affine parameter along the bundle. Assume —in accord with all physical 
experience and the best assessments of modern physics —that energy density T^ 0 can 
never be negative. (This assumption underlies the second law of black-hole dynamics. 
If it were ever found to be invalid , then one would have to abandon the second law.) 

Suppose that dd K 1/2 /d\ K were negative at some event 9 along the bundle. Then, 
according to the focusing theorem, dd K 1/2 /d\ K would always remain at least as 
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negative as its value at —and, hence, after a lapse of affine parameter given by 

( d 1/2 \ 

-4V/W ./ < 34 - 7 > 

d K l/2 would go to zero. At the point where djf /2 reaches zero, adjacent null 
geodesics in the bundle cross each other, giving rise to events in /"(/+) through 
which pass more than one null geodesic generator. But this violates Penrose’s theorem 
on the global structure of horizons (§34.4). 

Thus either the supposition of negative dd K 1/2 /dX K is wrong; or else dd K l/2 /dX K 
goes negative, but then, before the generators get a chance to cross [before the finite 
lapse (34.7) of affine parameter], the generators hit a singularity and cease to exist. 
To prove the second law of black-hole dynamics , one must assume that no singularity 
is hit by the horizon , and thereby conclude that dd K ^ 2 /dX K never goes negative. 
Hawking (1971b, 1972a) makes an alternative assumption which implies dd K 1/2 /dX K 
> 0: Hawking assumes that spacetime is “future asymptotically predictable. ” In essence 
this means that spacetime possesses no “naked singularities”—i.e., no singularities 
visible from /+. (Naked singularities could influence the evolution of the external 
universe; and, therefore, unless one knew the laws of physics governing singularities— 
which one does not—they would prevent one from predicting the future in the 
external universe.) 

Under either assumption (no naked singularities; or horizon never hits a singular¬ 
ity), one concludes that 

dd K in /dX K is nonnegative everywhere along bundle K. (34.8) 

This result says that the cross-sectional area d K of each bundle can never decrease 
as one moves toward the future along 7~(/ + ). Since new bundles can be created, 
but old ones can never be destroyed as one moves toward the future, the total 
cross-sectional area of j~~(# + ) cannot decrease toward the future. Equivalently, (see 
Figure 34.8), if S x and <S 2 are spacelike hypersurfaces with S 2 everywhere to the 
future of S v then the cross-sectional area of j~(# + ) at its intersection with S 2 , d(S 2 ), 
cannot be less than the cross-sectional area at S v d(S x ). This is the second law of 
black-hole dynamics, reformulated in more precise language than that of Chapter 
33, and finally proved. 


Exercise 34.4. A BLACK HOLE CAN NEVER BIFURCATE [Hawking (1972a)] 

Make plausible the theorem that no matter how hard one “zaps” a black hole, and no matter 
what one “zaps” it with, one can never make it bifurcate into two black holes. [Hint: By 
drawing pictures, make it plausible that, at any bifurcation point, some null geodesic genera¬ 
tors of/~(7 + ) must leave 7~(7 + ) as one follows them into the future —in violation of Penrose’s 
theorem (§34.4). Assume that the surface of each hole is topologically a 2-sphere. Note: The 
same argument, time-reversed, shows that if two black holes coalesce, generators enter J~(<f+) 
from 7“(i + ) at the coalescence point: and the surface area of the horizon increases.] 


Proof assumes that horizon 
never hits a singularity (no 
naked singularities) 


Precise formulation of 
second law 


EXERCISE 
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Overview of theorems on 
singularities 


Singularity defined 


Trapped surface defined 


§34.6. THEOREMS ON SINGULARITIES, AND THE 
"ISSUE OF THE FINAL STATE" 

Just as global techniques are powerful tools in the analysis of horizons, so they also 
are-powerful in the analysis of spacetime singularities. In fact, it was the proof of 
Penrose’s (1965b) pioneering theorem on singularities that gave birth to global 
techniques for studying spacetime. 

For a detailed introduction to the global analysis of singularities, one can read 
the book of Hawking and Ellis (1973). Now that the reader has had a taste of global 
techniques, attention here will focus on a qualitative description of results: 

How does gravitational collapse terminate? Is the singularity at the end point of 
spherical collapse typical, or can asymmetries remove it? That singularities are very 
general phenomena, and cannot be wished away, has been known since 1965, thanks 
to theorems on singularities proved by Penrose, Hawking, and Geroch. [For a full 
list of references, see Hawking and Penrose (1969) or Hawking and Ellis (1973).] 

Before examining the theorems on singularities, one must make precise the concept 
of a singularity. This is not easy, as Geroch (1968) has emphasized in a long treatise 
on the wide variety of pathologies that can occur in spacetime manifolds. However, 
after vigorous efforts by many people, Schmidt (1970) finally produced a definition 
that appears to be satisfactory. Put in heuristic terms, Schmidt’s highly technical 
definition goes something like this. In a spacetime manifold, consider all spacelike 
geodesics (paths of “tachyons”), all null geodesics (paths of photons), all timelike 
geodesics (paths of freely falling observers), and all timelike curves with bounded 
acceleration (paths along which observers are able, in principle, to move). Suppose 
that one of these curves terminates after the lapse of finite proper length (or finite 
affine parameter in the null-geodesic case). Suppose, further, that it is impossible 
to extend the spacetime manifold beyond that termination point—e.g., because of 
infinite curvature there. Then that termination point, together with all adjacent 
termination points, is called a “singularity.” (What could be more singular than the 
cessation of existence for the poor tachyon, photon, or observer who moves along 
the terminated curve?) 

Another concept needed in the singularity theorems is that of a trapped surface. 
This concept, devised by Penrose (1965b), is motivated by a close examination of 
the two-dimensional, spherical surfaces (r, t) = const, inside the horizon of the 
Schwarzschild geometry. These surfaces signal the nearness of a singularity (r — 0) 
by this property: light rays emitted from one of these surfaces in the perpendicular 
outward direction (i.e., outgoing, orthogonal, null geodesics) converge toward each 
other as they propagate; and inward light rays perpendicular to the 2-surface also 
converge. Penrose gives the name “trapped surface” to any closed 2-surface, spherical 
or not, that has this property. In Schwarzschild spacetime, the convergence of light 
rays, both outgoing and ingoing, can be attributed to the “intense pull of gravity,” 
which sucks the photons into the singularity. That this might also be true in asym¬ 
metric spacetimes is suggested by the Hawking-Penrose (1969) theorem [the most 
powerful of a wide class; see Hawking and Penrose (1969) for references to others; 
and see Boxes 34.2 and 34.3 for introductions to Hawking and Penrose]: 
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A spacetime M necessarily contains incomplete, inextendable timelike or null geo¬ 
desics {and is thus singular in the Schmidt sense) if in addition to Einstein's equations, 
the following four conditions hold: (1) M contains no closed timelike curves (reason¬ 
able causality condition); (2) at each event in M and for each unit timelike vector u, 
the stress-energy tensor satisfies 

(y a fi - jg a pT)u a u^ >0 

(reasonable energy condition); (3) the manifold is “general” (i.e., not too highly 
symmetric) in the sense that every timelike or null geodesic with unit tangent u passes 
through at least one event where the curvature is not lined up with it in a specific 
way: 

u laRfiiy 5 [c m p] wYmS 7 ^ 0 at some point on the geodesic. 

(4) the manifold contains a trapped surface. 

All these conditions, except the trapped surface, seem eminently reasonable for 
any physically realistic spacetime! Note, especially, that the energy condition can 
be violated only if, as measured by some local observer in his proper frame, the 
total energy density E is negative or the principal pressures (eigenvalues of stress 
tensor) P { are so negative that 

2 Pi < -e. 

i 

The relevance of the Hawking-Penrose theorem for collapse follows from the 
general expectation that, in the real universe, trapped surfaces will always exist just 
below all future horizons, J~(4 + ). (Exceptions, such as the Kerr metric with a = M, 
are probably a “set of measure zero.”) Since horizons and accompanying trapped 
surfaces are necessarily produced by slightly nonspherical collapse (Box 32.2), and 
since they probably also result from moderately deformed collapse (§32.7), such 
collapse presumably produces singularities—or a violation of causality, which is also 
a rather singular occurrence! 

If the singularities are really such a general feature of collapse, then the exact 
nature of the singularity is of life-and-death importance to anyone who falls through 
a horizon! Here one is on very shaky ground. Although the main results and 
conjectures described up to now in this section will probably survive all future 
research, opinions about the nature of the singularities are likely to change several 
times more before the whole story is in. Hence, it is safe only to describe the 
possibilities, not to attempt to judge them. 

Possibility 1 

The singularity at the endpoint of a realistic collapse is a region of infinite tidal 
gravitational forces (infinite curvature), which crushes the collapsing matter to infinite 
density. Examples: the very special, homogeneous crushing of the Oppenheimer- 
Snyder (1939) spherical collapse (§32.4); also the very special inhomogeneous but 
spherical crushing described by Podurets (1966); also the special inhomogeneous, 

{continued on page 940) 


The Hawking-Penrose 
theorem on singularities 


Relevance of the 
Hawking-Penrose theorem for 
gravitational collapse 


The nature of the singularity 
at the endpoint of realistic 
collapse: 4 possibilities 
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Box 34.2 ROGER PENROSE: Born August 8, 1931, Colchester, Essex, England 


Roger Penrose started out as an algebraic geom¬ 
eter. However, while at Cambridge'from 1952-55 
and again from 1957-60, his interest in general 
relativity was aroused by Hermann Bondi and 
Dennis Sciama. Because of his pure mathematical 
background, his approach to the subject was 
different from those which had been adopted hith¬ 
erto. He was particularly interested in the global 
light-cone structure of spacetime and in the equa¬ 
tions of zero rest-mass fields, both of which are 
preserved under conformal transformations. He 
exploited this conformal invariance to give an 
elegant and powerful treatment of gravitational 
radiation in terms of a null surface S ' + at infinity. 
More recently this interest has led him to develop 
the theory of twistors, which are the spinors corre¬ 
sponding to the conformal group of Minkowski 
space. These offer a new and very promising ap¬ 
proach to the quantization of spacetime. 

His interest in conformal geometry also led him 
to study the properties of the causality relation¬ 
ships between points of spacetime. These in turn 
led him to the theorems on the occurrence of sin¬ 



gularities in spacetime, which are probably the 
most important predictions of general relativity, 
since they seem to imply that spacetime has a 
beginning or an end. 


If spacetime is considered from the point of view of its conformal structure 
only, points at infinity can be treated on the same basis as finite points" 

[PENROSE, IN INFELD (1964)] 

"77>e argument will be to show that the existence of a trapped surface implies— 
irrespective of symmetry—that singularities necessarily develop " 

[PENROSE (1965b)] 

“While the quantum effects of gravitation are normally thought to be significant 
only when curvatures approach 10 33 cm~K all our local physics is based on the 
Poincare group being a good approximation of a local symmetry group at 
dimensions greater than 10 ~ 13 cm. Thus, if curvatures ever even approach 
10 cm , there can be little doubt but that extraordinary local effects are likely 

to take place " 
[HAWKING AND PENROSE (1969)] 
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We are thus presented with what is perhaps the most fundamental unanswered 
question of general-relativistic collapse theory , namely: does there exist a "cosmic 
censor" who forbids the appearance of naked singularities, clothing each one in an 

absolute event horizon?" * 
[PENROSE (1969)] 

Under normal circumstances, general relativity can, for practical purposes, 
remain remarkably apart—almost aloof—from the rest of physics. At a space-time 

singularity, the very reverse must surely be the case!" 
"I do not believe that a real understanding of the nature of elementary particles 
can ever be achieved without a simultaneous deeper understanding of the nature 
of spacetime itself. But if we are concerned with a level of phenomena for which 
such an understanding is not necessary—and this will cover almost all of 
present-day physics—then the smooth manifold picture presents an 
(unreasonably!) excellent framework for the discussion of phenomena." 
"The most important single lesson of relativity theory is, perhaps, that space 
and time are not concepts that can be considered independently of one another 
but must be combined together to give a four-dimensional picture of phenomena: 

the description in terms of spacetime" 

[PENROSE (1968a)] 

"If a formalism enables one to treat myriads of non-existent types of universe, 
then (effectively) it contains 'arbitrary parameters,' only special values of which 
will correspond to the world as it actually is. In the ordinary approach to 
spacetime as a pseudo-Riemannian differentiable manifold, the dimension of the 
manifold and the signature of the metric are two such arbitrary parameters." 
"As we localize the position of a particle, it jumps essentially along the null 
cone. Other particles are produced, which leap backward and forward essentially 
along null directions,_without apparent regard for continuity, heeding only the 
positions of the null cones ?/7e/77se/yes and "topology" only in the respect in 
which this term is applied to the structure of graphs" 

[PENROSE (1966)] 

"My own view is that ultimately physical laws should find their most natural 
expression in terms of essentially combinatorial principles, that is to say, in terms 
of finite processes such as counting or other basically simple manipulative 
procedures. Thus, in accordance with such a view, should emerge some form of 

discrete or combinatorial spacetime" 
[PENROSE, IN KLAUDER (1972)] 

"Complex numbers are ... a very important constituent of the structure of 
physical laws. The twistor theory carries this further in suggesting that complex 
numbers may also be very basically involved in defining the nature of spacetime 

itself. " 

[PENROSE AND MACCALLUM (1973)] 

"It is thus very tempting to believe that a link between spacetime curvature and 
quantum processes may be supplied by the use of twistors. Then, roughly 
speaking, it is the continual slight 'shifting' of the interpretations of the quantum 
(twistor) operators which results in the curvature of spacetime" 


[PENROSE (1968b)] 
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Box 34,3 STEPHEN W. HAWKING: Born January 8, 1942, Oxford, England 


As a research student of Dennis Sciama’s in Cam¬ 
bridge, Stephen Hawking’s early interest in rela¬ 
tivity theory centered mainly on the question of 
spacetime singularities. With Ellis, he showed that 
a large class of homogeneous cosmological models 
must be singular. Then, encouraged by work of 
Penrose on the singularities arising in gravitational 
collapse, he developed new techniques which, in 
a series of papers in the Royal Society of London 
during 1966-67, established the important result 
that any plausible general-relativistic cosmology 
must be singular. 

The major portion of his later research has been 
concerned with black holes. He devised a series 
of arguments of great ingenuity which, together 
with the work of Israel and Carter, established to 
all intents and purposes the result that (vacuum) 
black holes in general relativity are described by 
Kerr metrics, that topologies other than spherical 
cannot occur, and that a certain limit on the en¬ 
ergy emitted when two black holes congeal into 
one must be satisfied. 

Some of this work has had substantial pure 
mathematical interest (e.g., singularity theorems), 
some of it is concerned with astrophysics (e.g., 
work with Taylor on helium production in the big 
bang), some with observations (work with Gibbons 
on the possibility of black holes in binary star 



systems) and even experimental developments 
(with Gibbons on gravitational-wave detectors). In 
such scope is exhibited not only a considerable 
insight, depth, and versatility, but also the gift of 
an extraordinary determination to overcome se¬ 
vere physical handicaps, to seek out and compre¬ 
hend the truth. 


"The observed isotropy of the microwave background indicates that the universe 
is rotating very little if at all. . . . This could possibly be regarded as an 
experimental verification of Mach's Principle" 

[HAWKING (1969)] 

" Undoubtedly , the most important results are the theorems ... on the 
occurrence of singularities. These seem to imply either that the general theory of 
relativity breaks down or that there could be particles whose histories did not exist 
before (or after) a certain time. The author's own opinion is that the theory 
probably does break down but only when quantum gravitational 

effects become important. " 

"Although we have omitted the singular points from the definition of spacetime , 
we can still recognize the 'holes' left where they have been cut out by the 

existence of incomplete geodesics." 
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"A good physical theory should not only correctly describe the currently 
experimental knowledge, but should also predict new results which can be tested 
by experiment, the further the predictions from the original experiments, the 
greater the credit to the theory if they are found to be correct. Thus observations 
of whether or not singularities actually occurred, would provide a powerful test of 

the general theory of relativity in strong fields" 

[HAWKING (1966a)] 

"The construction of gravitational radiation detectors may open up a whole new 
field of 4 gravitational astronomy ' which could be as fruitful as radio astronomy has 
been in the last two decades. . . . Black hole collisions . . . would be much more 
effective in converting rest-mass energy into radiation than nuclear reactions, 
which can release only about 1 per cent of the rest-mass energy. In addition, 
black holes formed by collisions of smaller black holes can undergo further 
collisions, releasing more energy, whereas matter that has been fully processed by 
nuclear reactions cannot yield any more energy by the same means. ... we are 
witnessing something really cataclysmic at the centre of our galaxy" 

[HAWKING (1972b)] 

One might suggest that prior to the present expansion there was a collapsing 
phase . In this, local inhomogeneities grew large and isolated singularities occurred. 
Most of the matter avoided the singularities and reexpanded to give the present 

observed universe." 

"St seems that we should draw a surface around regions where the radius of 
curvature is less than, say, 10 ~ 16 cm. On our side of this surface, a manifold 
picture of spacetime would be appropriate, but we have no idea what structure 

spacetime would have on the other side" 

[HAWKING AND ELLIS (1968)] 

"Presumably it would be necessary to consider quantum effects in very strong 
fields. However, these would not become important until the radius of curvature 
became of the order of 10~ lA cm, which for practical purposes is pretty singular." 

"The view has been expressed that singularities are so objectionable that if the 
Einstein equations were to predict their occurrence, this would be a compulsive 
reason for modifying them. However, the real test of a physical theory is not 
whether its predicted results are aesthetically attractive but whether they agree 
with observation. So far there are no observations which would show that 

singularities do not occur" 
[HAWKING (1966b)] 

"It is shown that a stationary black hole must have a topologically spherical 
boundary and must be axisymmetric if it is rotating. These results, together with 
those of Israel and Carter, go most of the way toward establishing the conjecture 

that any stationary black hole is a Kerr solution" 

[HAWKING (1972a)] 

"The fact that we have observed the universe to be 
isotropic is only a consequence of our existence. " 

[COLLINS AND HAWKING (1973)] 
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“Kasner-like” crushing of Lifschitz and Khalatnikov (1963a,b); also, most impor¬ 
tantly, the very general “mixmaster” crushing (Chapter 30), discovered in the homo¬ 
geneous case by Misner (1969b) and by Belinsky and Khalatnikov (1969a), and 
analyzed in the inhomogeneous case by Belinsky and Khalatnikov (1969b, 1970) 
and by Khalatnikov and Lifschitz (1970). The mixmaster singularities—and only 
they among all explicitly known singularities—appear to be generic in this sense: 
if one perturbs slightly but arbitrarily the initial conditions of a spacetime that 
evolves a mixmaster singularity, then the resultant perturbed spacetime will also 
evolve a mixmaster singularity. Because of this, the prevalent opinion today (1973) 
is that realistic collapse probably produces, inside the horizon, a mixmaster singular¬ 
ity. But that opinion might change tomorrow. 

Possibility 2 

The singularity is a region of spacetime in which timelike or null geodesics terminate, 
not because of infinite tidal gravitational forces or infinite crushing, but because 
of other, more subtle pathologies. Example: “Taub-NUT space” [see Misner and 
Taub (1968)]. For other examples created specially to exhibit various pathologies, 
see Geroch (1968). 

Possibility 3 

The singularity may be sufficiently limited in “size” and influence that all or most 
of the collapsing matter successfully avoids it. The matter cannot then explode back 
outward through the horizon that it went down; the horizon is a one-way membrane 
and will not let anything back out. Instead, the matter may reach a stage of maximum 
but finite contraction, and then reexplode into some other region of spacetime 
(multiply connected spacetime topology; “wormhole”). Analytical solutions for 
collapsing, charged spheres do reexplode in this manner [Novikov (1966); de la Cruz 
and Israel (1967); Bardeen (1968); see Figure 34.4], Such a process requires that 
the “exploding” end of the wormhole be built into the initial conditions of the 
universe, with mass and angular momentum (as measured by Keplerian orbits and 
frame dragging) precisely equal to those that go down the black-hole end. This seems 
physically implausible. So does the “explosion.” 

Other Possibilities 

Various combinations of the above. 

Will quantization of If, as one suspects today, the singularities are of a very physical, infinite-curvature 

from^ingullrrtle? Un ' VerSe l yP e > then one must face up to John Wheeler’s (1964a) “issue of the final state” in 

its most raw and disturbing form. Wheeler, when faced with the issue, argues that 
infinite-curvature singularities signal a breakdown in classical general relativity—a 
breakdown forced by the onset of quantum gravitational phenomena (see Chapter 
44). Whether quantization of gravity will actually save spacetime from such singu¬ 
larities one cannot know until the “fiery marriage of general relativity with quantum 
physics has been consummated” [Wheeler (1964a); see also Misner (1969c), and the 
last section of Box 30.1]. 



VIII 


GRAVITATIONAL WAVES 


Wherein the reader voyages on stormy seas of curvature ripples , 
searching for the ripple-generating storm gods , and battles 
through an electromagnetic and thermal fog that allows only 
uncertain visibility upon those seas. 





CHAPTER 


PROPAGATION OF 
GRAVITATIONAL WAVES 


Born: '7 should like to put to Herr Einstein a question, namely: /?ow quickly the 
action of gravitation is propagated in your theory. That it happens with the speed 
of light does not elucidate it to me. There must be a very complicated 

connection between these ideas." 
Einstein: "It is extremely simple to write down the equations for the case when 
the perturbations that one introduces in the field are infinitely small. Then the g's 
differ only infinitesimally from those that would be present without the 
perturbation. The perturbations then propagate with the same velocity as fight ." 

Born: "But for great perturbations things are surely very complicated?" 

Einstein: "Yes, it is a mathematically complicated problem. It is especially 
difficult to find exact solutions of the equations, as the equations are nonlinear." 

Excerpts from discussion after Einstein's Fall 1913 lecture in Vienna on “The present position of the 
problem of gravitation," already two years before he had the final field equations [EINSTEIN, 1913a] 


§35.1. VIEWPOINTS 

Study one idealization after another. Build a catalog of idealizations, of their prop¬ 
erties, of techniques for analyzing them. This is the only way to come to grips with 
so complicated a subject as general relativity! 

Spherical symmetry is the idealization that has dominated most of the last 12 
chapters. Together with the idealization of matter as a perfect fluid, and of the 
universe as homogeneous, it has yielded insight into stars, into cosmology, into 
gravitational collapse. 

Turn attention now to an idealization of an entirely different type, one independent 
of any symmetry considerations at all: the idealization of a “gravitational wave/’ 

Just as one identifies as “water waves” small ripples rolling across the ocean, so Gravitational waves 

one gives the name “gravitational waves” to small ripples rolling across spacetime, compared to water waves 

- ocean: 

We are deeply indebted to Mr. James M. Nester, who found and corrected many errors in the equations 
of this chapter and of a dozen others throughout the book. 
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(1) approximate nature of a 
wave 


(2) local viewpoint vs. 
large-scale viewpoint 


Linearized theory of 
gravitational waves: 


Ripples of what? Ripples in the shape of the ocean’s surface; ripples in the shape 
(i.e., curvature) of spacetime. Both types of waves are idealizations. One cannot, 
with infinite accuracy, delineate at any moment which drops of water are in the 
waves and which are in the underlying ocean: Similarly, one cannot delineate 
precisely which parts of the spacetime curvature are in the ripples and which are 
in the cosmological background. But one can almost do so; otherwise one would 
not speak of “waves”! 

Look at the ocean from a rowboat. Waves dominate the seascape. Changes in 
angle and level of the surface occur every 30 feet or less. These changes propagate. 
They obey a simple wave equation 

+ + jj?) ( hd S ht ° f = °- 

Now get more sophisticated. Notice from a spaceship the large-scale curvature of 
the ocean’s surface—curvature because the Earth is round, curvature because the 
sun and the moon pull on the water. As waves propagate long distances, this 
curvature bends their fronts and changes slightly their simple wave equation. Also 
important over large distance are nonlinear interactions between waves, interaction 
with the wind, Coriolis forces due to the Earth’s rotation, etc. 

Spacetime is similar. Propagating through the universe, according to Einstein’s 
theory, must be a complex pattern of small-scale ripples in the spacetime curvature, 
ripples produced by binary stars, by supernovae, by gravitational collapse, by 
explosions in galactic nuclei. Locally (“rowboat viewpoint”) one can ignore the 
interaction of these ripples with the large-scale curvature of spacetime and their 
nonlinear interaction with each other. One can pretend the waves propagate in flat 
spacetime; and one can write down a simple wave equation for them. But globally 
one cannot. The large-scale curvature due to quiescent stars and galaxies will produce 
redshifts and will deform wave fronts; and the energy carried bv the waves them¬ 
selves will help to produce the large-scale curvature. This chapter treats both view¬ 
points, the local (§§35.2-6), and the global (§§35.7-15). 


§35.2. REVIEW OF "LINEARIZED THEORY" IN VACUUM 

Idealize, for awhile, the gravitational waves of our universe as propagating through 
flat, empty spacetime (local viewpoint). Then they can be analyzed using the “lin¬ 
earized theory of gravity,” which was introduced in Chapter 18. 

Linearized theory, one recalls, is a weak-field approximation to general relativity. 
The equations of linearized theory are written and solved as though spacetime were 
flat (special-relativity viewpoint); but the connection to experiment is made through 
the curved-space formalism of general relativity. 

More specifically, linearized theory describes gravity by a symmetric, second-rank 
tensor field h^. In the standard (“Lorentz,” or Hilbert) gauge, this field satisfies 
the “gauge” or “subsidiary” conditions (coordinate conditions) 

h^ a = 0. 


(1) Lorentz gauge condition 


(35.1a) 
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(Here, and throughout linearized theory, indices of h are raised and lowered with 
the Minkowski metric y aj3 .) In this gauge the propagation equations for vacuum 
gravitational fields are the familiar wave equations 

nv = v = o. (35.1b) 

Spacetime is really curved in linearized theory, although equations (35.1) are 
written and solved as though it were not. The global inertial frames of equations 
(35.1) are only almost inertial. In them the metric components are actually 

= V + K + Odh^] 2 ); (35.2a)* 

and the “metric perturbation” h^ is related to the “gravitational field” h w by 

h nv - V ~ V = Kv ~ y 

h~h a a = —h— — h a a . (35.2b) 

The metric (35.2a) governs the motion of test particles, the propagation of light, 
etc., in the usual general-relativistic manner. 

_ Recall the origin of the equations (35.1) that govern h^. The subsidiary conditions 
h* a — 0 were imposed by specializing the coordinate system; and the Einstein field 
equations in vacuum then reduced to Oh p = 0. 

Actually, as was shown in Box 18.2, the coordinates of linearized theory are not 
fully fixed by the conditions h* a — 0. There remains an ambiguity embodied in 
further “gauge changes” (infinitesimal coordinate transformations), £ , which satisfy 
a restrictive condition 

= 0 (35.3a) 

in order to preserve conditions (35.1a). Then 

^new = ^old + & (35.3b) 

is the coordinate transformation and 

^jiv new v old ^ y /i "E (35.3c) 

is the gauge change. All this was derived and discussed in Chapter 18. 

§35.3. PLANE-WAVE SOLUTIONS IN LINEARIZED THEORY 

The simplest of all solutions to the linearized equations h^ * — ~ 0 is the 

monochromatic, plane-wave solution, 

V = exp (ik a x a )]. (35.4a) 

* A more nearly rigorous treatment defines h pw = — 77 ^, and puts the small corrections 0([^] 2 ) 

into the field equations: 

VU = 0(fVF\ tt ), V.* = oah'j'rt). 



(2) propagation equation 


(3) metric 


(4) residual gauge freedom 


Monochromatic, plane wave 
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Plane wave has two degrees 
of freedom in amplitude (two 
polarizations) 


Transverse-traceless (TT) 
gauge: 

(1) for plane wave 


Here £[.] means that one must take the real part of the expression in brackets; 

while A My (iamplitude ) and k^ (wave vector) are constants satisfying 

k a k a = 0 (k a null vector), (35.4b) 

A^k* = 0 (A orthogonal to k) (35.4c) 

[consequences oih^, a a = 0 and h* a — 0, respectively; see (35.10) below for the true 
physics associated with this wave, the curvature tensor]. Clearly, this solution de¬ 
scribes a wave with frequency 

« 2= k° = (k* + /c y 2 + k z 2 ) 1/2 9 (35.5) 

which propagates with the speed of light in the direction (1 /k°)(k x , k y ,k z ). 

At first sight the amplitude A^ v of this plane wave appears to have six independent 
components (ten, less the four orthogonality constraints A^k* — 0). But this cannot 
be right! As Track-2 readers have learned in Chapter 21, the gravitational field in 
general relativity has two dynamic degrees of freedom, not six. Where has the 
analysis gone astray? 

One went astray by forgetting the arbitrariness tied up in the gauge freedom (35.3). 
The plane-wave vector 


£* = -iC* exp ( ik a x a ), (35.6) 

with four arbitrary constants O, generates a gauge transformation that can change 
arbitrarily four of the six independent components of A p . One gets rid of this 
arbitrariness by choosing a specific gauge. 


§35.4. THE TRANSVERSE TRACELESS (TT) GAUGE 

Select a 4-velocity u —not at just one event, but the same u throughout all of 
spacetime (special-relativistic viewpoint!). By a specific gauge transformation (exer¬ 
cise 35.1), impose the conditions 


A^u* = 0. (35.7a) 

These are only three constraints on A M „, not four, because one of them— k fi (A flv u v ) 
= 0—is already satisfied (35.4c). As a fourth constraint, use a gauge transformation 
(exercise 35.1) to set 


= 0. (35.7b) 

One now has eight constraints in all, A^u a = A m k a = A* = 0, on the ten 
components of the amplitude; and the coordinate system (gauge) is now fixed rigidly. 
Thus, the two remaining free components of A^ represent the two degrees of freedom 
(two polarizations) in the plane gravitational wave. 
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It is useful to restate the eight constraints A^u 0 = A m k a = A^ = 0 in a Lorentz 
frame where «° = 1, u> = 0, and in a form where k a does not appear explicitly: 


V = 0, 

i.e., only the spatial components 
hj k are nonzero; 

(35.8a) 

hut = 0, 

i.e., the spatial components are 
divergence-free; 

(35.8b) 

hkk — 

i.e., the spatial components are 


' 

trace-free. 

(35.8c) 


(Here and henceforth repeated spatial indices are to be summed, even if both are 
down; e.g, h kk = 2% =1 h kk .) Notice that, since h = h/ = h kk = 0, there is no dis¬ 
tinction between h^ and h^ in this gauge. 

Turn attention now away from plane waves to arbitrary gravitational waves in 
linearized theory. Any electromagnetic wave can be resolved into a superposition 
of plane waves, and so can any gravitational wave. For each plane wave in the 
superposition, introduce the special gauge (35.8). Note that the gauge conditions 
are all linear in h^. Therefore the arbitrary wave will also satisfy conditions (35.8). 
Thus arises the theorem: Pick a specific global Lorentz frame of linearized theory 
(i.e., pick a specific 4-velocity u). In that frame (where u a = 5“ 0 ), examine a specific 
gravitational wave of arbitrary form. One can always find a gauge in which h^ satisfies 
the constraints (35.8). Moreover, in this gauge only the h jk are nonzero. Therefore, 
one need only impose the six wave equations 

Dh ik = = 0. (35.9) 

Any symmetric tensor satisfying constraints (35.8) [but not necessarily the wave 
equations (35.9)] is called a transverse-traceless (TT) tensor —transverse because it 
is purely spatial (h 0fl = 0) and, if thought of as a wave, is transverse to its own 
direction of propagation (h tjJ = h tj = 0); traceless because h kk = 0. The most 
general purely spatial tensor S i} can be decomposed [see Amowitt, Deser, and Misner 
(1962) or Box 35.1] into a part Sjf, which is “transverse and traceless”; a part 
= W*i)f,kk ~ which is “transverse” (Sfjj = 0) but is determined entirely by 
one function/giving the trace of S (S T kk = V 2 /); and a part Sk = Sfj + Sf u which 
is “longitudinal” and is determined by the vector field Sf. In linearized theory hf. 
is a purely gauge part of h^, whereas h and hff are gauge-invariant parts of h 
The special gauge in which h „„ reduces to its transverse-traceless part is called the 
TT or transverse-traceless gauge. The conditions (35.8) defining this gauge can be 
summarized as 


V = h 7v (35.8d) 

As exercise 35.2 illustrates, only pure waves (and not more general solutions of the 
linearized field equations with source, Uh^ = - 16can be reduced to TT 
gauge. 


(2) for any wave 


Decomposition of spatial 
tensors 
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Curvature tensor in TT gauge In the TT gauge, the time-space components 

RjOkO — RojOk ~ = ^OjkO 

of the Riemann curvature tensor have an especially simple form [see equation (18.9) 
and exercise 18.4]: 

Rjoko = -y^oo- (35.10) 

Recall that the curvature tensor is gauge-invariant (exercise 18.1). It follows that 
h^ cannot be reduced to still fewer components than it has in the TT gauge. 

Box 35.1 describes methods to calculate h'jj from a knowledge of h /JLV in some 
other gauge. 


( -- 

Box 35.1 METHODS TO CALCULATE “TRANSVERSE-TRACELESS PART" OF A WAVE 


Problem: Let a gravitational wave A (f, in an 
arbitrary gauge of linearized theory be known. 
How can one calculate the metric perturbation 
hJJ(t,x*) for this wave in the transverse-traceless 
gauge? 

Solution 1 (valid only for waves; i.e., when 
= 0). Calculate the components Rj 0k0 of 
Riemann in the initial gauge; then integrate 
equation (35.10) 

*£00 = “ 2 j fyo*o (0 

to obtain hj k . When the wave is monochromatic, 
hpy = h^(x l )e~ ib3t ; then the solution of (1) has the 
simple form 

= 2 »-% oko . ( 2 ) 

Solution 2 (valid only for plane waves). “Project 
out” the TT components in an algebraic manner 
using the operator 

p ik = 8 jk - n i n k ■ ( 3 ) 

Here 

n k = k k /\k\ 

\ ___ 


is the unit vector in the direction of propagation. 
Verify that is a projection operator onto the 
transverse plane: 

Wlk = P)k n k — 0 * ?kk “ 2 ’ 

Then the transverse part of h jk is Pjih Lm P mk (or in 
matrix notation, PhP)\ and the TT part is this 
quantity diminished by its trace: 

= P»P mk h lm - j P jk (P ml h lm ) (4) 

(index notation), 

h TT = PhP - ^Plr(Ph) 

(matrix notation). (4') 

The sequence of operations that gives h?J cuts two 
parts out of h iy The first part cut out is 

hjk = \p^PM, (5) 

which is transverse but is built from its own trace, 

h T = Tr (PhP) = Tr (Ph) = P lm h ml . 
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Exercise 35.1. TRANSFORMATION OF PLANE WAVE TO TT GAUGE EXERCISES 

Let a plane wave of the form (35,4) be given, in some arbitrary gauge of linearized theory. 

Exhibit explicitly the transformation that puts it into the TT gauge. [Hint: Work in a Lorentz 
frame where the 4-velocity u* of the TT gauge is u° — 1, = 0. Solve for the four constants 

C** of the generating function (35.6) by demanding that h^ v satisfy the TT constraints (35.7).] 


Exercise 35.2. LIMITATION ON EXISTENCE OF TT GAUGE 

Although the metric perturbation h^ for any gravitational wave in linearized theory can be 
put into the TT form (35.8), nonradiative h f s cannot. Consider, for example, the external 
field of a rotating, spherical star, which cannot be written as a superposition of plane waves: 


'N 


The second part cut out of h { - is the longitudinal 
part 

h L jk = hjk “ PjlPmkhlm 

= n l n k h il + - n M n l n m h lm)> ( 6 ) 

or __ 

h L = h - PhP (60 

Solution 3 (general case). Fourier analyze any 
symmetric array A i; - = t) exp ( ik m x m ) d 3 k , 

and apply the formulas (4) from solution 2 to each 
Fourier component individually. But note that in 
this case one can write the projection operator in 
the direction-independent form ; 

P ]k = s !k --^d i d k (7) 

or 

n l «m = -p- 3 X 0 m ( 8 ) 

(provided the formulas are written with all A’s 
standing on the right), since d t = ik t under the 
Fourier integral. Of course the operation 1/V 2 can 
be evaluated by other methods, e.g., by Green’s 
functions, as well as by Fourier analysis. [The 


quantity xp = V -2 / stands for the solution xp of the 
Poisson equation V 2 \p = /] The advantage of this 
method is its power in certain analytic computa¬ 
tions (see, e.g., below). 

Gauge Transformations. The change in h^ v due 
to a gauge transformation is 

= -ifX, + 3^r). (9) 

The transverse part of this change is 

PjlPkrrS^hlm) = ~~ ^jl^krrS ?+ ^mfl) = 0^) 

To verify this formula for a plane wave (solution 
2), note that d t = i\k\n t and = 0. To verify 
the same result in general, use equation (7) to give 
the result 

P,ih = 0 . ( 11 ) 

Thus both hJJ of equation (4), and h^oi equation 
(5) are gauge-invariant: 

8Ag r =«A5 = 0- (12) 

In empty space (7^„ — 0), both hfj and another 

gauge-invariant quantity h 0k (discussed in exercise 
35.4) vanish, by virtue of the field equations. 
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Action of a gravitational wave 
on separation of two test 
particles 




ct r m 

U _ O A 

n 0 k — pj » 


r=(x 2 +? 2 + r 2 ) I/2 


[see equation (19.5)]. Here M is the star’s mass and 5 is its angular momentum. Show that 
this cannot be put into TT gauge. [Hint: Calculate R j0k0 and from it, by means of (35.10), 
infer hjf. Then calculate R 0ryx in both the original gauge and the new gauge, and discover 
that they disagree—not only by virtue of the mass term, but also by virtue of the angular- 
momentum term.] 


Exercise 35.3. A CYLINDRICAL GRAVITATIONAL WAVE 

To restore one’s faith, which may have been shaken by exercise 35.2, one can consider the 
radiative solution whose only nonvanishing component h^ v is 

h zz = 44 cos (t ct)J 0 (uy/x 2 + y 2 ), 

where J 0 is the Bessel function. This solution represents a superposition of ingoing and 
outgoing cylindrical gravitational waves. For this gravitational field calculate R ioko , and from 
it infer hjf. Then calculate several other components of R afjy& (e.g., R xyxy ) in the original 
gauge and in TT gauge, and verify that the answers are the same. 

Exercise 35.4. NON-TT PARTS OF METRIC PERTURBATION [Track 2] 

From Box 35.1 establish the formula h T = ^~ 2 (h kk n — h kl kl ); then verify the gauge invari¬ 
ance of h T directly, by showing that h kkLi — h kl kl is gauge-invariant. Use 8h i} = £ i y + £ ; i . 
Show similarly that the quantities h ok defined by 

hok — k ~ + hki t io) 

are gauge-invariant. Show from the gauge-invariant linearized field equations (18.5) that 

V 2 h T = - \6 ttT 00 , 

v %k = ~ I 67 rr oJfc’ 

so h T and h ok must vanish for waves in empty space. 


§35.5. GEODESIC DEVIATION IN A LINEARIZED 
GRAVITATIONAL WAVE 

The oscillating curvature tensor of a gravitational wave produces oscillations in the 
separation between two neighboring test particles, A and B. Examine the oscillations 
from the viewpoint of A. Use a coordinate system (“proper reference frame of A ”), 
with spatial origin x) = 0, attached to A’s world line (comoving coordinates); with 
coordinate time equal to A’s proper time (x° = t on world line x j = 0); and with 
orthonormal spatial axes attached to gyroscopes carried by A (“nonrotating frame”). 
This coordinate system, appropriately specialized, is a local Lorentz frame not just 
at one event on A’s geodesic world line, but all along ^4’s world line: 


ds 2 = — dx® 2 4 - Sjt dx’ dx^ + 0(\x*\ 2 ) dx a dx$. 


(35.11) 
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[Proof: such a “proper reference frame” was set up for accelerated particles in 
Track 2’s §13.6. The line element (13.71) derived there, when specialized to particle 
A (<3j = 0 because A falls freely; w* = 0 because the spatial axes are attached to 
gyroscopes) reduces to the above form, as in equation (13.73).] 

As the gravitational wave passes, it produces an oscillating curvature tensor, which 
wiggles the separation vector n reaching from particle A to particle B : 

D 2 n’/dr 2 - -R’ m n k = -R^n k . (35.12) 

The components of the separation vector are nothing but the coordinates of 
particle B , since particle A is at the origin of its own proper reference frame; thus, 



Moreover, at x’ = 0 [where the calculation (35.12) is being performed], the 
vanish for all x°; so dT^^^/dr also vanish. This eliminates all Christoffel-symbol 
corrections in D 2 n’/Dr 2 . Hence, equation (35.12) reduces to 

d 2 xj/dr 2 = -R m x B K (35.13) 

There is a TT coordinate system that, to first order in the metric perturbation 
moves with particle A and with its proper reference frame. To first order in 
the TT coordinate time t is the same as proper time r, and Rjj k0 = jRjoko- Hence, 
equation (35.13) can be rewritten 

(PxJ/dt* = -Rj& 0 V = \{t 2 h%/ct*)x B K (35.14) 

Suppose, for concreteness, that the particles are at rest relative to each other before 
the wave arrives (.xj = x m) > when hj£ = 0). Then the equation of motion (35.14) 
can be integrated to yield 

= Wk+WI (35-15) 

*- ^ -*at position of A 

This equation describes the wave-induced oscillations of B 's location, as measured 
in the proper reference frame of A. 

Turn to the special case of a plane wave. Suppose the test-particle separation lies 
in the direction of propagation of the wave. Then the wave cannot affect the separa¬ 
tion; there is no oscillation: 


W ^ h]kk k — 0. 

Only separations in the transverse direction oscillate; the wave is transverse not only 
in its mathematical description (hjj[), but also in its physical effects (geodesic deviation )! 


Transverse character of 
relative accelerations 
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EXERCISE 


Polarization of gravitational 
waves: 

(1) States of linear 
polarization, " + " 
and "X" 


Exercise 35.5. ALTERNATIVE CALCULATION OF RELATIVE OSCILLATIONS 

Introduce a TT coordinate system in which, at time t = 0, the two particles are both at rest. 
Use the geodesic equation to show that subsequently they both always remain at rest in the 
TT coordinates, despite the action of the wave. This means that the contravariant components 
of the separation vector are always constant in the TT coordinate frame: 

n j = xj — xf — const. 

Call this constant Transform these components to the comoving orthonormal frame; 

the answer should be equation (35.15). 


§35.6. POLARIZATION OF A PLANE WAVE 


Geodesic deviation in the transverse direction provides a means for studying and 
characterizing the polarizations of plane waves. 

Consider a plane, monochromatic wave propagating in the z direction. In the TT 
gauge the constraints = 0, hfjj = ikfifT = 0, and - 0 reveal that the only 
nonvanishing components of h JJ are 


= -h % = *{ A + e -+«-*} 9 


(35.16) 


The amplitudes A + and A x represent two independent modes of polarization. 

As for electromagnetic plane waves (Figure 35.1), so also for gravitational plane 
waves (Figure 35.2), one can resolve a given wave into two linearly polarized 
components, or, alternatively, into two circularly polarized components. 



Figure 35.1. 

Plane Electromagnetic Waves. 
Polarization vector: e p 
Vector Potential 

A = 

Acceleration of a test charge: 
a = ( q/m)E = (q/m)( — dA/dt) 
= &[iu(q/m)A Q e- iu,{t - z) e p ] 

Displacement of charge relative 
to inertial frame: 

Sj: = S j 
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For linearly polarized waves , the unit polarization vectors of electromagnetic theory 
are e x and e y . A test charge hit by a plane wave with polarization vector e x oscillates 
in the jc-direction relative to an inertial frame; and similarly for e r By analogy, 
the unit linear-polarization tensors for gravitational waves are 


e+ = e x ® e x - e y 0 e y , (35.17a) 

= e y + ® e x . (35.17b) 

The plane wave (35.16), when A x = 0, has polarization e + and can be rewritten 

h jk = &{A + e- iM -*e +jk }. (35.18) 

Its effect in altering the geodesic separation between two test particles depends on 
the direction of their separation. To see the effect in all directions at once, consider 
a circular ring of test particles in the transverse (x,y) plane, surrounding a central 
particle (Figure 35.2). As the plane wave (35.18) (polarization e + ) passes, it deforms 
what was a ring as measured in the proper reference frame of the central particle 
into an ellipse with axes in the x and y directions that pulsate in and out: 

OooooO 0 OOooooO 0 0O 

etc. By contrast (Figure 35.2), a wave of polarization e x deforms the ring at a 
45-degree angle to the x and y directions: 00^000^00 etc. 

For circularly polarized waves , the unit polarization vectors of electromagnetic (2) States of circular 
theory are polarization 

e R = y=-(** + /e »)> e L = -~{e x -ie y ) (35.19) 



Figure 35.2. 

Plane Gravitational Waves. Polarization tensor: 

e p 

Metric perturbation: 

h jk = *[. 

Tidal acceleration between two test particles: 


-/WU=T — 




Dr 2 "v™"* 2 dt 2 

= *[-y u 2 A 0 e~ iM - c) e Pjt n k j 
Separation between two test particles: 
n- = nf m + & 


Position of test particle B in proper reference 
frame of test particle A. (In drawing, A is the 
central particle and B is any peripheral panicle): 

xj = 4 - £ 
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Spin-2 character of 
gravitational field and its 
relation to symmetries of 
waves 


EXERCISES 


Similarly, the unit circular polarization tensors of gravitation theory are 


= -W( e + + ie xl 

V2 


= vf <e * “ 


(35.20) 


A test charge hit by an electromagnetic wave of polarization e R moves around and 
around in a circle in the righthanded direction (counterclockwise for a wave propa¬ 
gating toward the reader); for e L it circles in the lefthanded (clockwise) direction 
(see Figure 35.1). Similarly (Figure 35.2), a gravitational wave of polarization e R 
rotates the deformation of a test-particle ring in the righthanded direction, 

o o 0 0 

while a wave of e L rotates it in the lefthanded direction. The individual test particles 
in the ring rotate in small circles relative to the central particle. However, just as 
the drops in an ocean wave do not move along with the wave, so the particles on 
the ring do not move around the central particle with the rotating ellipse. 

Notice from Figure 35.2 that, at any moment of time, a gravitational wave is 
invariant under a rotation of 180° about its direction of propagation. The analogous 
angle for electromagnetic waves (Figure 35.1) is 360°, and for neutrino waves it 
is 720°. This behavior is intimately related to the spin of the zero-mass particles 
associated with the quantum-mechanical versions of these waves: gravitons have 
spin 2, photons spin 1, and neutrinos spin 1/2. The classical radiation field of a 
spin-S particle is always invariant under a rotation of 360%S about its propagation 
direction. 

A radiation field of any spin S has precisely two orthogonal states of linear 
polarization. They are inclined to each other at an angle of 90%S; thus, for a 
neutrino field, with S = the two states are distinguished as | f > and | \ > (spin 
up and spin down; 180° angle). For an electromagnetic wave S = 1 and two ortho¬ 
gonal states of polarization are e x and e y (90° angle). For a gravitational wave S = 2, 
and two orthogonal states are e + and e x (45° angle). 


Exercise 35.6. ROTATIONAL TRANSFORMATIONS FOR POLARIZATION STATES 

Consider two Lorentz coordinate systems, one rotated by an angle 6 about the z direction 
relative to the other: 

t' = t, x f = x cos 6 + y sin 0, y' = y cos 6 — x sin 0, z’ = z. (35.21) 

Let | f > and ||> be quantum-mechanical states of a neutrino with spin-up and spin-down 
relative to the * direction; and similarly for |f'> and |j/>. Let e x , e y , e x ., e v be the unit 
polarization vectors in the two coordinate systems for an electromagnetic wave traveling in 
the z-direction; and similarly e + , e x , e + >, e x , for a gravitational wave in linearized theory. 
Derive the following transformation laws: 

lt'> = It>cosi0 + ||> sin-L#; |f> = -\j >S m±d + |j> cos-Ltf; 

e x , = e x cos 6 + e v sin 8; e y . = -e x sin 8 + e y cos 8; (35.22) 

e + ’ = e + cos 26 + e x sin 26; e x , = - e + sin 26 + e x cos 26. 
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What is the generalization to the linear-polarization basis states for a radiation field of 
arbitrary spin SI 

Exercise 35.7. ELLIPTICAL POLARIZATION 

Discuss elhptically polarized gravitational waves in a manner analogous to the discussion 
of linearly and circularly polarized waves in Figure 35.2. 



§35.7. THE STRESS-ENERGY CARRIED BY 
A GRAVITATIONAL WAVE 


Exercise 18.5 showed that in principle one can build detectors which extract energy 
from gravitational waves. Hence, it is clear that the waves must carry energy. 

Unfortunately, to derive and justify an expression for their energy requires a 
somewhat more sophisticated viewpoint than linearized theory. Such a viewpoint 
will be developed later in this chapter (§§35.13 and 35.15). But for the benefit of 
Track-1 readers, the key result is stated here. 

In accordance with the discussions in §§19.4 and 20.4, the stress-energy carried 
by gravitational waves cannot be localized inside a wavelength. One cannot say 
whether the energy is carried by the crest of a wave, by its trough, or by its “walls.” 
However, one can say that a certain amount of stress-energy is contained in a given 
“macroscopic” region (region of several wavelengths’ size), and one can thus talk 
about a tensor for an effective smeared-out stress-energy of gravitational waves, 
In a (nearly) inertial frame of linearized theory, Tfff’ is given by 


Approximate localization of 
energy in a gravitational 
wave 


Effective stress-energy tensor 
for gravitational waves: 


jiGW) _ 

fiv — 



(35.23) 0) expressed in terms of 

metric perturbations 


where < ) denotes an average over several wavelengths and hf? means the (gauge- 
invariant) transverse-traceless part of h^, which is simply h jk in the TT gauge. 
Another formula for Tj®™, valid in any arbitrary gauge, with h ■£ 0, h a a ^ 0, and 

K 5* 0 is 


7’(gw) _ 1 

~ 327r 




h af3 ,-j ( 35 - 23 ') 


This stress-energy tensor, like any other, is divergence-free in vacuum 


7«™y v = 0; (35.24) ^ subject to conservation 

* law 

and it contributes to the large-scale background curvature (which linearized theory 
ignores) just as any other stress-energy does: 


G™ = M Tf™ + 7'<™ atter) + r^ therfields) ). (35.25) < 3 ) role as source of 

background curvature 

In writing here the term rj*J w) for the effective smeared-out energy density of the 
gravitational wave, one is foregoing any further insertion of the gravitational wave 
into the Einstein equation. Otherwise one might end up counting twice over the 
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(4) for a plane, 

monochromatic wave 


Conditions for validity of 
gravitational-wave formalism 


Nonlinear effects in 
gravitational waves: 


(1) radiation damping 


(2) refraction 


contribution of the same wave to the background curvature of space, even though 
expressed in very different formalisms. 

According to equation (35.23), the stress-energy tensor for the plane wave, 

h f , = *UA+e+„ + A^)e-'*-*}\' (35.26) 

is 

r (GW> = r <GW> = _ 7-<GW) = 1 W 2 (M |2 + lA J2y (35.27) 

jZtt 

Notice that the background radius of curvature 6L (ignored by linearized theory), 
and the mean reduced wavelength A (- wavelength/2 tt) and amplitude d of the 
gravitational waves satisfy 

6l~ 2 ~~ typical magnitude of components of 

~~ ~~ d 2 ! A 2 if TjjJJi w) is chief source of background curvature, 

^ ~~ ^ 2 / X 2 ^ is not chief source. 

Consequently, the dimensionless numbers d and A /are related by 

a < x/€i. (35.28) 

Thus, the whole concept of a small-scale ripple propagating in a background of 
large-scale curvature breaks down, and the whole formalism of this chapter becomes 
meaningless , if the dimensionless amplitude of the wave approaches unity. One must 
always have d l as well as A < 61 if the concept of a gravitational wave is to make 
any sense! 


§35.8. GRAVITATIONAL WAVES IN THE FULL THEORY 
OF GENERAL RELATIVITY 

Curving up of the background spacetime by the energy of the waves is but one 
of many new effects that enter, when one passes from linearized theory to the full, 
nonlinear general theory of relativity. 

In linearized theory one can consider a localized source of gravitational waves 
(e.g., a vibrating bar) in steady oscillation, radiating a strictly periodic wave. But 
the exact theory insists that the energy of the source decrease secularly, to counter¬ 
balance the energy carried off by the radiation (energy conservation; gravitational 
radiation damping; see §§36.8 and 36.11). This makes an exactly periodic wave 
impossible, though a very nearly periodic one can certainly be emitted [Papapetrou 
(1958); Amowitt, Deser, and Misner as reported by Misner (1964b)]. 

In the real universe there are spacetime curvatures due not only to the energy 
of gravitational waves, but also, and more importantly, to the material content of 
the universe (planets, stars, gas, galaxies). As a gravitational wave propagates through 
these curvatures, its wave fronts change shape (“refraction”), its wavelength changes 
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(gravitational redshift), and it backscatters off the curvatures to some extent. If the 
wave is a pulse, the backscatter will cause its shape and polarization to keep changing 
and will produce “tails” that spread out behind the moving pulse, traveling slower 
than light [see exercise 32.10; also Riesz (1949), DeWitt and Brehme (1960), DeWitt 
and DeWitt (1964a), Kundt and Newman (1968), Couch et. al (1968)]. However, so 
long as d 1 and A/fS 1, these effects will be extremely small locally. They can 
only build up over distances of the order of —and sometimes not even then. Thus, 
locally, linearized theory will remain highly accurate. 

Even in an idealized universe containing nothing but gravitational waves, back- 
scatter and tails are produced by the interaction of the waves with the background 
curvature that they themselves produce. 

If the reduced wavelength A — \/2it and the mass-energy m of a pulse of waves 
satisfy A < m, it is possible (in principle) to focus the pulse into a region of size 
r < m, whereupon a part of the energy of the pulse will undergo gravitational 
collapse to a singularity, leaving behind a black hole [see Ruffini and Wheeler (1970), 
and pp. 7-24 of Christodoulou (1971)]. Short of a certain critical strength, no part 
of the pulse undergoes such a collapse. But it does undergo a time delay before 
reexpanding. This time delay is definable and measurable in the asymptotically flat 
space, far from the domain where the energy a little earlier underwent temporary 
focusing into dimensions of order A. 

All these effects can be analyzed in general relativity theory using approximation 
schemes which, in first order, are similar to or identical to linearized theory. Later 
in this chapter (§§35.13-35.15), one such approximation scheme will be developed. 
But first it is helpful to study an exact solution that exhibits some of these effects. 


§35.9. AIM EXACT PLANE-WAVE SOLUTION 

Any exact gravitational-wave solution that can be given in closed mathematical form 
must be highly idealized; otherwise it could not begin to cope with the complexities 
outlined above. Consequently, mathematically exact solutions are useful for peda¬ 
gogical purposes only. However, pedogogy should not be condemned: it is needed 
not only by students, but also by veteran workers in the field of relativity, who 
even today are only beginning to develop intuition into the nonlinear regime of 
geometrodynamics! 

From the extensive literature on exact solutions, we have chosen, as a compromise 
between realism and complexity, the following plane wave [Bondi et. al. (1959), Ehlers 
and Kundt (1962)]: 


Here 


ds 2 — L 2 (e 2f2 dx 2 + e dy 2 ) -f dz 2 — dt 2 

= Lr(e 2 & dx 2 + e~ 2 $ dy 2 ) - da dv. 


(35.29a) 


(3) redshift 

(4) backscatter 

(5) tails 



(6) self-gravitational 
attraction 



-N 

The rest of this chapter is 
Track 2. No earlier Track-2 
material is needed as 
preparation for it, but 
Chapter 20 (conservation laws) 
and §22.5 (geometric optics) 
will be found to be helpful. It 
is not needed as preparation 
for any later chapter. 

v _ J 

Exact plane-wave solution of 
vacuum field equation: 


(1) form of metric 


u-t- z. 


U= t + z, L — L(u\ p = P(u). (35.29b) 
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The forms that the functions L(u) (“background factor 0 ) and ft(u) (“wave factor”) 
can take are determined by the vacuum field equations. In the null coordinate system 
u, u. x, j, the only component of the Ricci tensor that does not vanish identically 
is (see Box 14.4, allowing for the difference in coordinates, 2i> there = t; here ) 

R uu = -2 L-\L" + (ft') 2 L\ (35.30) 

where the prime denotes d/du . Thus, Einstein’s equations in vacuum read 

(2) generation of 

"background factor" L L" + ( ft') 2 L = 0. (35.31) 

by "wave factor" ft 

(“effect of wave factor on background factor”) 

(3) linearized limit The linearized version of this equation is L" = 0, since (ft') 2 is a second-order 

quantity. Therefore the solution corresponding to linearized theory is 

L = 1, ft(u) arbitrary but small. 

The corresponding metric is 

ds 2 = (1 + 2ft) dx~ + (1 - 2ft) dy 2 + dz 2 - dt 2 y ft = ft(t - z). (35.32) 

Notice that this is a plane wave of polarization e + propagating in the z-direction. 
(See exercise 35.10 at end of §35.12 for the extension to a wave possessing both 
polarizations, e + and e x .) 

(4) special case: a Return attention to the exact plane wave, and focus on the case where the “wave 

plane-wave pulse factor” ft(u) is a pulse of duration 2 T, and \ft'\ < \/T throughout the pulse. Then 

the exact solution (Figure 35.3) is: (1) for u < —T (flat spacetime; pulse has not 
yet arrived), 

j8 = 0, 1 = 1; (35.33a) 


(2) for — T < u < 4- T (interior of pulse), 

ft — ft(u) is arbitrary, except that \ft'\ < 1 /T, 

L(u) = 1 - f {/" EA'(«)] 2 d!?} du + OW'Tf); 

(3) for u > T (after the pulse has passed), 


(35.33b) 


(8 = 0 , 


1 = 1 --, 

a 



0([P’T] 2 ) 

T 

f (P’fdu 

J —T 


(35.33c) 


Before discussing the physical interpretation of this exact solution, one must come 
to grips with the singularity in the metric coefficients at u = a > T. (There L — 0, 
so Sxx — gyy — 0*) Is this a physical singularity like the region r = 0 of the 
Schwarzschild geometry, or is it merely a coordinate singularity as r = 2 M is in 
Schwarzschild coordinates (Chapters 31, 32, and 33)? The only nonzero compo¬ 
nents of the Riemann tensor for the metric (35.29) are (see Box 14.4) 
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t 



Figure 35.3. 

Spacetime diagram and pulse profile for an exact plane-wave solution to Einstein’s 
vacuum field equations. The metric has the form 

ds 2 = L 2 (e 2fi dx 2 + e~ 2 & dy 2 ) -f dz 2 — dt 2 . 

The “wave factor” fi{u) = - z ) (short-scale ripples) and the “background fac¬ 

tor” L(u) = L{t — z) (large-scale bending of the background geometry by the 
effective mass-energy of the “ripply” gravitational wave) are shown in the drawing 
and are given analytically by equations (35.33). 



K x uru = - P" - 2 (L'/LW, (35.34) 

= \r uu + P" + 2 (L'/W- 


Moreover, these components both vanish in any extended region where [i = 0. Thus, 

spacetime is completely flat in regions where the “wave factor ” vanishes—which is (5) spacetime is flat outside 
everywhere outside the pulse! In particular, spacetime is flat near u = a , so the the P ulse 
singularity there must be a coordinate singularity, not a physical singularity. To 
eliminate this singularity, one can perform the coordinate transformation 


X Y 

1 - Ufa' y ~ 1 - U/a' 


u = U, 


v = 


V + 


X 2 + Y 2 
a - U 


(35.35) 


throughout the region to the future of the pulse (w > T), where 
ds 2 = (1 — u/a) 2 (dx 2 + dy 2 ) — du dv. 


(35.36a) 
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EXERCISES 


Flatness outside 
gravitational-wave pulses 
unusual 


Action of exact 
gravitational-wave pulse 
test particles: 


In the new X , Y, U , K coordinates the metric has the explicitly flat form 

ds 2 = dX 2 + dY 2 - </£/</P for U = u> T. (35.36b) 


Exercise 35.8. GLOBALLY WELL-BEHAVED COORDINATES FOR PLANE WAVE 
[based on Ehlers and Kundt (1962)] 

Find a coordinate transformation similar to (35.35), which puts the exact plane-wave solution 
(35.29a), (35.31), into the form 

ds 2 = dX 2 + dY 2 - dUdV + {X 2 - Y 2 )FdU 2 , (35.37) 

F = F(U) completely arbitrary. 

This coordinate system has the advantage of no coordinate singularities anywhere; while 
the original coordinate system has the advantages of an easy transition to linearized theory, 
and easy interpretation of the action of the wave on test particles. 

Exercise 35.9. GEODESIC COMPLETENESS FOR PLANE-WAVE MANIFOLD 
[based on Ehlers and Kundt (1962)] 

Prove that the coordinate system (X, Y , U, V) of exercise 35.8 completely covers its spacetime 
manifold. More specifically, show that every geodesic can be extended in both directions 
for an arbitrarily large affine-parameter length without leaving the X , Y , U\ V coordinate 
system. (This property is called geodesic completeness.) [Hint: Choose an arbitrary event and 
an arbitrary tangent vector d/d\ there. They determine an arbitrary geodesic. Perform a 
coordinate transformation that leaves the form of the metric unchanged and puts<//</\ either 
in the (£7, V) = constant 2-surface, or in the X, Y) = constant 2-surface. Verify that the two 
coordinate systems cover the same region of spacetime. Then analyze completeness of d/dXs 
geodesic in ( X , Y, U , V) coordinates.] 


§35.10. PHYSICAL PROPERTIES OF THE EXACT PLANE WAVE 

Spacetime is completely flat both before the arrival of the plane-wave pulse 
(u < — T) and after it has passed (w > T). This is the message of the last paragraph. 

Complete flatness outside the pulse is very atypical for gravitational waves in the 
full, nonlinear general theory of relativity. It occurs in this example only because 
the wave fronts (surfaces of constant u and v , i.e., constant z and /) are perfectly 
flat 2-surfaces. If the wave fronts were bent (e.g., spherical), the energy carried by 
the pulse would produce spacetime curvature outside it. 

To see nonlinear effects in action, turn from the flat geometry outside the pulse 
to the action of the pulse on freely falling test particles. Consider a family of particles 
that are all at rest in the original t,x,y,z coordinate system (world lines: [x,y,z] 
= constant) before the pulse arrives. Then even while the pulse is passing, and after 
it has gone, the particles remain at rest in the coordinate system. (This is true for 
any metric, such as (35.29a), with g 0fJL — — 8°^, for then — 0, so = 5^ 0 r + 
const, satisfies the geodesic equation.) 
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Two particles whose separation is in the direction of propagation of the pulse 
(z-direction) have not only constant coordinate separation, Ax = Ay = Oand Az ^ 0; 
they also have constant proper separation, 4s = g zz 1/2 Az = Az. Hence, the exact 
plane wave is completely transverse, like a plane wave of linearized theory. 

Neighboring particles transverse to the propagation direction, {Ax ^ 0 , Ay ^ 0, 
Az = 0) have a proper separation that wiggles as the pulse passes: 

As = L{t - z)[e 2 ^*~ z \Ax) 2 + 2 ji /2 

~ m + 2 /3){Ax) 2 + (1 - 2p){Ay) 2 ]'' 2 . (35.38) 

Superimposed on the usual linearized-theory type of wiggling, due to the “wave 
factor /}, is a very small net acceleration of the particles toward each other, due 
to the “background factor” L [note the form of L{u) in Figure 35.3], This is an 
acceleration of almost Newtonian type, caused by the gravitational attraction of the 
energy that the gravitational wave carries between the two particles. The total effect 
of all the energy that passes is to convert the particles from an initial state of relative 
rest, to a final state of relative motion with speed 

y final = d As/dt - d(L AsJ/dt = -Asja, (35.39) 

where 


ASi = [{Ax) 2 + {Ay) 2 ] 1/2 = (initial particle separation). 

[Recall: L initiaI = 1, T final = 1 - u/a - 1 - (/ - z)/a; equation (35.33).] 

Precisely the same effect can be produced by a pulse of electromagnetic waves 
(§35.11). 


§35.11. COMPARISON OF AN EXACT ELECTROMAGNETIC 
PLANE WAVE WITH THE GRAVITATIONAL 
PLANE WAVE 

Consider the metric 

ds 2 — L 2 {u){dx 2 + dy 2 ) — du dv, \ U ~ ( Z j, (35.40) 

which is always flat if it satisfies the vacuum Einstein equations (R = 0 or L" = 0), 
and therefore cannot represent a gravitational wave. In this metric the electromag¬ 
netic potential 

A = A m dx» = A(u) dx (35.41) 

satisfies Maxwell’s equations for arbitrary A{u). It represents an electromagnetic 
plane wave analogous to the gravitational plane wave of the last few sections. The 
only nonzero field components of this wave are 

F ux = A\ i.e., F tx = —F zx = A'; (35.42) 


(1) transverse character of 
relative accelerations 


(2) gravitational attraction 
due to energy in pulse 


An electromagnetic 
plane-wave pulse 
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so the electric vector oscillates back and forth in the x-direction, the magnetic vector 
oscillates in the v-direction, and the wave propagates in the z-direction. The stress- 
energy tensor in x,y\ w, u, coordinates has only 

T uu = (35.43) 


nonzero. 

The Maxwell equations are already satisfied by the potential (35.41) in the back¬ 
ground metric (35.40), as the reader can verify. In order to make that metric itself 
equally acceptable, one need only impose the Einstein equations G^, = ixT . They 
read [see equation (35.30) with f} = 0] 

L" + (4ttT uu )L = 0. (35.44) 


Electromagnetic plane wave 
and gravitational plane wave 
produce same gravitational 
attractions 


This has exactly the form of the equation L” + (P') 2 L = 0 for the gravitational 
plane wave. Consequently, the relative motions of uncharged test particles produced 
by the “background factor” L(u) are the same whether L(u) ^ 1 is produced by 
the stress-energy of an electromagnetic wave, or by a corresponding gravitational 
wave with 


[( / S')V4^] gravwave = [r MB ] emwave = (35.45) 

The analogy can be made even closer. Decrease the wavelength of the waves, 
while holding (/T) 2 /47r and (A') 2 /4 f nL 2 fixed: 

<(/?') 2 /4t7> = ((A') 2 /4ttL 2 ) = const; X — *- 0. 

In the limit of very small wavelength (i.e., from a viewpoint whose characteristic 
length is >T), the two solutions are completely indistinguishable. Their metrics are 
identical (X —0 and <(/T) 2 ) = const, imply /? —v 0), and their jigglings of test 
particles are too small to be seen. Only their curving up of spacetime (L ^ 1) and 
the associated gravitational pull of their energy are detectable. 


§35.12. A NEW VIEWPOINT ON THE EXACT PLANE WAVE 


Exact gravitational plane 
waves reexamined in the 
language of "short-wave 
approximation": 

(1) ripples vs. background 


The above comparison suggests a viewpoint that was sketched briefly in the intro¬ 
duction to this chapter and in §35.8. Think of the exact gravitational plane-wave 
solution [Figure 35.3; equations (35.29) and (35.33)] as ripples in the spacetime 
curvature, described by propagating on a very slightly curved background 
spacetime, characterized by L(u). The most striking difference between the back¬ 
ground and the ripples is not in the magnitude of their spacetime curvatures, but 
in their characteristic lengths. The ripples have characteristic length 

X = (typical reduced wavelength, \/2tt, of waves); (35.46) 

the background has characteristic length (“radius of curvature of background 
geometry”) 
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1 


* ~ I^T /2 insidew ave ~ V\n (35.47) 

Recall that A is somewhat smaller than the pulse length, 2 T. Recall also that 
\P'T\ < 1. Conclude that the characteristic lengths of the “wave factor” and the 
“background factor” differ greatly: 


X < #. (35.48) 

This difference in scales enables one to separate out the background from the ripples. 

The ripples are very much smaller in scale (A < $) than the background. Never¬ 
theless the local curvature in a ripple is correspondingly larger than the background 
curvature [equations (35.30), (35.34)]; thus, 

uxu)background ( ^ ^ uyu ^background ~ f 1/^ , 

(^uxJwaves = ~(* VUves = ~ l£'|/A ~ 1/(A«) (35.49) 

~ (^/A)(R tMru)backgr oun d • 

One is reminded of the mottled surface of an orange! 

The metric for the background of the gravitational plane wave is the same as 
for the electromagnetic one [equation (35.40)]: 

ds 2 = gfi dx» dx v = L 2 (dx 2 + dy 2 ) - du dv . (35.50) 

By comparison with equation (35.29a), one sees that the metric for the full spacetime 
(background plus ripple) is 


ds 2 = (g® + h^) dx * (35.51) 

A xx = ~ h vv = 2 P’ a11 other V = (35.52) 

(Recall, in the region where /? ^ 0, L is very nearly 1.) One can think of the ripples (2) 
as a transverse, traceless, symmetric tensor field analogous to the electromagnetic 
field A /I , propagating in the background geometry. Just as the electromagnetic field 
produces the background curvature via 

G uu ~ - 2 L"/L = 8 tt T uu , 

so the gravitational-wave ripples h^ produce the background curvature via equation 
(35.31), which one can rewrite as 

<7® = -2 L"/L = 8 (35.53) 

Here 

r ® FF) = 4^^') 2 = 3^V,uV,« (35.54) <3) 

is the “effective stress-energy tensor” for the gravitational waves. Notice that it agrees, 
except for averaging, with the expression (35.23) that was written down without 
justification in §35.7. 


propagation of ripples in 
background 


effective stress-energy 
tensor for ripples 
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EXERCISE 


Foundations for shortwave 
formalism: 

(1) 61, X, and 3 defined 

(2) demand that 3 < 1 and 

X/61 < 1 

(3) split of metric into 
background plus 
perturbation; "steady 
coordinates" 


(4) Split of Ricci curvature 
tensor 


Exercise 35.10. PLANE WAVE WITH TWO POLARIZATIONS PRESENT 

The exact plane-wave solution (35.29) has polarization Construct a similar solution, 
containing two arbitrary amplitudes, fi{u) and y(w), for polarizations e + and e x . Extend the 
discussions of §§35.9-35.12 to this solution. 


§35.13. THE SHORTWAVE APPROXIMATION 

The remainder of this chapter extends the above viewpoint in a rigorous manner 
to very general gravitational-wave solutions. This extension is called the “shortwave 
formalism , it was largely devised by Isaacson (1968a,b), though it was built on 
foundations laid by Wheeler (1964a) and by Brill and Hartle (1964). Versions that 
are even more rigorous have been given in the W.K.B. or geometric-optics limit by 
Choquet-Bruhat (1969), and by MacCallum and Taub (1973). 

Consider gravitational waves propagating through a vacuum background space- 
time. As in §35.7, let 61 be the typical radius of curvature of the background; let 
X and 3 be the typical reduced wavelength (A/2 ir) and amplitude of the waves; 
and demand both 3 < 1 and X/61 < 1. The background curvature might be due 
entirely to the waves, or partly to waves and partly to nearby matter and nongravi- 
tational fields. 

The analysis uses a coordinate system closely “tuned” to spacetime in the sense 
that the metric coefficients can be split into “background” coefficients plus perturba¬ 
tions 


&IIV & [IV hjXV 

with these properties: (1) the amplitude of the perturbation is d 

(35.55) 

h v.v < (typical value of g®>) • d\ 

(2) the scale on which g ®? varies is > fi? 

(35.56a) 

£ (typical value of g®>)/«; 

(35.56b) 

(3) the scale on which h ^ varies is 


V,<* ~ (typical value of h^/X. 

Such coordinates are called “steady.” 

(35.56c) 

A rather long computation (exercise 35.11) shows that the Ricci tensor for an 
expanded metric of the form (35.55) is 

R m> = + R ^l(h) + R { ‘l(h) + error. 

? d / X 2 d*/X 2 d 3 /X 2 

(35.57) 
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Here a marker {d/X 2 , eta) has been placed under each term to show its typical 
order of magnitude; R®* is the Ricci tensor for the background metric g®?; and 
R^l and R i2) v are expressions defined by 


R { ^l(h) — + h afl u a 4- h aplfl a ), (35.58a) 

R%W = ~2^2hafi\fth a ^\ v + h afi {h ai 8! ^ + h^ v | ajQ — h afl — h av \np) 

+ h“ l P(h afll p — hpp | tt ) — p — —+ A aJf , A 

(35.58b) 


In these expressions and everywhere below, indices are raised and lowered with g® } , 
and an upright line denotes a covariant derivative with respect to g® } (whereas in 
Chapter 21 it denoted covariant derivative with respect to. 3-geometry). 

At the heart of the shortwave formalism is its method for solving the vacuum 
field equations R = 0. One begins by selecting out of expression (35.57) the part 
linear in the amplitude of the wave d, and setting it equal to zero. The action of 
the waves to curve up the background is a nonlinear phenomenon (linearized theory 
shows no sign of it); so cannot be linear in d. Hence, in expression (35.57), 
R^lih) is the only linear term, and it must vanish by itself 


Split of vacuum field 
equations into "wave part" 
(cctf’) plus "coarse-grain 
part" (cc tf 1 and smooth on 
scale T) plus "fluctuationa! 
corrections" (cc^/ 2 and ripply 
on scale X) 


= 0 . 


(35.59a) 


[Of course h^ v may contain nonlinear correction terms—call them j —of order d 2 , 
which must not be constrained by this linear equation. They will be determined 
by (35.59c), below.] 

One next splits the remainder of R^ v into a part that is free of ripples—i.e., that 
varies only on scales far larger than X (“coarse-grain viewpoint”), and a second part 
that contains the fluctuations. This split can be accomplished by averaging over 
several wavelengths (see exercise 35.14 for a precise treatment of the averaging 
process, also see Choquet-Bruhat (1969) for a class of solutions where such averaging 
is not required): 


R™ + <R ( ;l(h)> + error = 0 

? a 2 /x 2 a 3 /x 2 


;smooth 

part 


(35.59b) 


R{ jt\(J) + R®l(.h) - <R (2 l(h)> + error = 0 

a 2 /x 2 \ a 2 /x 2 a 2 /x 2 a 3 /x 2 


\Tnonlinear cor¬ 
rection to h 


fluctuating 

part 


(35.59c) 


That’s all there is to it!—except for reducing the equations to manageable form, 
and a fuller interpretation of the physics. 

Begin with the interpretation. 


Physical interpretation of the 
three parts of field equations: 
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(1) propagation of waves 

(2) production of background 
curvature by energy of 
waves; 7" ( ° w) defined 


(3) nonlinear self-interaction 
of waves 


EXERCISE 


Equation (35.59a) is an equation for the propagation of the gravitational 
waves h^,. 

Equation (35.59b) shows how the stress-energy in the waves creates the background 
curvature. It can be rewritten in the more suggestive form 

in vacuum, (35.60) 

where 

n G , W> = - (35.61) 

is the stress-energy tensor for the gravitational waves. Now one sees the origin of 
the statement in §35.7, that the stress-energy of gravitational waves is well-defined 
only in a smeared-out sense. 

Finally, equation (35.59c) shows how the gravitational waves h generate nonlinear 
corrections j to themselves (wave-wave scattering, harmonics of the fundamental 
frequency, etc.). These higher-order effects will not be investigated in this chapter. 


Exercise 35.11. CONNECTION COEFFICIENTS AND CURVATURE 
TENSORS FOR A PERTURBED METRIC 

In a specific coordinate frame of an arbitrary spacetime, write the metric coefficients in 
covariant representation in the form 


Zplv ~ Sfp + V* (35.62a) 

(At the end of the calculation, one can split into two parts, h^ h^ + / ; and out 
of this split obtain the formulas used in the text.) Assume that the typical components of 
h^, are much less than those of g*®?; so one can expand Christoffel symbols and curvature 
tensors in h 4Jf . Raise and lower indices of with g®J; and denote by a covariant 
derivatives relative to gj®? and by a covariant derivatives relative to g 

(a) Here g M „ and gj®? can be thought of as two different metrics coexisting in the spacetime 
manifold. Show that the difference between the corresponding covariant derivatives, V — 
V <B) ~ S —indeed, the difference between any two covariant derivatives!—is a tensor with 
components 

S *fiy = r “fiy ~ r<S \y .(35.62b) 

[Hint: See part B of Box 10.3.] 

(b) Show that 

g"' = g® ) '“' - + • • •, (35.62c) 

and also that 

g"” = g® 1 '"' - h*’ + h^h a v - h^hjh^gy. (35.62c') 
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£ 


(c) By calculating in a local Lorentz frame of g® and then transforming back to the 


original frame, show that 

= 2"£ Ma (^a:0lY + ^ay\p ~ ^py\ a)’ (35.62d) 

R%yS ~ R iB) * 0 yS = ^lY ~ S%y,| + " S% 8 S^ yi (35.62e) 

*/?5 “ = ^“jSSIa “ ^“jSatS + (35.62f) 

(d) Show that expression (35.62f) reduces to 

*31 = R {B) ps + * ( jU*) + *$(« + (35-62g) 

where .R a) and i? (2) are defined by equations (35.58). 


§35.14. EFFECT OF BACKGROUND CURVATURE 
ON WAVE PROPAGATION 

Focus attention on the propagation equation R { Jtl(h) — 0. As in linearized theory, 
so also here, the propagation is studied more simply in terms of 

V = (35.63) 

than in terms of h^ v . Rewritten in terms of h , R^l(h) = 0 says 

V,/ + = 0. (35.64) 

[To obtain this, invert equation (35.63) obtaining h^ v — h^ v — insert this into 

(35.58a) and equate to zero; then commute covariant derivatives using the identity 
(16.6b); finally contract to obtain an expression for h la a and substitute that back 
in.] 

The propagation equation (35.64) can be simplified by a special choice of gauge. 
An infinitesimal coordinate transformation 

^new(^) = ^oid(^) + tTJ) (35.65a) 

induces a first-order change in the functional forms of the metric coefficients given 
by 

^/xt'newC'*' new) ^fiv oldC* new) v) (35.65b) 

[analog of the gauge transformation of linearized theory, equation (35.3c); see 
exercise 35.12]. By an appropriate choice of the four functions one can enforce 
the four “Lorentz gauge conditions” 


h^ v defined 


Propagation equation for 
waves on curved background 


Specialization to "Lorentz 
gauge" 


Via — 0 


(35.66) 
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Coupling of waves to Ricci 
tensor can be ignored 


Propagation equation in 
Lorentz gauge and its realm 
of validity 


Lists of effects absent from 
and contained in propagation 
equation 


in the new coordinate system (exercise 35.13). This choice of gauge is analogous 
to that of linearized theory. It makes the second and third terms in the propagation 
equation vanish. (For additional gauge conditions of the “7T” type, see exercise 
35.13.) 

The last term of the propagation equation, -2/? ( ^/z r) a , vanishes to within the 
precision of the analysis, for this reason: attention has been confined to vacuum; 
so the only source of a nonvanishing Ricci tensor is the stress-energy carried by 
the gravitational waves themselves [equation (35.60)]; hence Rf£ — a 2 /X 2 and 

(35.67) 

This is of the same order as R ( ®(h), the third-order correction to the Ricci tensor, 
which is far below the precision of the analysis. For consistency in the analysis it 
will therefore be neglected. 

Summary of this section thus far: by choosing a gauge where hf^ = 0, and by 
discarding terms of higher order than the precision of the analysis, one obtains the 
vacuum propagation equation 

Vi«“ + = 0 , (35.68) 

subject to the Lorentz gauge condition 

h 1 “ = 0 

ri Ha 

Equation (35.68) is accurate to first order in the amplitude [corrections cctf 2 are 
embodied in equation (35.59c)]; and its accuracy is independent of the ratio 
as one sees from equations (35.59). Thus, it can be applied whenever the waves are 
weak, even if the wavelength is large! 

All nonlinear interactions of the wave with itself are neglected in this first-order 
propagation equation. Absent is the mechanism for waves to scatter off each other 
and off the background curvature that they themselves produce. Also absent are 
any hints of a change in shape of pulse due to self-interaction as a pulse of waves 
propagates. There are no signs of the gravitational collapse that one knows must 
occur when a mass-energy m of gravitational waves gets compressed into a region 
of size <m. To see all these effects, one must turn to corrections of second order 
in d and higher [e.g., equations (35.59c) and (35.60)]. 

Actually contained in the propagation equation are all effects due to the linear 
action of the background curvature on the propagating wave. These effects are 
explored, for short wavelengths {X/6L < 1) and for nearly flat wave fronts, in 
exercises 35.15-35.17 at the end of the chapter. The effects considered include a 
gravitational redshift of gravitational radiation and gravitational deflection of the 
direction of propagation of gravitational radiation, identical to those for light; and 
also a rotation of the polarization tensor. When the wavelength is not small {X/& 
not ^ 1), the propagation equation includes a back-scatter of the gravitational waves 
off the background curvature and a resultant pattern of wave “tails” analogous to 
that explored in exercise 32.10 [see, e.g.. Couch et al (1968), Price (1971a), Bardeen 
and Press (1972), Unt and Keres (1972)]. 
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Exercise 35.12. GAUGE TRANSFORMATIONS IN A CURVED BACKGROUND 

(a) Show that the infinitesimal coordinate transformation (35.65a) induces the change 
(35.65b) in the functional form of the metric perturbation. 

(b) Discuss the relationship between this gauge transformation and the concept of a Killing 
vector (§25.2). 

Exercise 35.13. TRANSVERSE-TRACELESS GAUGE FOR GRAVITATIONAL 
WAVES PROPAGATING IN A CURVED BACKGROUND 

(a) Show that, in vacuum in a curved background spacetime, the gauge condition h M a !a = 0 
is preserved by transformations whose generator satisfies the wave equation £j a a = 0. 

(b) Locally (over distances much smaller than 9L) linearized theory is applicable; so there 
exists such a transformation which makes [see equations (35.7b) and (35.8a)] 

h = 0 + error, h m u a = 0 + error. (35.69) 

Here u a is a vector field that is as nearly covariantly constant as possible (u a {f} = 0); i.e., 
it is a constant vector in the inertial coordinates of linearized theory; and the errors are 
small over distances much less than $. Show that h = 0 can be imposed globally along with 
= 0; i.e., show that, if it is imposed on an initial hypersurface, the propagation equation 
(35.68) preserves it. 

(c) Show that in general, the background curvature prevents any vector field from being 
covariantly constant ( u a j/( $ ~~ u a /$l at best); and from this show that h^ a u a — 0 cannot be 
imposed globally along with h * )a = 0. 


§35.15. STRESS-ENERGY TENSOR FOR GRAVITATIONAL WAVES 

Turn now to an evaluation of the effective stress-energy tensor r ( ^ w) of equation 
(35.61). The evaluation requires averaging various quantities over several wave¬ 
lengths. Useful rules for manipulating quantities inside the averaging brackets < ) 

are the following (see exercise 35.14 for justification). 

(1) Covariant derivatives commute; e.g., (h = (h h^ v ^ a ). The fractional 

errors made by freely commuting are — (A/$) 2 , well below the inaccuracy of the 
computation. 

(2) Gradients average out to zero; e.g., = 0. Fractional errors made 

here are <A/^- 

(3) As a corollary, one can freely integrate by parts, flipping derivatives from one 

h to the other; e.g., (h A„, laj8 > = ,„>. 

A straightforward but long calculation using these rules, using equation (35.58b) 
for R { £l(h), using definition (35.63) of h^, using the propagation equation (35.64), 
and using the definition (35.61) of r|f r w) , yields < R i2) (h )> = 0, and 

T™ = 3 ~ ( 35 - 7 °) 


EXERCISES 


The averaging process 
involved in "coarse-grain" 
viewpoint 


Evaluation of effective 
stress-energy tensor for 
gravitational waves, 7"j® w) 


This is the result quoted in equation (35.23'), except that there one used an inertial 
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Accuracy of expression 
for 7-<™> 


Properties of T i ° w) 


EXERCISES 


coordinate system of linearized theory, where covariant derivatives and ordinary 
derivatives are the same. In a gauge where h/‘ ta = 0, the last term vanishes. When, 
in addition, h^ v is traceless (see exercise 35.13), the second term vanishes; and there 
remains only 

T T } = if Vta = A = 0 . (35.70') 

These expressions for the effective stress-energy of a gravitational wave have 
fractional errors of order d , due to the neglect of second-order corrections to 
they also have fractional errors of order X/t, ?, due to the averaging process, which 
makes no sense when X approaches 6t in magnitude. Since d < T/0 (35.28), the 
dominant errors in T { ™ are —X/6L. 

To this accuracy, the stress-energy tensor for gravitational waves is on an equal 
footing with any other stress-energy tensor. It plays the same role in producing 
background curvature; and it enters into conservation laws in the same way. For 
example, one can show, either by direct calculation or from the identity G (B) '“‘ , k = 0, 
that 


7' (GW) '“’i„ = 0 + error, (35.71) 

where the error ~~(X/tt)(T iGm >“’/%) is negligible in the shortwave approximation. 

Some of the properties of Tf™ have already been explored in §35.7. Further 
properties are explored in exercises 35.18 and 35.19. 


Exercise 35.14. BRILL-HARTLE AVERAGE 

Isaacson (1968b) introduces the following averaging scheme, which he names “Brill-Hartle 
averaging.” 

(a) In the small region, of size several times A, where the averaging occurs, there will 
be a unique geodesic of g { ™ connecting any two points SP' and £P; so given a tensor £(*P') 
at one can parallel transport it along this geodesic to SP, getting there a tensor 

(b) Letbe a weighting function that falls smoothly to zero when 9' and 9 are 
separated by many wavelengths, and such that 

^)V-g (B, (S?') d 4 x' = 1. (35.72) 

(c) Then the average of the tensor field E(f') over several wavelengths about the point 

& is 

<E>9 = / f(^'W/(^', V-g (B) (^') d*x'. (35.73) 

(i) Show that there exists an entity (f?, f?'), whose primed index transforms as a tensor 
at ‘3” and whose unprimed index transforms as a tensor at 9, such that (for E second rank) 

EafiCd")-? = (35.74) 

This entity is called the “bivector of geodesic parallel displacement”; see DeWitt and Brehme 
(1960) or Synge (1960a). 
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(ii) Rewriting expression (35.73) in coordinate language as 

<£ a/ j(x)> = f g? )M '(x, x')gf r '(x, x')E^{x')f{x, x')\/-g (B) (x') d 4 x', ( 35 . 73 ') 

derive the three averaging rules cited at the beginning of §35.15. [For solution, see Appendix 
of Isaacson (1968b).] 

Exercise 35.15. GEOMETRIC OPTICS 

Develop geometric optics for gravitational waves of small amplitude propagating in a curved 
background. Pattern the analysis after geometric optics for electromagnetic waves (§22.5). 
In particular, let h have an amplitude that varies slowly (on a scale 1 < 6i) and a phase 
6 that varies rapidly (6 - — 1/A). Expand the amplitude in powers of A/X, so that 

V = + •• -}* <#/€ - (35.75) 

Here c is a formal expansion parameter, actually equal to unity, which reminds one that 
the terms attached to c n are proportional to (A/#) n . Define the following quantities (with 
A*„ denoting the complex conjugate of A ): 

“wave vector”: k a = 0 a (35.76a) 

“scalar amplitude”: A = {-■ A^ V A^ /2 (35.76b) 

“polarization”: = A^JA. (35.76c) 

By inserting expression (35.75) into the gauge condition (35.66) and the propagation equation 
(35.68), derive the fundamental equations of geometrical optics as follows. 

(a) The rays (curves perpendicular to surfaces of constant phase) are null geodesics; i.e. 

k a k“ = 0, (35.77a) 

K\^ = 0. (35.77b) 

(b) The polarization is orthogonal to the rays and is parallel transported along them; 

e m k a = 0, (35.77c) 

e M , t a k a = 0. (35.77d) 

(c) The scalar amplitude decreases as the rays diverge away from each other in accordance 
with 

(35.77e) 

(A 2 k a ) (a = 0 (“conservation of gravitons”). (35.77f) 

(d) The correction B^ v to the amplitude obeys 

(35.77g) 

B^\ a k a = - + iR™p p A*l>. (35.77h) 

In accordance with exercise 35.13, specialize the gauge so that h = 0, i.e., 


e a — 0 

a • 


(35.77i) 
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Then show that the stress-energy tensor (35.70') for the waves is 

T T' = 3(35.77J) 

This has the same form as the stress-energy tensor for a beam of particles with zero rest 
mass (see §5.4). Show explicitly that 7’ (GW > M, ', r = 0. 

Exercise 35.16. GRAVITONS 

Show that geometric optics, as developed in the preceding exercise, is equivalent to the 
following: “A graviton is postulated to be a particle of zero rest mass and 4-momentum 
p, which moves along a null geodesic (^,p = 0). It parallel transports with itself (V p e = 0) 
a transverse (e • p =0) traceless ( e a a — 0) polarization tensor e. Geometric optics is the 
theory of a stream of such gravitons moving through spacetime.” Exhibit the relationship 
between the quantities in this version of geometric optics and the quantities in the preceding 
version (e.g., p = hk, where ti is Planck’s reduced constant h/2n). 

Exercise 35.17. GRAVITATIONAL DEFLECTION 
OF GRAVITATIONAL WAVES 

Show that gravitational waves of short wavelength passing through the solar system experi¬ 
ence the same redshift and gravitational deflection as does light. (One should be able to 
infer this directly from exercise 35.15.) 

Exercise 35.18. GAUGE INVARIANCE OF 7<« w 

Show that the stress-energy tensor T { ® V W) of equation (35.70) is invariant under gauge trans¬ 
formations of the form (35.65). 

Exercise 35.19. 7^ W) EXPRESSED AS THE AVERAGE OF 

A STRESS-ENERGY PSEUDOTENSOR 

Calculate the average over several wavelengths of the Landau-Lifshitz stress-energy pseu¬ 
dotensor [equation (20.22)] for gravitational waves with A/$ < 1. The result should be equal 
to rj® w) . [Hint: Work in a gauge where /t M a ja = h = 0, to simplify the calculation.] 

Exercise 35.20. SHORTWAVE APPROXIMATION FROM 
A VARIATIONAL VIEWPOINT 

Readers who have studied the variational approach to gravitation theory in Chapter 21 may 
find attractive the following derivation of the basic equations of the shortwave approximation. 
It was devised, independently, by Sindor Kovics and Bernard Schutz, and by Bryce DeWitt 
(unpublished, 1971). MacCallum and Taub (1973) give a “non-Palatini” version. 

(a) Define 

g„ r = gfi + h „„ - -Jgfjh, . (35.78a) 

W 1 py ~ ~ 2 gtB)(h a /l[y 3" ^ayljB — ^Py\ a) - (35.78b) 

Raise and lower indices on h^ v and with the background metric. Using the results 

of exercise 35.11, derive the following expression for the Lagrangian of the gravitational 
field: 




£■ + 


( perfect divergence\ /corrections of order 

of form d£ a /dx a / + y? 3 /A 2 , and smaller, 


(35.78c) 
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where 


£ '-lk (-*® , ) 1/2 [*® ) - |„ - ^Vlr) 

+ - w% r w^)]. 

[Hint: recall that 


(35.78d) 


(~g w ) 1/2 B a la = d[(- g ™y'*B°]/dx“ 

for any B a .] Drop the corrections of order d 3 /A 2 from £; and, knowing in advance that 
the field equations will demand — d 2 /X 2 , drop also the corrections of order R^d. 
Knowing that a perfect divergence contributes nothing in an extremization calculation, drop 
the divergence term from £. Then £' is the only remaining part of £. 

(b) Extremize I = j£'d 4 x by the Palatini method (§21.2); i.e., abandon (temporarily) 
definition (35.78b) of FF^y, and extremize / with respect to independent variations of 
y = W M y p, h^ v = h VfL , and gjfa = gjg. Show that extremization with respect to W tL p y 
leads back to equation (35.78b) for W^p y in terms of h^. Show that extremization with respect 
to when combined with equations (35.78a,b), leads to the propagation equation for 
gravitational waves (35.64). Show that extremization with respect to g®^, when combined 
with equations (35.78a,b) and with the propagation equation (35.64), and when averaged 
over several wavelengths, leads to 


c®> = 877 Tf™, 

where T® w) is given by equation (35.70). [ Warning: The amount of algebra in this exercise 
is enormous, unless one chooses to impose the gauge conditions h = h a p { p = 0 from the 
outset.] 



CHAPTER 


GENERATION OF 
GRAVITATIONAL WAVES 


Matter is represented by curvature , but not every curvature does represent matter; 

there may be curvature "in vacuo." 

G. LEMAITRE in Schiipp (1949), p. 440 


V 


Generation of gravitational 
waves analyzed by 
electromagnetic analog 


§36.1. THE QUADRUPOLE NATURE OF GRAVITATIONAL WAVES 

Masses in an isolated, nearly Newtonian system move about each other. How much 
gravitational radiation do they emit? 

For an order-of-magnitude estimate, one can apply the familiar radiation formulas 
of electromagnetic theory, with the replacement e 2 —>* — m 2 y which converts the 
static coulomb law into Newton’s law of attraction. This procedure treats gravity 
as though it were a spin-one (vector) field, rather than a spin-two (tensor) field; 
consequently, it introduces moderate errors in numerical factors and changes angular 
distributions. But it gives an adequate estimate of the total power radiated. 

In electromagnetic theory, electric-dipole radiation dominates, with a power output 
or “luminosity,” L, given (see §4.4 and Figure 4.6) by 

^electric dipole (2/3)6? Q 

for a single particle with acceleration a and dipole moment changing as d — ex — ea; 

^electric dipole (2/3) d 

for a general system with dipole moment d. [Geometric units: luminosity in cm of 
mass-energy per cm of light travel time; charge in cm, e = ( G 1/2 /c 2 )e C0nv = (2.87 X 
10” 25 cm/esu) x (4.8 x 10 _10 esu) = 1.38 x 10~ 34 cm, acceleration in cm of distance 
per cm of time per cm of time. For conventional units, with e in esu or (g cm 3 /sec 2 ) 1/2 , 
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insert a factor c 3 on the right and get L in erg/sec]. The gravitational analog of 
the electric dipole moment is the mass dipole moment 

d = ^ m A x A . 

particles A 


Its first time-rate of change is the total momentum of the system, 

d = ^ m A x A = p. 

particles .A 

e 

The second time-rate of change of the mass dipole moment has to vanish because 
of the law of conservation of momentum, d = p = 0. Therefore there can be no mass 
dipole radiation in gravitation physics. 

The next strongest types of electromagnetic radiation are magnetic-dipole and 
electric-quadrupole. Magnetic-dipole radiation is generated by the second time- 
derivative of the magnetic moment, fi. Here again the gravitational analog is a 
constant of the motion, the angular momentum, 


H = ^ (position of A) X (current due to A) = r A x (mv A ) = /; 

A A 

so it cannot radiate. Thus, there can be no gravitational dipole radiation of any sort. 

When one turns to quadrupole radiation, one finally gets a nonzero result (see 
Figure 36.1). The power output predicted by electromagnetic theory, 

^electric quadrupole 2 Q ^ ~~~20 QjkQjk’ 

Qik=y e A [x Aj x Ak ~ f v/) 

{Qjk here Q jk in much other literature)’ as its gravitational counterpart 

^massquadrupole ^ ^ == "g” $jk^jk X (36.1) 

f ik = *Z m A ( x Ai x Ak - y &jk r A 2 ) = f p ( XjXk - y V 2 ) d 3 x. (36.2) 


Formula (36.1) contains the correct factor of 1/5, which comes from tensor calcula¬ 
tions (see §36.10), instead of the incorrect factor 1/20 suggested by the electromag¬ 
netic analog; and the righthand side of (36.1) has been averaged (“< )”) over several 

characteristic periods of the source to accord with one’s inability to localize the energy 
of gravitational radiation inside a wavelength. 


Why gravitational waves 
cannot be dipolar 


Gravitational-wave power 
output expressed in terms of 
"reduced quadrupole 
moment" of source 



Source 


Receptor 




of travel to receptor; 
gives rise to 
difference in phase 
50 = <oL cos 0 



Conclusion: 

(amplitude) net = (amplitude)! or2 50 
m \ a \ 

= —— 50 
r 

( moo 2 L sin 0 \ 

~ (-1 (wL cos0) 

cc 3 mL 2 sin 20 


- T/r 

(luminosity) — (? ) 2 

Figure 36.1. 

Why gravitational radiation is ordinarily weak. In brief, contributions to the amplitude of the outgoing 
wave from the mass dipole moments of the separate masses cancel, {m l a l + m 2 a 2 )lr = 0 (principle that 
action equals reaction). 

(a) Radiation from an accelerated charge (see §4.4 and Figure 4.6). 

(b) Representation of the field at the great distance r in terms of the typical rotating-vector diagram 
of electrical engineering; however, here, for ease of visualization, the vertical projection of the rotating 
vector gives the observed field (usual dipole-radiation field produced by a charge in circular orbit). 

(c) Corresponding rotating-vector diagram for gravitational radiation, based on the simplified model 
of the gravitation field as a spin-one or vector field (to be contrasted with its true tensor character; hence 
details of angular distribution and total radiation as given by this simple diagram are not correct; but 
order of magnitude of luminosity is correct). 

(d) The two masses m 1 and m 2 that hold each other in orbit give equal and opposite contributions 
to the amplitude of the outgoing wave because of the principle that action equals reaction. (In electro¬ 
magnetic radiation from a hydrogen atom, the corresponding radiation amplitudes do not cancel: 

^elec a elec T ^prot^prot ^elec fl elec ^ 0). 

(e) In a better approximation, one has to allow for the difference in time of arrival at the receptor 
of the effects from the two masses. The two vectors that formerly opposed each other exactly are now 
drawn inclined, at the phase angle 50. The amplitude of the resulting field goes as iF, where i is the 
reduced quadrupole moment; and the luminosity is proportional to P. 
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Notation: There is no ambiguity about the definition of the “second moment of 
the mass distribution ” as it appears throughout the physics and mathematics literature 

!jk = f PXjX k d 3 x. 

Nor is there any ambiguity about how one constructs the moment of inertia tensor 
$j k from this second moment of the mass distribution: 


** = 8 * ^ace (I ab ) - I jk = f p(r* 8 jk - Xj x k ) d 3 x. 

The moments that characterize a source radiating quadrupole gravitational radiation 
are here taken, equally unambiguously, to be the “trace-free part of the second 
moment of the mass distribution”: 

*i * = h* ~ J 8 ik trace (I ab ) = I jk - i 8 jk I = f p( Xj x k - j 8 jk r *) d 3 x. (36.3) 


This notation is adopted because it simplifies formulas, it simplifies calculations, it 
meshes well with much of the literature of gravitational-wave theory [e.g. Peters 
(1964), Peres and Rosen (1964)], and it is easy to remember. Another name for the 
quantities is reduced quadrupole moment. This terminology makes clear the 
distinction between the quantities used here and the three-times-larger quantities 
that are called quadrupole moments in the standard text of Landau and Lifshitz 
(1962) and in the literature on nuclear quadrupole moments, and the 3/2-times-larger 
quantities used in the theory of spherical harmonics: 


/Landau and Lifshitz; also \ 
\ nuclear quadrupole moments 


j = J p(3z 2 — r 2 ) d 3 x, 

Q zz (theory of spherical harmonics) = J p^-z 2 — y r2 j d 3 x, 

( reduced quadrupole moment;\ 
unambiguous measure of 1 = J pfz 2 — —r 2 \d 3 x. 
source strength adopted here/ 


Thus the f jk notation has the merit of circumventing the existing ambiguity in the 
literature. 

That electromagnetic radiation is predominantly dipolar (spherical-harmonic index 
/ = 1), and gravitational radiation is quadrupolar (/ = 2) are consequences of a 
general theorem. Consider a classical radiation field, whose associated quantum 
mechanical particles have integer spin S , and zero rest mass. Resolve that radiation 
field into spherical harmonics—i.e., into multipole moments. All components with 
I < S will vanish; in general those with / > S will not; and this is independent of 
the nature of the source! [See, e.g., Couch and Newman (1972).] Since the lowest 
nonvanishing multipoles generally dominate for a slowly moving source (speeds < c), 
electromagnetic radiation (S = 1) is ordinarily dipolar (/ = S = 1), while gravita- 


Why gravitational waves 
ordinarily quadrupolar 
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tional radiation (S = 2) is ordinarily quadrupolar (/ = S — 2). Closely connected 
with this theorem is the “topological fixed-point theorem” [e.g., Lifshitz (1949)], 
which distinguishes between scalar, vector, and tensor fields. For a scalar disturbance, 
such as a pressure wave, there is no difficulty in having a spherically symmetric 
source. Thus, over a sphere of a great radius r, there is no difficulty in having a 
pressure field that everywhere, at any one time, takes on the same value p. In contrast, 
there is no way to lay down on the surface of a 2-sphere a continuous vector field, 
the magnitude of which is non-zero and everywhere the same (“no way to comb 
smooth the hair on the surface of a billiard ball”). Likewise, there is no way to lay 
down on the surface of a 2-sphere a continuous non-zero transverse-traceless 2x2 
matrix field that differs from one point to another at most by a rotation. Topology 
thus excludes the possibility of any spherically symmetric source of gravitational 
radiation whatsoever. 


§36.2. POWER RADIATED IN TERMS OF INTERNAL 
POWER FLOW 


Gravitational-wave power 
output in terms of internal 
power flow of source 


Expression (36.1) for the power output can be rewritten in a form that is easier to 
use in order-of-magnitude estimates. Notice that the reduced quadrupole moment 
is 




/mass of that part of \ /size of \ 2 
Vsystem which moves/ Vsystem/ 

/ time for masses to move from\ 3 
lone side of system to other / 

/ nonspherical part\ 
M(R/T) 2 \of kinetic energy / 

j - - ; 


_ MR 2 
T 3 


*» 


■^internal 


( power flowing from one 
side of system to other 


(36.4) 


Consequently, equation (36.1) says that the power output in gravitational waves 
(“luminosity”) is roughly the square of the internal power flow 


^GW ~~ (^internal) 2 * (36.5) 

If this equation seems crazy (who but a fool would equate a power to the square 
of a power?), recall that in geometrized units power is dimensionless. The conversion 
factor to conventional units is 


L 0 = c 5 /G — 3.63 x 10 59 erg/sec = 2.03 x 10 5 M o c 2 /sec. (36.6) 

One may freely insert this factor of L 0 = 1 wherever one wishes in order to feel 
more comfortable with the appearance of the equations. For example, one can 
rewrite equation (36.5) in the form 

^Gw/^internal ^ ^internal / (36.7) 
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In applying the equation L GW — (internal) 2 ’ one must be careful to ignore those 
internal power flows that cannot radiate at all, i.e., those that do not accompany 
a time-changing quadrupole moment. For example, in a star the internal power flows 
associated with spherical pulsation and axially symmetric rotation must be ignored. 

Conservation of energy guarantees that radiation reaction forces will pull down 
the internal energy of the system at the same rate as gravitational waves carry energy 
away (see Box 19.1). The characteristic time-scale for radiation reaction to change 
the system markedly is 

T react ~~ [l/(rate at which energy is lost)] X [energy in motions that radiate] 

— [1/L gw ] X [(Antemai) X (characteristic period T of internal motions)] 

~~ (Anternal/^Gw)^"^ (^ 0 /^internal)^ (36.8) 

Consequently, radiation reaction is important in one characteristic period only if the 
system achieves the enormous internal power flow 

internal > K = 3*63 X 10» CTg$/SCC = 1! 


§36.3. LABORATORY GENERATORS 
OF GRAVITATIONAL WAVES 

As a laboratory generator of gravitational waves, consider a massive steel beam of 
radius r = 1 meter, length / = 20 meters, density p = 7.8 g/cm 3 , mass M = 4.9 X 
10 8 g (490 tons), and tensile strength t = 40,000 pounds per square inch or 
3 x 10 9 dyne/cm 2 . Let the beam rotate about its middle (so it rotates end over end), 
with an angular velocity co limited by the balance between centrifugal force and 
tensile strength 

co = (8t/p/ 2 ) 1/2 = 28 radians/sec. 

The internal power flow is 

internal = (j U f 

^2 x 10 18 erg/sec ~ 10" 41 L o . 

The order of magnitude of the power radiated is 

L gw - (10“ 4l ) 2 £ o ~~ 10“ 23 erg/sec. (36.9) 

(An exact calculation using equation (36.1) gives 2.2 x 10“ 22 erg/sec; see Exercise 
36.1.) Evidently the construction of a laboratory generator of gravitational radiation 
Ts~an unattractive enterprise in the absence of new engineering or a new idea or 
both. 

To rely on an astrophysical source and to build a laboratory or solar-system 
detector is a more natural policy to consider. Detection will be discussed in the next 
chapter. Here attention focuses on astrophysical sources. ' - 


Characteristic time-scale for 
radiation-reaction effects 


Power output from a rotating 
steel beam 
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EXERCISE 


Power output from violent 
astrophysical sources, in 
terms of mass and radius 


Upper limit on power output 


Exercise 36.1. GRAVITATIONAL WAVES FROM ROTATING BEAM 

A long steel beam of length / and mass M rotates end over end with angular velocity 
Show that the power it radiates as gravitational waves is 

J-gw = js MV***. (36.10) 

Use this formula to verify that the rod described in the text radiates 2.2 X 10~ 22 ergs/sec. 


§36.4. ASTROPHYSICAL SOURCES OF GRAVITATIONAL WAVES: 
GENERAL DISCUSSION 


Consider a highly dynamic astrophysical system (a star pulsating and rotating wildly, 
or a collapsing star, or an exploding star, or a chaotic system of many stars). If its 
mass is M and its size is R , then according to the virial theorem (exercise 39.6) its 
kinetic energy is — M 2 /R . The characteristic time-scale for mass to move from one 
side of the system to the other, T> is 


R _ R _ (Rf\ 1/2 
(mean velocity ( M/R ) v2 \ M ) 


(36.11a) 


time of free fall; — time to turn one radian in Kepler orbit; see Chapter 25). 
Consequently, the internal power flow is 


L 


internal 


(kinetic energy) / A/ 2 \ / M\ 1/2 /M\ 5/2 

t V^r/ [&) ~ \r) 


(36.11b) 


The gravitational-wave output or “luminosity” is the square of this quantity, or 

L gw ^ {M/RfL 0 . (36.11c) 

(If the system is rather symmetric, or if only a small portion of its mass is in motion, 
then its quadrupole moment does not change much, and the estimate of L GW must 
be reduced accordingly. The wave amplitude goes down in proportion to the fraction 
of the mass in motion, and the power is reduced in proportion to the square of 
that fraction.) 

Clearly, the maximum power output occurs when the system is near its gravitational 
radius; and because nothing, not even gravitational waves, can escape from inside 
the gravitational radius, the maximum value of the output is ~~L 0 — 3.63 X 10 59 
ergs/second, regardless of the nature of the system! 

Actually, the above derivation of this limit and of equation (36.11c) uses approxi¬ 
mations to general relativity that break down near the gravitational radius. [Velocities 
small compared to light are required in deriving the standard formula (36.1) for 




§36.5. GENERATION BY COLLAPSE. BLACK HOLES, SUPERNOVAE, PULSARS 


981 


I G w ( see §36.7); nearly Newtonian fields are required for the virial theorem argu¬ 
ments of (36.11a), as well as for the L GW formula.] Nevertheless, in rough order 
of magnitude, equation (36.11c) is valid to quite near the Schwarzschild radius, say, 
R ~ 3 M; and inside that point gravity is so strong that no system can resist collapse 
for an effective length of time much longer than T ~ M. 

The time required for radiation-reaction forces to affect a system substantially 
[equation (36.8)] is of the order 

T react ~ (L 0 /L lDteinal )T ~ (R/M) 5/2 T, (36.1 Id) 

where T is the characteristic time (36.11a) of rotation or free fall. (Note how one 
inserts and removes the factor L 0 — 1 at will!) Consequently, the effect of radiation 
reaction, as integrated over one period, is unimportant except when the system is near 
its gravitational radius. 

When a system such as a pulsating star is settling down into an equilibrium state, 
the radiation reaction will damp its internal motions. On the other hand, when the 
system, like a binary star system, is far from any state of equilibrium, then loss of 
energy (and angular momentum) to radiation under certain circumstances may speed 
up the angular velocity or speed up the internal motions and augment the radiation. 


§36.5. GRAVITATIONAL COLLAPSE, BLACK HOLES, 
SUPERNOVAE, AND PULSARS AS SOURCES 


Since L GW ~ (M/R) 5 L 0 , the most intense gravitational waves reaching Earth must 
come from a dynamic, deformed system near its gravitational radius (L GW drops 
by a factor 100,000 with each increase by 10 of R\). The scenario of Figure 24.3 
gives an impression of some of the dynamic processes that not only may happen 
but probably must happen. The sequence of events sketched out there includes pulses 
of gravitational radiation interspersed with intervals of continuous radiation of 
gradually increasing frequency. Pulse number one comes at the time of the original 
collapse of a star with white-dwarf core to a pancake-shaped neutron star. The details 
of what then goes on will differ enormously depending on the original mass and 
angular momentum of this “pancake.” In the illustration, this pancake fragments 
into a constellation of corevolving neutron stars, which then one by one undergo 
“pursuit and plunge.” 

Whether in this kind of scenario or otherwise, perhaps the most favorable source 
of gravitational radiation is a star (the original very temporary “pancake” or one 
of the fragments therefrom) collapsing through its gravitational radius in a highly 
nonspherical manner. Such a star should terminate life with a last blast of gravita¬ 
tional waves, which carry off a sizeable fraction of its rest mass. Thus an order-of- 
magnitude estimate gives 


(energy radiated) = f Z. GW dt ~~ L 0 
~~ L 0 M = M. 


/ time during which 
\peak luminosity occurs 


(36.12) 


Radiation reaction in 
astrophysical sources 


Gravitational waves from: 

(1) stellar collapse and 

formation of a black hole 
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(2) the fall of debris into a 
black hole 


(Whether the energy radiated is 0.9Af, or 0.1 Af, or 0.0 lAf is not known for certain 
today; but it must lie in this range of orders of magnitude.) The radiation should 
be weak at low frequencies; it should rise to a peak at a frequency a little smaller 
than 1/A/; and it should cut off sharply for circular frequencies above to — 1/A/. 

Matter (“debris”; see Figure 24.3) falling into a black hole can also be a significant 
source of gravitational waves. The infalling matter will radiate only weakly when 
it is far from the gravitational radius; but as it falls through the gravitational radius 
(between r — 4Af and r = 2Af), it should emit a strong burst. If m is the mass of 
an infalling lump of matter and Af is the total mass of the black hole, then the total 
energy in the final burst is 


^radiated ~ m 2 /M, 


(36.13) 


and it comes off in a time —M with a power output of L GW ~ ( m/M) 2 L 0 . (See 
exercise 36.2.) Actually, this is an extremely rough estimate of the energy output. 
In the limit where the infalling lump is small in both size and mass [(size of lump) < 
(gravitational radius of black hole); m A/; “delta-function lump”], one can 
perform an exact calculation of the spectrum and energy radiated by treating the 
lump and the waves as small perturbations on the Schwarzschild geometry of the 
black hole. The foundations for such a treatment were given by Zerilli (1970b). 
Zerilli’s formula was corrected and applied to the case of head-on impact by Davis, 
Ruffini, Press, and Price (1971). They predict the spectrum of Figure 36.2 and the 
total energy output 

Radiated = 0.0104/H 2 /^ (36.14) 

for m < Af and (size of lump) < Af. 

(3) collisions of black holes A collision between black holes should also produce a strong burst of gravitational 

waves—through such collisions are probably very rare! 

(4) supernova explosions Not quite so rare, but still not common, are supernova explosions (about one per 

galaxy per 100 years). According to current theory as verified by pulsar observations, 
a supernova is triggered by the collapse of the core of a highly evolved star (see 
§24.3). The collapse itself and the subsequent wild gyrations of the collapsed core 
(neutron star) should produce a short, powerful burst of gravitational waves. The 
characteristics of the burst, as estimated with formulas (36.11), and assuming large 
departures from sphericity, are 


(energy radiated) — (neutron-star binding energy) 

- M 2 /R - 0.1 A/ - 10 53 ergs, 

(mean frequency) ~~ l/T~~ (M/R 3 ) 1/2 — 0.03Af _1 — 3000 Hz, (36.15) 
(power output) — ( M/R) 5 L 0 — 10 _5 L o ^ 3 X 10 54 ergs/sec, 


/ time for gravitational \ 

radiation to damp the 
motion if turbulence, 
heat conduction, and other 
effects do not damp it 
\sooner / 


r — M(M/R)~ 4 ~~ 0.1 sec — 300 periods. 



§36.5. GENERATION BY COLLAPSE, BLACK HOLES, SUPERNOVAE, PULSARS 


983 



w, in units 1 /M 


Figure 36.2. 

Spectrum of the gravitational waves emitted by a “delta-function” lump of matter 
of mass m, falling head-on into a nonrotating (Schwarzschild) black hole of mass 
M m. The total energy radiated is distributed among multipoles according to 
the empirical law 

(energy in /-pole waves) ~ (0.44 m 2 /M)e ~ 2/ , 
and the total spectrum peaks at angular frequency 

«»« - 0.32/ M. 

These results were calculated by treating the infalling lump and the gravitational 
waves as small perturbations on the Schwarzschild geometry of the black hole. The 
relevant perturbation-theory equations were derived by Zerilli (1970), and were 
solved numerically to give these results by Davis, Ruffini, Press, and Price (1971). 


In the last stages of the stellar pulsations, when the amplitude £ = Sr has dropped 
to Sr/r 1, one can calculate the pulsation frequencies and damping times exactly 
by treating the fluid motions and gravitational waves as small perturbations of an 
equilibrium stellar model. The results of such a calculation, which are in good 
agreement with the above rough estimates, are shown in Box 36.1. 

Long after the pulsations of the neutron star have been damped out by gravita- (5) young pulsars 
tional radiation reaction and by other forces, the star will continue to rotate; and 
as it rotates, carrying along with its rotation an off-axis-pointing magnetic moment, 
it will beam out the radio waves, light, and x-rays that astronomers identify as “pulsar 
radiation.” In this pulsar phase, gravitational radiation is important only if the star 
is somewhat deformed from axial symmetry (axial symmetry =s- constant quadru¬ 
ple moment =>- no gravitational waves). According to estimates in exercise 36.3, 
a deformation that contains only 0.001 of the star’s mass could radiate 10 38 ergs 
per second for the youngest known pulsar (Crab nebula); and the accompanying 
radiation reaction could be a significant source of the pulsar’s slowdown. However, 
it is not at all clear today (1973)—indeed, it seems unlikely—that the neutron star 
could support even so small a deformation. 


(i continued on page 986 ) 
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36. GENERATION OF GRAVITATIONAL WAVES 



EXERCISES 


Binary stars as sources of 
gravitational waves: 


Of the sources discussed in this section, most are “impulsive” rather than continu¬ 
ous (star collapsing through gravitational radius; debris falling into a black hole; 
collision between black holes; supernova explosion). They give rise to bursts of 
gravitational waves. An order-of-magnitude method of analyzing such bursts is 
spelled out in Box 36.2. 

It is difficult and risky to pass from the above description of processes that should 
generate gravitational waves to an estimate of the characteristics of the waves that 
actually bathe the earth. For such an estimate, made in 1972 and subject to extensive 
revision as one’s understanding of the universe improves, see Press and Thorne 
(1972). 


Exercise 36.2. GRAVITATIONAL WAVES FROM MATTER FALLING 
INTO A BLACK HOLE 

A lump of matter with mass m falls into a black hole of mass M. Show that a burst of 
gravitational waves is emitted with duration and power L GW — ( m/M) 2 L 0 , so that the 

total energy radiated is given in crude order of magnitude by equation (36.13). 

Exercise 36.3. GRAVITATIONAL WAVES FROM A 
VIBRATING NEUTRON STAR 

Idealize a neutron star as a sphere of incompressible fluid of mass M and radius R, with 
structure governed by Newton’s laws of gravity. Let the star pulsate in its fundamental 
quadrupole mode. Using Newtonian theory, calculate: the angular frequency of pulsation, 
co; the energy of pulsation £ puls ; the quantity t 2 >, which, according to equation (36.1), 
is the power radiated in gravitational waves, L GW ; and the e-folding time, r — £ pu is/^gw> 
for radiating away the energy of the pulsations. Compare the answers with equations (36.15)— 
which are based on a much cruder approximation—and with the results in Box 36.1, which 
are based on much better approximations. [For solution, see Table 13 of Wheeler (1966).] 

Exercise 36.4. PULSAR SLOWDOWN 

The pulsar NP0532 in the Crab Nebula has a period of 0.033 seconds and is slowing down 
at the rate dP/dt = 1.35 X 10~ 5 sec/yr. Assuming the pulsar is a typical neutron star, 
calculate the rate at which it is losing rotational energy. If this energy loss is due primarily 
to gravitational radiation reaction, what is the magnitude of the star’s nonaxial deformation? 
[For solution, see Ferrari and Ruffini (1969); for a rigorous strong-field analysis, see Ipser 
(1970).] 


§36.6. BINARY STARS AS SOURCES 

The most numerous sources of weak gravitational waves are binary star systems. 
Moreover, roughly half of all stars are in binary or multiple systems [see, for example, 
the compilation of Allen (1962)]. According to Kepler’s laws, two stars of masses 
m 1 and m 2 that circle each other have angular frequency <o and separation a coupled 
to each other by the formula 


co 2 a 3 — m 1 + m 2 — M. 


Box 36.2 ANALYSIS OF BURSTS OF RADIATION FROM IMPULSE EVENTS* 



Elec tromagnetism 

Gravitation 

Typical moment relevant for radiation 

<4(0 

fjo 

Its Fourier transform 

(2 tt)~ 1/2 f d s exp [/tor] dt 

{2ir)~ U2 ji xx exp [/«/] dt 

Name for this quantity 

<4(") 


Time decomposition of total radiative energy 
loss AE 

c~ 3 fcP(t) dt 

Gc~ 5 / f 2 (t) dt 

Decomposition of AE according to circular 
frequency 

<T 3 / d 2 {u) do> 

Gc~ 5 f ?*(<■>) du 

Integrand nearly constant with respect to 
w from = 0 up to a critical value of co, 
beyond which radiation falls off very fast 

"cm ~ V& 

"cm ~ */'2l; 

— d AE/du for « < co crit 

~<r 3 5 2 ( 0) 

~Gc~ 5 ? 2 (0) 

Zero frequency moment that enters this formula 

~(«i Au *i + ^ l2 ) 

J(«Kinetic Energy»)„ 

Rewrite of —dAE/du 

~~(e Av) 2 /c 3 

~G[J(«K.E.»)„] 2 /c 5 

Total energy of pulse 

^•This/J/ 

—This/J/ 


•Box adapted from pp. 113 and 114 of Wheeler (1962). 


As sample applications of this analysis, Wheeler (1962) cites the following: 


One atomic-nucleus fission 
Parameter of 180 MeV 

Fission bomb yield 

1 7 kilo to ns at 

10% efficiency 

Meteorite striking 
earth at escape 
velocity 

Explosion pf star 
when 10~ 4 of 
mass is released 

Mass 

4 X 10-22 g 

I0 4 g 

10 9 g 

2 x 10 33 g 

Velocity 

1.2 X 10 9 cm/s 

4 X 10 8 cm/s 

11 x 10 5 cm/s 

4 X 10 8 cm/s 

Energy 

Fraction assumed 

2.9 x 10 -4 erg 

7 x 10 20 erg 

6 x 10 20 erg 

1.8 x 10 50 erg 

relevant to radia¬ 

1 

0.1 

1 

0.1 

tive moment 





Time integral of 





this moment 
= «K.E.»„ 

2.9 x 10 -4 erg 

7 x 10 19 erg 

6 x 10 20 erg 

1.8 x 10 49 erg 

«K.E.»„/c 2 

3.2 x 10-25 g 

0.08 g 

0-67 g 

2 X 10 28 g 

dE G (« K.E.»„\ 2 

2.3 X 10~ 67 erg 

1 A ^ 10 _2 ° 

i n in-iR 


dec c \ c 2 ) 

rad/s 

rad/s 

i.u x tu 10 

rad/s 

9 X 10 J »-— 

rad/s 

At 

10~ 21 s 

10-8 S 

10~ 3 s 

10 4 s 

Acc —- 1 /At 

AF 

radiated 

Assumed distance 

10 21 rad/s 

10- 46 erg 

10 8 rad/s 

10~ 12 erg 

10 3 rad/s 

IQ-is er g 

10 -4 rad/s 

10 35 erg 

to detector 

10 3 cm 

10 3 cm 

10 9 cm 

10 23 cm 

AE/Avr 2 

10-53 erg/cm 2 

10 -19 erg/cm 2 

10 -34 erg/cm 2 

10- 12 erg/cm 2 

The reader might find it informative to extend this table to the bursts of waves 


w - - - *^ VV/ Viiv ^UlOU KSl VY a V WO 

emitted by (1) debris falling into a black hole, (2) collisions between two black holes, 
and (3) a supernova explosion in which a star of two solar masses collapses to nuclear 
densities, ejecting half its mass in the process. 
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36. GENERATION OF GRAVITATIONAL WAVES 


(1) power output 


(2) effects of radiation 
reaction 


In this motion the kinetic energy is 

1 1 m x m o 

(kinetic energy) = - — (potential energy) = ———. 

The power that they radiate as gravitational waves can be estimated roughly as the 
square of the circulating power, L ~ w X (kinetic energy); thus, 

H 2 M 3 

^gw ~ 4q5 »' 


where ju = m 1 m 2 /A/ is the familiar reduced mass, and M = m l + m 2 is the total 
mass of this binary system. 

An exact calculation based on equation (36.1) gives a result larger than this by 
a factor ~30: for a binary system of semimajor axis a and eccentricity e, the power 
output averaged over an orbital period is 


L 


gw — 


32 ]a 2 M 3 
5 a 5 


■ADL 0 , 


(36.16a) 


where /(e) is the dimensionless “correction function,” 

/(e) = t 1 + % £2 + 16 £ 'I [1 " e2]_7/2 - (36 ‘ 16b) 

[See exercise 36.6 at end of §36.8; also Peters and Mathews (1963).] 

As the binary system loses energy by gravitational radiation, the stars spiral in 
toward each other (decrease of energy; tightening of gravitational binding). For 
circular orbits the energy, E = — —fyiM/a, decreases as 

dE/dt = 1/2 (nM/a 2 )(da/dt) 

32 n 2 M 3 

- ~ L GW - “7 a 5 ■ 


Consequently, the evolution of the orbital radius is given by the formula 

a = a 0 ( 1 - t/T 0 ) 1/4 , (36.17a) 


where a 0 = a today and 

_ l/~£\ 5 a* 

4 '^GW'today 256 U.V/ 2 


(36.17b) 


Thus, unless nongravitational forces intervene, the two stars will spiral together in 
a time t 0 (spiral time). For an elliptical orbit, the eccentricity also evolves. Radiation 
is emitted primarily at periastron. Therefore the braking forces of radiation reaction 
act there with greatest force. This effect deprives the stars of some of the kinetic 
energy of the excursions in their separation (“radial kinetic energy”). In consequence, 
the orbit becomes more nearly circular. [See Peters and Mathews (1963) for detailed 
calculations.] 
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The calculated power output, flux at Earth, and damping times are shown in Box 
36.3 for several known binary stars and several interesting hypothetical cases. Notice 
that in the most favorable known cases the period is a few hours; the damping time 
is the age of the universe (could the absence of better cases be due to radiation 
reaction s having destroyed them?); the output of power in the form of gravitational 
waves is —10 30 to 10 32 ergs/sec (approaching the light output of the sun, 3.9 x 10 33 
ergs/sec), and the calculated flux at the Earth is ^10 _1 ° to 10 -12 ergs/sec (too small 
to detect in 1973, but perhaps not too small several decades hence; see Chap¬ 
ter 37). 

The hypothetical cases in Box 36.3 illustrate the general relations for astrophysical 
systems that were derived in §36.4—namely, that only as the system approaches 
its gravitational radius can L GW approach L 0 , and only then can damping remove 
nearly the whole energy in a single period. 


§36.7. FORMULAS FOR RADIATION FROM NEARLY 
NEWTONIAN SLOW-MOTION SOURCES 


Turn now from illustrative astrophysical sources to rigorous formulas valid for a 
wide variety of sources. One such formula has already been written down, 


^GW “ ^ (hkhk)’ 


(36.1) 


but it has not yet been derived, nor has its realm of validity been discussed. 

This formula for the power output is actually valid for any “nearly Newtonian, 
slow-motion source”—more particularly, for any source in which 


(size of source)/(reduced wavelength of waves) <1, (36.18a) 

(Newtonian potential! < (size of source)/(reduced wavelength), (36.18b) 


|typical stresses! 
(mass density) 


(size of source) 
(reduced wavelength)' 


(36.18c) 


It is not valid, except perhaps approximately, for fast-motion or strong-field sources. 
Moreover, there is no formalism available today which can handle effectively and 
in general the fast-motion case or the strong-field case. 

The rest of this chapter is devoted to a detailed analysis of gravitational waves 
from nearly Newtonian, slow-motion sources. But the analysis (Track 2; §§36.9- 
36.11) will be preceded by a Track-1 summary in this section and the next. 

For any source of size R and mean internal velocity v , the characteristic reduced 
wavelength (A = \/2tt) of the radiation emitted is A — (amplitude of mo¬ 
tions)/?; < R/v. Consequently the demand (36.18a) that R/X be < 1 [i.e., that the 
source be confined to a small region deep inside the near (nonradiation) zone] 
enforces the slow-motion constraint 


(3) particular binaries 

observed by astronomers 


The "nearly Newtonian, 
slow-motion approximation" 
for analyzing sources of 
gravitational waves 


v < 1. 
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These related conditions, v < 1 and R < A, are satisfied by all presently conceived 
laboratory generators of gravitational waves. No one has seen how to bring a 
macroscopic mass up to a speed v — 1. These conditions are also satisfied by every 
gravitationally bound, nearly Newtonian system. Thus, for such a system of mass 
M, the condition for gravitational binding, £ Mv 2, < M 2 /R guarantees that 
v<(M/R) 1/2 4 1. 

The conditions M/R < R/X and \T jk \/T°° < R/X are satisfied by all nearly 
Newtonian sources of conceivable interest. Typical sources (e.g. binary stars) have 

M _ \ P k \ //?\ 2 R 
r~~ T° 0 \ A / ^ A 

(virial theorem). In those rare cases where (M/R or \P k \/T 00 ) > R/X (e.g., a mar¬ 
ginally stable, slowly vibrating star), the motion is so very slow that the radiation 
will be too weak to be interesting. 

For any nearly Newtonian slow-motion system, there is a spacetime region deep 
inside the near zone (r < X ), but outside the boundary of the source (r > R ), in 
which vacuum Newtonian gravitation theory is nearly valid. An observer in this 
Newtonian region can measure the Newtonian potential 0 and can expand it in 
powers of 1/r: 

g = -(^ + ^ + 3 Vf‘ + where «-„*</,. (36.19a) 

He can then give names to the coefficients in this expansion: 

M= “total mass-energy” = “active gravitational mass”; 

d t = “dipole moment” [if he chooses the origin of coordinates (36.19b) 
carefully, he can make d f = 0]; 

i jk = “reduced quadrupole moment” [because the system is nearly 

Newtonian, is given 
by expression (36.3)]. 

As this Newtonian potential reaches out into the radiation zone, the static portions 
of it (— M/r — djn’/r 2 ) maintain their Newtonian form, unchanged. But the dynamic 
part (—§ f jk n>n k /r 3 ) ceases to be describable in Newtonian terms. As retardation 
effects become noticeable (at increasing r values), it gradually changes over into 
outgoing gravitational waves, which must be described in the full general theory 
of relativity, or in linearized theory, or in the “shortwave” approximation of §35.13. 

If one chooses to use linearized theory in the radiation zone, and if one imposes 
the transverse-traceless gauge there (h™ - 0, hj? = 0, hj? k = 0), then the gravita¬ 
tional waves take the form [derived later as equation (36.47)] 

hjk = — ff k (t - r) + corrections of order j^-L fj^(t - r ) j. (36.20) 


Definitions of mass, dipole 
moment, and reduced 
quadrupole moment for a 
slow-motion source 


Properties of gravitational 
waves in terms of reduced 
quadrupole moment: 


(1) the wave field hj£ 
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Here fjjf is the second time-derivative of the transverse-traceless part of the quad- 
rupole moment (transverse to the radial direction; see §35.4); thus, 


i n _ 

f jk ~ 


^ja^ab^bk 


J_ Dpi 

2 1 jk 1 ab J ab"> 


Pib = (&ab ~ n a n b) (projection operator), 
n a — x a /r (unit radial vector). 


(36.21) 


The effective stress-energy tensor for these outgoing waves (§35.7) has the same form 
as for a swarm of zero-mass particles traveling radially outward with the speed of 
light; at large distances its components of lowest nonvanishing order are 


(2) effective stress-energy 
tensor 


nr = -n? w = 

= - 2n&;f ik ri k + 1 (n;t jk n k f), (36.22) 


where < > denotes an average over several wavelengths. (Recall that one cannot 

localize the energy more closely than a wavelength!) The total power crossing a 
sphere of radius r at time t is 

(3) total power radiated L GW (t, r) = J T (GW)0r r 2 dO = —(ij k (t — r)fj k (t — /*)). (36.23) 


(See exercise 36.9.) This is the formula with which this chapter began: equation 
(36.1). 

The wave fronts are not precisely spherical. For example, for a binary star system 
the wave fronts in the equatorial plane must be spirals. This means that there is 
a tiny nonradial component of the momentum flux, which decreases in strength as 
1/r 3 . Associated with this nonradial momentum is an angular momentum density 
(angular momentum relative to the system’s center, r = 0), which drops off as 1/r 2 
[Peters (1964), as corrected by DeWitt (1971), p. 286]: 


(4) density of angular p = -6 n s f k J mp n p + 9«,4 m Vp * p ,« 9 >- < 36 - 24 ) 

momentum 8 ttH 

The integral of this quantity over a sphere is the total angular momentum being 
transported outward per unit time, 


(5) total angular momentum 
radiated 


— dJj/dt = f V 2 dS 2 = | €* £ < 4 TaX>- ( 36 - 25 ) 


(See exercise 36.9.) 




§36.8. RADIATION REACTION IN SLOW-MOTION SOURCES 


993 


§36.8. RADIATION REACTION IN SLOW-MOTION SOURCES* 

The conservation laws discussed in Box 19.1 and derived in §20.5 guarantee that 
the source must lose energy and angular momentum at the same rate as the gravita¬ 
tional waves carry them off. The agent that produces these losses is a tiny component 
of the spacetime curvature inside the source, which reverses sign if one changes from 
a (realistic) outgoing-wave boundary condition at infinity to the opposite (unrealistic) 
ingoing-wave condition. These “radiation-reaction” pieces of the curvature can be 
described in Newtonian language when the source obeys the nearly Newtonian, 
slow-motion conditions (36.18). 

The dynamical part of the Newtonian potential, in its “standard form” 

& = — y: f jk (t) n f n k /r 3 + 0(1/r 4 ); equation (36.18), 

has no retardation in it. (Newtonian theory demands action at a distance!) Conse¬ 
quently, there is no way whatsoever for the standard potential to decide, at large 
radii, whether to join onto outgoing waves or onto ingoing waves. Being undecided, 
it takes the middle track of joining onto standing waves (half outgoing, plus half 
ingoing). But this is not what one wants. It turns out (see §36.11) that the join can 
be made to purely outgoing waves if and only if 0 is augmented by a tiny “radia¬ 
tion-reaction” potential 


^ ^standardNewtonian theory ”b </^ react *, (36.26a) 

1 d 5 f 

#(react) _ __jL x i x k (36.26b) 

If, instead, one sets*? = 0 standard - <Z' (react) , the potential will join onto purely ingoing 
waves. 

In order of magnitude, the radiation-reaction potential is 

^(reaco _ J_ {MR 2y2 _ ^ (36.27) 

Consequently, near the source it is tiny compared to the standard Newtonian 
potential [a factor ( R/ A) 5 ~ v 5 smaller!]. However, at the inner boundary of the 
radiation zone (r ~ A), it is of the same order of magnitude as the dynamic, quadru¬ 
ple part of the standard potential. 

The radiation-reaction part of the Newtonian potential plays the same role as 
a producer of accelerations that any other part of the Newtonian potential does. 
Any particle in the Newtonian region experiences a gravitational acceleration given 
by 

a > = -*J = -Standard~ (36.28) 


*The ideas and formalism described in this section were devised by Burke (1970), Thome (1969b), 
and Chandrasekhar and Esposito (1970). Among the forerunners of these ideas were the papers of Peters 
(1964), and Peres and Rosen (1964). 


Outgoing-wave boundary 
condition gives rise to a 
Newtonian-type 
radiation-reaction potential 


Form and magnitude of the 
radiation-reaction potential 


Effects of the potential: 


(1) radiation-reaction 
accelerations 
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Inside the source, this acceleration leads to energy and angular momentum losses 
given by 


dE/dt — J pa j v j d 3 x (36.29a) 

and 

dJj/dt = J e jkl x k pa l d 3 x. (36.29b) 


(2) loss of energy and 
angular momentum 


(Here p is the density, u j is the velocity, and a j as above is the acceleration of the 
matter in the source.) Standard Newtonian theory conserves the energy and angular 
momentum. Therefore only the reaction part of the potential can produce losses: 


dE/dt = — j p^ j eaci) v j d 3 x , 
dJj/dt = - J e jklXkP <P< r * d 3 x. 


(36.30) 


A straightforward calculation (exercise 36.5) using expression (36.26b) for the reac¬ 
tion potential yields, for the time-averaged losses, 


dE/dt = -j(ij k i jk /, 
dJj/dt = - <,hahi>- 


(36.31) 


Notice that these results agree with the energy and angular momentum carried 
off by the radiation as given by equations (36.1) and (36.25). The agreement is an 
absolute imperative. The laws of conservation of total energy and angular momen¬ 
tum demand it. 

A slow-motion electromagnetic system emitting electric dipole radiation has a 
radiation-reaction potential 


Radiation-reaction potential 
for electromagnetic waves 


A$ react) - (J, 


^ (react) __ 


_ 0 (react) _ 



(36.32) 


which is completely analogous to <£ (react) of gravitation theory [see, e.g., Burke (1971)]. 
However, attention does not usually focus on this potential and the reaction forces 
it produces. Instead, it focuses on the reaction force in a special case: that of an 
isolated charge being accelerated by nonelectromagnetic forces. For such a charge, 
the reaction force is 

fffieact) = l e 2x. (36.33) 

3 


No such formula is relevant to gravitation theory, because there is no such thing 
as a gravitationally isolated, radiating particle (i.e., one accelerated by forces that 
have no coupling to gravity). 
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Exercise 36.5. ENERGY AND ANGULAR MOMENTUM LOSSES DUE TO EXERCISES 

RADIATION REACTION 

Derive equations (36.31) for the rate at which gravitational radiation damping saps energy 
and angular momentum from a slow-motion source. Base the derivation on equations (36.26b) 
and (36.30). 


Exercise 36.6. GRAVITATIONAL WAVES FROM BINARY STAR SYSTEMS 

Apply the full formalism of §§36.7 and 36.8 to a binary star system with circular orbits. 
Calculate the angular distribution of the gravitational waves; the total power radiated; the 
total angular momentum radiated; the radiation-reaction forces; and the loss of energy and 
angular momentum due to radiation reaction. Compare the answers with the results quoted 
in §36.6. [For further details of the solution, see Peters and Mathews (1963).] 


§36.9. FOUNDATIONS FOR DERIVATION OF 
RADIATION FORMULAS 

Turn now from the formulas for radiation from a nearly Newtonian system in slow 
motion to a derivation of these formulas. Initially (this section) work in the full 
general theory of relativity without any approximations—not even that of slow 
motion. Impose only the constraint that the source be isolated, and that spacetime 
become asymptotically flat far away from it. 

Use a coordinate system that becomes asymptotically Lorentz as rapidly as space- 
time curvature permits, when one moves radially outward from the source toward 
infinity. Everywhere in this coordinate system, even inside the source, which may 
be relativistic, define 


- V- (36.34) 

The are clearly not the components of a tensor. Neither is tj the true metric 
tensor. Nevertheless, one is free to raise and lower indices on h^ with tj and to 
define 



1 

2 V 


h; 


h = hfi = h ap r 0 - 


(36.35) 



-. 

The rest of this chapter is 
Track 2. Chapter 20 
(conservation laws) is needed 
as preparation for it. It will be 
helpful in Chapter 39 
(post-Newtonian formalism), 
but is not needed as 
preparation for any other 
chapters. 

_/ 

Derivation of formula for the 
gravitational-wave field 
produced by a slow-motion 
source: 


(1) definition of 


Moreover, one can always specialize the coordinates so that the four conditions 

V\ a = 0 (36.36) 

are exactly satisfied everywhere, including the interior of the source. 

With these definitions and conventions, h^ v becomes the gravitational field of 
linearized theory far from the source, and also inside the source if gravity is weak 
there. But if the interior gravity is strong not < 1), in the interior has no 
connection whatsoever to linearized theory. 
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(2) field equations in terms 
of h^ 


(3) philosophy of controlled 
ignorance 


(4) integral formulation of 
field equations 


(5) specialization to slow 
motion 


The exact Einstein field equations can be written in terms of h** as [cf. §20.3; 
in particular, combine equations (20.14), (20.18), and (20.3); and impose the coordi¬ 
nate condition (36.36)] 

“ IfotT* 9 + (36.37) 

where T^ are the components of stress-energy tensor, and t* v are quantities 
(components of the “stress-energy pseudotensor for the gravitational field’') that are 
of quadratic order and higher in h^. Recall the “philosophy of controlled ignorance” 
expounded in §19.3. One is so ignorant that nowhere does one ever write down an 
explicit expression for t^ v in terms of h a &\ and this ignorance is so controlled that 
one will never need such an expression in the calculations to follow! More specifi¬ 
cally, the strength of the outgoing wave is proportional to the integral of a compli¬ 
cated expression over the interior of a system where “gravitational stresses may 
be comparable to material stresses, \t ik \ ~~ \T* k |. No matter. All that will count for 
the radiation is the quadrupole part of the field. Moreover, that quadrupole moment 
is empirically definable by purely Newtonian measurements in the Newtonian region 
(1) well inside the wave zone, but (2) well outside the surface of the source. One 
does not have to know the inner workings of a star to define its mass (influence 
on Kepler orbits outside) nor does one have to know those inner workings to define 
its quadrupole moment as sensed externally. 

Einstein’s equations (36.37), augmented by an outgoing-wave boundary condition, 
are equivalent to the integral equations 

X*) = 4 f [? 7 + d 3 x\ (36.38) 

all space I* _ . 

where 

|x — x'l = f ^ (x> — xO 2 ] , d 3 x’ = dx v dx 2 ' dx 3 ', 
j 

and the subscript “ret” means the quantity is to be evaluated at the retarded 
spacetime point 

(t' = t - |jc - x'|, x r ). 

These are integral equations because the unknowns, h^ v , appear both outside and 
inside the integral (inside they are contained in t^ v ). Notice that in passing from 
the wave equations (36.37) to the integral equations (36.38), one has cavalierly 
behaved as though h^ were fields in flat spacetime. This is certainly not true; but 
the mathematical manipulations are valid nevertheless!—and the integral equations 
(36.38) are valid for any field point (t, jc j ), even inside the source. 


§36.10. EVALUATION OF THE RADIATION FIELD IN 
THE SLOW-MOTION APPROXIMATION 

Thus far the analysis has been exact. Now it is necessary to introduce the slow-motion 
assumption of §36.7: R < X. 
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A slow-motion source radiating gravitational waves. The origin of spatial coordinates is located inside 
the source. The size of the source, R , is very small compared to a reduced wavelength, R 4 X. Significant 
contributions to the retarded integral (36.38) for come only from a region of size L ~~ R < X 
surrounding the source, because outside the source—but in the near zone (R < r < X )—the “stress-energy 
pseudotensor” t dies out as 1/r 4 (see exercise 36.7). 

In the radiation zone, t^ v ceases to die out as 1/r 4 , and begins to die out as 1/r^; it is trying to describe 
(but cannot, really, without appropriate averaging) the stress-energy carried by the gravitational waves. 
If the source has been emitting waves long enough, contributions from the radiation zone to the retarded 
integral (36.38) may be nonnegligible: 


[''“Ire. ~ \ — f ['"Ire, d 3 x' ~ f — r' 2 dS2' dr 
| r‘ J J r'~ _, 

[for r > X ]—* Tmay have significant contri- 
Lbutions from large r f 

Such contributions are ignored in the text, in calculations of the radiated waves, because they have nothing 
whatsoever to do with the emission process itself. Rather, they are part of the propagation process treated 
in the last chapter. They include the background curvature produced by the stress-energy of the waves, 
scattering of waves off the background curvature, wave-wave scattering, etc.; and they are totally negligible 
in the neighborhood of the source itself (r < 1,000 X, for example) because a slow-motion source radiates 
so very weakly. 



Place the origin of spatial coordinates inside the source, as shown in Figure 36.3. 
For slow-motion systems, the only significant contributions to the retarded integrals 
(36.38) come from deep inside the near zone (from a region of size L ~ R 4 A; 
see Figure 36.3). Confine attention to “field points” (points of observation) x> far 
outside this “source region,” 
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(6) calculation of h ik in 
radiation zone 


(7) specialization to nearly 
Newtonian sources 


|x| = r > L > |jc'|, (36.39a) 

and expand the retarded integral (36.38) in powers of x'/r —in just the same manner 
as was done in §19.1. (Such an expansion is justified by and requires the slow-motion 
assumption, A/7? ~ A/L < 1.) The result is 

x) = - f [T^(x\ t - r) + t* r (x’, l - r)\ d 3 x' 

r . , (36.40) 

+ O (■-|L J x^r^x', t - r) + /"*•(jc\ t - r)] d 3 x'\. 

Of the ten components of only the six spatial ones, h’ k , are of interest, since 
only they are needed in projecting out the transverse-traceless radiation field TtjT 
The spatial components are expressed by equations (36.40) in terms of integrals over 
the “stress distribution” T ik + t ik . It will be convenient, in making comparisons with 
Newtonian theory, to reexpress h’ k in terms of integrals over the “energy distribu¬ 
tion” T 00 + t 00 . One can make the conversion with the help of the exact equations 
of motion T**., = 0, which have the special form 

(7>- + //*») >r = 0 (36.41) 

in the coordinate system being used [see equations (36.36) and (36.37); also the 
discussion in §20.3]. Applying these relations twice in succession, one obtains the 
identity 

(r oo + ,oo } oq = _ {T oi + ,oi )w = _ (T io + t lo ) ol 

= +(T lm + 

From this and the elementary chain rule for differentiation, it follows that 

[(T-oo + / 00 )jc i x t ]oo = {T im + t Lm ) ml x i x k 

= [(r £m + t lm )x’x k ] ml - 2[(T l > + t l >)x k + (T ik + t^x’h 
+ 2 (P k + t’ k ), 

whence 

J (T* + t*) d 3 x = j- (d%/dt 2 ), (36.42a) 

where 

I jk (t) = J [T 00 (t, x) + t 00 (t, x)]x>x k d 3 x. (36.42b) 

Now introduce the nearly Newtonian assumption. It guarantees that gravitation 
contributes only a small fraction of the total energy: 

t oo ~ (<p, ,)2 ~ M 2 /R* ~ ( M/R)T°° < T 00 ; 


hence 

I jk (t) = J T°°(e, x)x>x k d 3 x. (36.42b') 

The quantity I jk thus represents the second moment of the mass distribution. 
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By combining equations (36.42) and (36.40), and by noting that inside the source 
\t’ k \ ~ fc | ~ 7’ 00 |<P|, one obtains 


h’ k (t, x ) 


2 - r) 

r dt 2 

2 d2l ik(‘ ~ r) 
r dt 2 


+ o[ i(^ + „)A,] 


(36.43) 


[negligible by assumptions (36.18 



Actually, what one wants are not h }k . They can be obtained by first lowering 
indices, using i\ lm — 5 lm , and then projecting out the TT part using the projection 
operator for radially traveling waves: 

Pim = «i» - «x« m ; «i = x l /r (36.44) 



(8) conversion, by projection, 
to hj? 


(see Box 35.1). (Because h ik and h jk differ only in the trace, they have the same 
TT parts). The result is 


hjk(t, x ) 



(36.45a) 


where 

1% = PnhmPmk - j PflciPtM- (36.45b) 


This is not the best form in which to write the answer, because an external observer 
cannot measure directly the second moment of the mass distribution, I jk . Fortunately, 
one can replace I jk by the reduced quadrupole moment, 

t,k = hk - j v = / ( T °° + l ° 0 ) ( xixk - J V 2 ) d 3 x, (36.46) 


(9) reexpression of h jjf in 
terms of reduced 
quadrupole moment 


and write 


h%{t,x) 


2 

r 


d 2 m< - r) 


dt 2 


(36.47) 


This is allowed because the TT parts of I jk and are identical (exercise 36.8). 

The reduced quadrupole moment has a well-defined, elementary physical 
significance for an observer confined to the exterior of the source. In the near zone 
(r A), but outside the source so that vacuum Newtonian theory is very nearly valid, 
the Newtonian potential is 

0 = -1*00 = -}a°°= - y(* 00 + i A) = - i(A°<> + h») 

[r 00 + t 00 + T» + /«] ret , 

all space I* " *1 


[see equation (36.38)]. Any nearly Newtonian, slow-motion source satisfies 


|r°° + T» + t»\ < r 00 
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EXERCISES 


[recall: t ati ~ (QJ) 2 ~ 7' 00 |$|). Hence, one can write 

4>(x, l) = - d 3 x'. (36.48) 

J \x - x'\ 

Expanding jx — x'| _1 in powers of 1/r, one obtains 


<P 


(M dtX* 


2r 5 



lr < A, but r nevertheA 
less large enough that 
vacuum Newtonian 
\ theory is valid / 


(36.49a) 


where 

M — (total mass-energy of source) = J T 00 d 3 x , 

d i = (dipole moment of source) — J T 00 x i dx s , (36.49b) 

ifc = (reduced quadrupole moment of source) = expression (36.46). 

Thus, the quantities whose second time-derivatives determine the radiation field 
by equation (36.47), are precisely the components of the star's reduced quadrupole 
moment, as measured by an observer who explores its Newtonian potential <P deep inside 
the near zone (r < A) (“empirical quadrupole moment.”). 

The final answer (36.47) for the radiation field in terms of was quoted in the 
summary of results given in §36.7. Also quoted there were expressions for the 
effective stress-energy tensor of the radiation and for the energy and angular mo¬ 
mentum radiated [equations (36.22) to (36.25)]. Those expressions can be derived 
using the formalism of the shortwave approximation. (See exercise 36.9.) 


Exercise 36.7. MAGNITUDE OF t» v 

Consider a slow-motion source of gravitational waves. Show that far from the source, but 
in the near zone (R < r < A) the components of the “stress-energy pseudotensor” t** die 
out as 1/r 4 , but in the radiation zone (r > A) they die out only as 1/r 2 . 

Exercise 36.8. PROOF THAT THE TRANVERSE TRACELESS PARTS OF 
t jk AND J jk ARE IDENTICAL 

Prove by direct computation that the TT parts of I jk (36.42b) and f k (36.46) are identical, 
no matter where the observer is who does the TT projection (i.e., no matter what the unit 
vector n in the projection operator may be). 

Exercise 36.9. ENERGY AND ANGULAR MOMENTUM RADIATED 

(a) For the gravitational waves in asymptotically flat spacetime described by equation (36.47), 
calculate the smeared-out stress-energy tensor T^ w) of equation (35.23). [Answer: equation 
(36.22).] 
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(b) Perform the integrals of equations (36.23) and (36.25) to obtain the total power and 
angular momentum radiated. [Hint: Derive and use the following averages over a sphere 

hi n < da = 0’ hi n * n i da = j *«• hi w * da = °> 

— J* dS2 = — (S tj S kl + S ik Sj, + S u S jk ). 

Here n == x/|x| is the unit radial vector.] 



§36.11. DERIVATION OF THE RADIATION-REACTION 
POTENTIAL 


Turn, finally, to a derivation of the radiation-reaction results quoted in §36.8. The Derivation of formula for the 
analysis starts with the solution (36.43) for the spatial part of the radiation field radiation-reaction potential: 
in the original (i.e., not 7T) gauge: 

= - r). (36.50) 

Although this solution was originally derived by discarding all terms that die out 
faster than 1/r, it is in fact an exact solution to the vacuum field equations h jk a a = 0 
of linearized theory. This means that it is valid in the intermediate and near zones 
(r < A, but r > R) as well as in the radiation zone. 

Were one to replace the outgoing-wave condition by an ingoing-wave condition 
at infinity, the exact solution (36.50) for h* k would get replaced by 

h> k (t,x) = + r). 


Thus, in order to delineate the effects of the outgoing-wave boundary condition, 
one can write the exact solution in the form 

h jk (t, X) = h*(t, x) = j- i jk (t - tr), € = ±1, (36.51) 


(1) formula for h jk anywhere 
outside source, with 
either outgoing or 
ingoing waves 


and then focus attention on the effects of the sign of c. 

In the near zone (r < A), but outside the nearly Newtonian source, this solution 
for h jk , as expanded in powers of r, becomes 


r ri 2 > 

h . h -2 - 


7 (4) r 
€ /< 3 ) . jjk 1 

ik + 2! 


IJtl 

3! 



(36.52a) 


(2) h jk specialized to near 
zone 


where 


i { #=d"i )k {t)/dt\ 
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The corresponding forms of h 0j and h 00 can be generated from this by the gauge 
conditions = 0; i.e., by h j0 0 = h jkk and /; 00 0 = h 0j j . The results are: 


(3) h 00 and h 0j in near zone 
calculated by gauge 
conditions 


'o j 


= 2 - 


I*** + ifk xk f 2/#y 1 _ c 


4 Ifx k r 2 
5! 




(36.52 b) 


4- (static terms not associated with radiation); 

r ps's*-r«<») (x>x k -r*8*) 2 

n 00 — 1 [ r 5 l fk 2 !r 3 ik 31 

+ 3 (xV + r’S*) pjg _ ; + j _^ 52c) 

4 - (static and time-linear terms not associated with radiation). 


(4) plucking out the 
radiation-reaction 
potentials from h afi 


The leading term in these expressions rises as 1/r 3 when one approaches the source: 
r _2{3x*x k - r 2 & k ) T 61 jk n*n k 

^00 — r 5 h* “ r 3 


It is precisely the leading term in the dynamic, quadrupole part of the Newtonian 
potential, <P = -\h 00 = -\h 00 . All other terms without e’s in front of them are 
corrections to the Newtonian potential. They produce effects like the perihelion shift 
of Mercury that in no way deplete the energy and angular momentum of the system. 

The terms with c’s are associated with radiation reaction. Pluck the leading ones 
out and call them “reaction potentials”: ...-" 


A8r° = - 

^-(rearf = _ I/O) _ -I-(2x'X* + r 2 S*)/*- 


(36.53) 


The corresponding metric perturbations h a/3 = h a/3 — are 
*Sr° = -2/}? + + 0(/$r 2 ), 

/»Sr 0 = - + 0(/fr 3 ), 

AST 0 = ~ y /ff - -jj + 3 ' 2 


(36.54) 


These reaction potentials in the near zone are understood most clearly by a change 
of gauge that brings them into Newtonian form. Set 



§36.11. DERIVATION OF RADIATION-REACTION POTENTIAL 


1003 


*"new = *"old + $"(*). h ^„ = ~ to,* ~ *r,„ (5) COnversion of 

radiation-reaction 

with potentials to Newtonian 

gauge 

t,= -I$x*+±W X i, 

io = - J /i? + } - jI fir 2 - (36.55) 

Then in the new gauge 

= 0(1$ r 2 ), /i" = 0(1$ r 3 ), 

^oo act) = - J t$x’x k . (36.56) 

This gauge is ideally suited to a Newtonian interpretation, since in it the geodesic 
equation for slowly moving particles has the form 


d 2 x>/dt 2 = -<£<pact) + 


( terms not sensitive to 
outgoing-wave condition 


(36.57) 


with 


0 (react) = _ 1 /,freact) _ 1 f{5 >*/**. (36.58) 

Thus, the leading radiation-reaction effects (with fractional errors — [A/r] 2 ) can 
be described in the near zone of a nearly Newtonian source by appending the term 
to the Newtonian potential. The resulting formalism and a qualitative 
version of the above derivation were presented in §36.8. 
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DETECTION OF 
GRAVITATIONAL WAVES 

/ often say that when you can measure what you are speaking about, and 
express it in numbers, you know something about it; but when you cannot 
measure it, when you cannot express it in numbers, your knowledge is of a 
meagre and unsatisfactory kind: it may be the beginning of knowledge, but you 
have scarcely, in your thoughts, advanced to the stage of science, 

whatever the matter may be. 

WILLIAM THOMSON, LORD KELVIN [(1889), p. 73] 


V 


Linearized description of 
gravitational waves 
propagating past Earth 


§37.1. COORDINATE SYSTEMS AND IMPINGING WAVES 

The detector is even easier to analyze than the generator or the transmission when 
one deals with gravitational waves within the framework of general relativity. Man’s 
potential detectors all he in the solar system, where gravity is so weak and spacetime 
so nearly flat that a plane gravitational wave coming in remains for all practical 
purposes a plane gravitational wave. (Angle of deflection of wave front passing limb 
of sun is only 1" 75.) Moreover, the nearest source of significant waves is so far away 
that, for all practical purposes, one can consider the waves as plane-fronted when 
they reach the Earth. Consequently, as they propagate in the z-direction past a 
detector, they can be described to high accuracy by the following transverse-traceless 
linearized expressions 


Metric perturbation: h 7 x J = —h™ = ^+( ? — z ’)» — — — z )’ (37-l a ) 

Riemann tensor: R x0x0 = -Ryo v o = ~ 4-4+(< — z \ 


R X 0v0 — RyOxO — 2 Z )' 


(37.1b) 
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Figure 37.1. 

The proper reference frame of a vibrating-bar detector. The bar hangs by *a wire 
from a cross beam, which is supported by vertical posts (not shown) that are 
embedded in the Earth. Consequently, the bar experiences a 4-acceleration given, 
at the moment when this diagram is drawn, by a = g{d/dz), where g is the “local 
acceleration of gravity” (g *** 980 cm/sec 2 ). Later, the spatial axes will have rotated 
relative to the bar (“Foucoult-pendulum effect” produced by Earth’s rotation), so 
the components of a but not its magnitude will have changed. 

The proper reference frame relies on an imaginary clock and three imaginary 
gyroscopes located at the bar’s center of mass (and shown above in a cut-away 
view). Coordinate time is equal to proper time as measured by the clock, and the 
directions of the spatial axes d/dx? are attached to the gyroscopes. The forces that 
prevent the gyroscopes from falling in the Earth’s field must be applied at the 
centers of mass of the individual gyroscopes (no torque!). 


Stress-energy: Tjg w > = T<° w > = - r<g w > = —<4 + 4> Umeavg . (37.1c) 

(See exercise 37.1.) 

To analyze most easily the response of the detector to these impinging waves, 
use not the TT coordinate system {*“} (which is specially “tuned” to the waves), 
but rather use coordinates {*“} specially “tuned” to the experimenter and his 
detector. The detector might be a vibrating bar, or the vibrating Earth, or a loop 
of tubing filled with fluid (see Figures 37.1 and 37.2). But whatever it is, it will 
have a center of mass. Attach the spatial origin, x> = 0, to this center of mass; and 
attach orthonormal spatial axes, d/dx’, to (possibly imaginary) gyroscopes located 
at this spatial origin (Figure 37.1). If the detector is accelerating (i.e., not falling 
freely), make the gyroscopes accelerate with it by applying the necessary forces at 
their centers of mass (no torque!). Use, as time coordinate, the proper time* 6 = t 
measured by a clock at the spatial origin. Finally, extend these locally defined 
coordinates x“ throughout all spacetime in the “straightest” manner possible. (See 



Proper reference frame of a 
detector 
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EXERCISES 


Equations of motion 
mechanical detector 


Track 2’s §13.6 for full details.) The metric in this “proper reference frame of the 
detector ” will have the following form 

ds 2 _ _(i + 2 a<xi)(dx 6 ) 2 + % dx> dx k + 0(\x>\ 2 ) dx« dx&. (37.2) - 

[equation (13.71) with to 1 = 0]. Here a* are the spatial components of the detector’s 
4 -acceleration. (Since a must be orthogonal to the detector’s 4-velocity, vanishes.) 
Notice that, except for the acceleration term in £56 (“gravitational redshift term”; 
see §38.5 and exercise 6 . 6 ), this reference frame is locally Lorentz. 


Exercise 37.1. GENERAL PLANE WAVE IN TT GAUGE 

Show that the most general linearized plane wave can be described in the transverse-traceless 
gauge of linearized theory by expressions (37.1). [Hint: Express the plane wave as a superpo¬ 
sition (Fourier integral) of monochromatic plane waves, and describe each monochromatic 
plane wave by expressions (35.16). Use equations (35.10) and (35.23) to calculate R afiyS 
and Tf™\] 

Exercise 37.2. TEST-PARTICLE MOTION IN PROPER REFERENCE FRAME 

Show that a slowly moving test particle, falling freely through the proper reference frame 
of equation (37.2), obeys the equation of motion (geodesic equation) 

d 2 xfdi 2 = -a- + 0(\x k \). 

Thus, one can interpret — a* as the “local acceleration of gravity” (see caption of Fig¬ 
ure 37.1). _ .. 


§37.2. ACCELERATIONS IN MECHANICAL DETECTORS 

i 

The proper reference frame of equation (37.2) is the closest thing that exists to the 
reference frame a Newtonian physicist would use in analyzing the detector. In fact, 
for a it is so nearly Newtonian that (according to the analysis of Box 37.1) the equations 
of motion for a mechanical detector , when written in this proper reference frame , take 
their standard Newtonian form and can be viewed and dealt with in a fully Newtonian 
manner , with one exception: the gravitational waves produce a driving force of non- 
Newtonian origin, given by the familiar expression for geodesic deviation 

( force per unit mass (i.e., acceleration) \ 
of a particle at x j relative to detector’s 
center of mass at x j — 0 i 

To use this equation, and to calculate detector cross sections later, one must know 
the components of the curvature tensor R a $y$, and of the waves’ stress-energy tensor, 
T ( fi w) , in the detector’s proper reference frame. One cannot calculate R a $y% directly 


_ /d 2 x’\ 

~\dFi 


(37.3) 


due to waves 


(Rjofco )due to waves ^ * 
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Box 37.1 DERIVATION OF EQUATIONS OF MOTION FOR 
A MECHANICAL DETECTOR 

Consider a “mass element” in a mechanical detector (e.g., a cube of aluminum one 
millimeter on each edge if the detector is the bar of Figure 37.1; or an element 
of fluid with volume 1 mm 3 if the detector is the tube filled with fluid shown in 
part h of Figure 37.2). This mass element gets pushed and pulled by adjacent matter 
and electromagnetic fields, as the medium of the detector vibrates or flows or does 
whatever it is supposed to do. Let 

t _/ 4-force per unit mass exerted on mass-element \ 

f \by adjacent matter and by electromagnetic fields/* ^ 

This 4-force per unit mass gives the mass element a 4-acceleration V u u = /; or, 
in terms of components in the detector’s proper reference frame, fi — Du)/dr. 
Assume that the mass element has a very small velocity (u < 1) in the detector’s 
proper reference frame (i.e., relative to the detector’s center of mass). Then, ignoring 
terms of 0(v 2 ) y 0(|jc j '| 2 ), and 0(\x^\v), one has [see equation (37.2)] 

dt/dr — = 1 — a-x’ = 1 — a • x, ( 2 ) 

and 

f ] = d 2 x’/dr 2 + r^u a u# = (d 2 x’/dt 2 + r> w )( 1 - 2 a • x). (3) 


Exercise 37.3 calculates jH’gg to precision of Odjc^j). Inserting its result and rearrang¬ 
ing terms, one finds that 

d 2 x j /dt 2 = (1 4- 2a • x) fi — a)( 1 + a • x) — (4) 


(“equation of motion for mass element”). 

Examine this equation, first from the viewpoint of an Einsteinian physicist, and 
then from the viewpoint of a Newtonian physicist. 

The Einsteinian physicist recognizes d 2 x*/dt 2 as the “coordinate acceleration” of 
the mass element—but he keeps in mind that, to precision of 0( |x i | 2 ), coordinate 
lengths and proper lengths are the same [see equation (37.2)]. The coordinate 
acceleration d 2 x’/dt 2 has three causes: (1) the externally applied force , 

(1 + 2a • X)p = (<Px*/dt ^externallorce ( 5 a) 

= (1 + 2 a- x)(d 2 x>/dT 2 ) exteTnaltOTCe 


(the origin of the a • x correction is simply the conversion between coordinate time 


1008 


37. DETECTION OF GRAVITATIONAL WAVES 



Box 37.1 (continued) 


and proper time); ( 2 ) the “inertial force” due to the acceleration of the reference 
frame, 

1 +«**) = (d 2 X>/di 2 )toertja force ( 5b ) 


(see exercise 37.4 for explanation of the a*x correction); and (3) a “Riemann 
curvature force” which will include Riemann curvature due to local, Newtonian 
gravitational fields (fields of Earth, sun, moon, etc.), plus Riemann curvature due 
to the impinging gravitational waves, 

— (^ 6 fc 0 )waves-*k (R o)New ton fields* ^ ~ ( (P'xfdt ) CU rvature (^ C ) 

(linear superposition because all gravitational fields in the solar system are so weak). 
This “Riemann curvature force” is not, of course, “felt” by the mass element; it 
does not produce any 4-acceleration. Rather, like the inertial force, it originates in 
the choice of reference frame: The spatial coordinates x* measure proper distance 
and direction away from the detector’s center of mass; and Riemann curvature tries 
to change this proper distance and direction (“relative acceleration;” “geodesic 
deviation”). 

A Newtonian physicist views the equation of motion (4) in a rather different 
manner. Having been told that the spatial coordinates x* measure proper distance 
and direction away from the detector’s center of mass, he thinks of them as the 
standard Euclidean spatial coordinates of Newtonian theory. He then rewrites 
equation (4) in the form 

d 2 xi/dt 2 = Fi - (6) 

i 

where 

( Newtonian force per unit mass\ ^ 

acting on mass element / 

= (1 + * x)p — a*(l 4- a • Jt) — (^'ofcoWwton fields*^ 


The Newtonian physicist is free to express F j in a form more familiar than tfiis. 
He can ignore the subtleties of the a • x “redshift effects” because (1) they are small 

\a’(a • x)| ~ | f\a • x)\ < |(*Wwaves**l; (8) 

and ( 2 ) they are steady in time, and therefore—by contrast with the equally small 
wave-induced forces—they cannot excite resonant motions of the detector. Also, he 
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can separate the “inertial acceleration,” — a\ into a contribution from the local 
acceleration of gravity at the detector’s center of mass, — (0^/0x J ) r ; : _ o , plus a 
contribution —a ^ solute due to acceleration of the detector relative to the “absolute 
space” of Newtonian theory. Finally, he can rewrite the Riemann curvature due 
to Newtonian gravity in the familiar form = d 2 <P/dx> dx^. The net result is 


F* = 


+ / J 


absolute 


-G). 


at mass element 


[ total Newtonian force per unit! 
mass acting on mass element J 

[ Newtonian force per unit mass exerted by 1 
adjacent matter and by electromagnetic fields] 

[ inertial force per unit mass due to acceleration 1 
of detector relative to Newtonian absolute space J 

r= -(dt/dx^o-idty/dxtdxW ] (9) 

L = Newtonian gravitational acceleration J 


Conclusion: The equation of motion for a mass element of a mechanical detector, 
when written in the detector’s proper reference frame, has the standard Newtonian 
form (6), with standard Newtonian driving forces (9), plus a driving force due to 
the gravitational waves given by 

(d 2 X>/dt 2 ) duetowaves = waves**' ( 10 ) 


V 


J 


from the metric coefficients of expression (37.2); to do so one would need the 
unknown corrections of 0 (|jc , ’| 2 )- However, one can easily obtain R a $y§ and 7~ ( £ w) 
from the corresponding components in the TT coordinate frame [equations (37.1)] 
by applying the transformation matrix \\dx a /dx^\\. To make the transformation 
trivial, orient the TT coordinate frame so that, to a precision of Ofl/^J) < 1, it 
coincides with the detector’s proper reference frame near the detector’s center of 
mass at the moment of interest, / = t = 0. Then the transformation matrix will be 

dx*/dx i = + 0{h^ v ) + 0(a } x*) + 0{\a\l). (37.4) 

corrections due to relative 
velocity of frames resulting 
_from detector’s acceleration 

The acceleration the detector experiences is typically 

|a| = one “Earth gravity” = 980 cm/sec 2 — l/(light-year). 


corrections due to 
ripples in spacetime 
[caused by waves J 


j redshift 
[c° rr ections_ 
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Description of waves in frame 
of detector 


Explicit form of accelerations 
due to waves 


This analysis is valid only for 
"small" detectors (L < A) 


EXERCISES 


Therefore to enormous precision \\dx a /dx^\\ = ||SjlL and components of tensors are 
the same in the two reference frames: 

RxOxO — ^yOyO = “ ~2 ^ +y ^xOyO ~ ^ybxb ” ” 

nr=nr=- ni w) =-^<4+^Wavg. (37.5) 


[see equation (37.1)]. 

Combining equations (37.3) and (37.5), one obtains for the wave-induced 
accelerations relative to the center of mass of the detector 


r d 2 x \ 

,dt 2 ) l 

i d 2 y\ 


due to waves 


m 


due to waves 


: ~~ Rxbxb* ~~ ^xbyby ~ 2 y )’ 

~“^ibyb9 — Rybio* = y + 



due to waves 


= 0. 


(37.6) 


These expressions, like the equation of geodesic deviation, are valid only over regions 
small compared to one wavelength , Second derivatives of the metric (i.e., the compo¬ 
nents of the Riemann tensor) give a poor measure of geodesic deviation and of 
wave-induced forces over regions of size L > A. Thus, to analyze large detectors 
(L > A), one must abandon the ‘‘local mathematics” of the curvature tensor and 
replace it by “global mathematics”—e.g., an analysis in the 7T coordinate frame 
using the metric components h For an example, see exercise 37.6. 

All detectors of high sensitivity that have been designed up until now (1973) are 
small compared to a wavelength, and therefore can be analyzed using the techniques 
of Newtonian physics and the driving forces of equations (37.6). 

It is useful to develop physical intuition for the driving forces, — Ri^x k , produced 
by waves of various polarizations. Figure 35.2 is one aid to such intuition; Box 37.2 
is another. [The reader may find it interesting to examine, compare, and reconcile 
them!] 


Exercise 37.3. CONNECTION COEFFICIENTS IN ? 

PROPER REFERENCE FRAME 

(a) Calculate for the metric (37.2), ignoring corrections of 0(\x*\). [Answer: Equations 
(13.69) with w* = 0.] 

(b) Calculate R } ^q using the standard formula (8.44), and leaving spatial derivatives of 
the connection coefficients unevaluated because of the unknown corrections of <9(|x ; |) in 
r' a tr [Answer: R> m = j - a>a k .} 

(c) Use the answer to part (b) to evaluate the 0 (|jc*|) corrections to T^. [Answer: 

7^66 = ^(1 + a k x k ) + R’ m x k + 0(U*| 2 ).] 


(37.7) 
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Box 37.2 LINES OF FORCE FOR GRAVITATIONAL-WAVE ACCELERATIONS 


A. Basic Idea 

Consider a plane wave propagating in the f direc¬ 
tion. Discuss it entirely in the proper reference 
frame of a detector. The relative accelerations due 
to the wave are entirely transverse. Relative to the 
center of mass of the detector (origin of spatial 
coordinates) they are 

d 2 x/dt 2 = j (A + x + A x y), 

d 2 y/dl 2 = i-(-A + y + A x x), (1) 

d 2 z/d? 2 = 0. 

Notice that these accelerations are divergence-free. 
Consequently they can be represented by “lines 
of force/’ analogous to those of a vacuum electric 
field. At a value o ft — z where A x = 0 (so polari¬ 
zation is entirely e + ), the lines of force are the 
hyperbolas shown here [sketch (a)]. The direction 
of the acceleration at any point is the direction of 
the arrow there; the magnitude of the acceleration 
is the density of force lines. Since acceleration is 
proportional to distance from center of mass, the 
force lines get twice as close together when one 
moves twice as far away from the origin in a given 
direction. When A + is positive, the arrows on the 
force lines are as shown in (a); when it is negative, 
they are reversed. As \A + \ increases, the force lines 
move in ^toward the origin so their density goes 
up; as \A + \ decreases, they move out toward in¬ 
finity so their density goes down. 

For polarization e x the force lines are rotated 
by 45° from the above diagram. For intermediate 
polarization (values of t — z where A + and A x are 
both nonzero), the diagram is rotated by an inter¬ 
mediate angle [sketch (b)] 


$0 = 2 arc tan 


(a) Force lines for A x = 0, A + > 0 
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Box 37.2 ( continued ) 


B. Three-Dimensional Diagram 

At each value of t — £, the wave-produced accel¬ 
erations have a specific polarization [orientation 
angle £ 0 of sketch (b)] and a specific amplitude 
(density of lines of force). Draw the lines of force 
in a three-dimensional (£,/,£) diagram for fixed 
f. Then as time passes the over-all diagram will 
remain unchanged in form, but will propagate 
with the speed of light in the £ direction. 

Sketch (c) shows such a diagram for righthand 
circularly polarized waves of unchanging ampli¬ 
tude. Note: The authors are not aware of diagrams 
such as these [(a), (b), (c) above] and their use in 
analyzing detector response prior to William H. 
Press (1970). 




Exercise 37.4. WHY THE a • x? 

Explain the origin of the a*x correction in equation (5b) of Box 37.1. [Hint: Take the 
viewpoint of an observer at rest at the spatial origin who watches two freely falling particles 
respond to the inertial force. At time t = 0, put one particle at the origin and the other 
at xK As time passes, the separation of the particles in their common Lorentz frame remains 
fixed; so there develops a Lorentz contraction from the viewpoint of the observer at x } = 0.] 

Exercise 37.5. ORIENTATION OF POLARIZATION DIAGRAM ' 

Derive equation (2) of Box 37.2. 
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Eight types of mechanical 
detectors: 


Figure 37.2 shows eight different types of mechanical detectors for gravitational 
waves. (By “mechanical detector” is meant a detector that relies on the relative 



(a) Oscillations in Earth-moon (b) Normal-mode vibrations 

separation (see exercise 37.7) of earth and moon [see 

Weber (1968)] 



(c) Oscillations in Earth’s 
crust [see Dyson (1969)] 



or hoop, or tuning fork 
[see Douglass (1971)] 





(d) Normal-mode vibrations 
of an elastic bar [see 
Weber (1969) and 
references cited therein] 



(f) Angular accelerations of 
rotating bars [“Heterodyne 
detector”; see Braginsky, 
Zel’dovich, and Rudenko (1969)] 



/ 



(g) Angular accelerations 
of driven oscillators 
[Sakharov (1969)] 


Figure 37.2. 

Various types of gravitational-wave detectors. 



(h) Pumping of fluid in a rotating loop of 
pipe [Press (1970)]. The pipe rotates 
with the same angular velocity as the 
waves; so the position of the pipe in 
the righthand polarized lines of force 
remains forever fixed 
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motions of matter. Nonmechanical detectors are described in §37.9, near end of this 
chapter.) These eight detectors, and others, can be analyzed easily using the force-line 
diagrams of Box 37.2. A qualitative discussion of each of the eight detectors is given 
below. (A full quantitative analysis for each one would entail experimental techni¬ 
calities' for which general relativity is irrelevant, and which are beyond the scope 
of this book. However, some quantitative details are spelled out in §§37.5-37.8.) 


1. The Relative Motions of Two Freely Falling Bodies 

(1) freely falling bodies As a gravitational wave passes two freely falling bodies, their proper separation 

oscillates (Figure 37.3). This produces corresponding oscillations in the redshift and 
round-trip travel times for electromagnetic signals propagating back and forth 
between the two bodies. Either effect, oscillating redshift or oscillating travel time, 
could be used in principle to detect the passage of the waves. Examples of such 
detectors are the Earth-Moon separation, as monitored by laser ranging [Fig. 
37.2(a)]; Earth-spacecraft separations as monitored by radio ranging; and the 
separation between two test masses in an Earth-orbiting laboratory, as monitored 
by redshift measurements or by laser interferometry. Several features of such 
detectors are explored in exercises 37.6 and 37.7. As shown in exercise 37.7, such 
detectors have so low a sensitivity that they are of little experimental interest. 



Figure 37.3, 

Time of round-trip travel between two geodesics responds to oscillations in the 
curvature of spacetime (diagram is schematic only; symbolic of a laser pulse sent 
from the Earth to a comer reflector on the Moon and back at a time when a very 
powerful, long-wavelength gravitational wave passes by; the wave would have to 
be powerful because a direct measure of distance to better than 10 cm is difficult, 
and such precision produces a much less sensitive indicator of waves than the 
vibrations in length [10 -14 cm or less] of a Weber bar; see exercise 37.7). The 
geodesics are curved toward each other in regions where the relevant component 
of the Riemann curvature tensor, call it R- 5 * 5 , has one sign, and curved away from 
each other in regions where it has the opposite sign. The diagram allows one to 
see at a glance the answer to an often expressed puzzlement: Is not any change 
in round-trip travel time mere trumpery flummery? The metric perturbation, 
of the wave changes the scale of distances slightly but also correspondingly changes 
the scale of time. Therefore does not every possibility of any really meaningful 
and measurable effect cancel out? Answer; (1) The widened separation between 
the geodesics is not a local effect but a cumulative one. It does not arise from the 
local value of directly or even from the local value of the curvature. It arises 
from an accumulation of the bending process over an entire half-period of the 
gravitational wave. (2) The change in separation of the geodesics is a true change 
in proper distance, and shows up in a true change in proper time (see “ticks” on 
the world line of one of the particles). See exercise 37.6. Note: When one investi¬ 
gates the separation between the geodesics, not over a single period, as here, but 
over a large number of periods, he finds a cumulative, systematic, net slow bending 
of the rapidly wiggling geodesics toward each other. This small, attractive acceler¬ 
ation is evidence in gravitation physics of the effective mass-energy carried by the 
gravitational waves (see Chapter 35). 
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2. Normal-Mode Vibrations of the Earth and Moon 

A gravitational wave sweeping over the Earth will excite its quadrupole modes of (2) Earth and Moon 

vibration, since the driving forces in the wave have quadrupole spatial distributions 

[see Fig. 37.2(b)]. The fundamental quadrupole mode of the Earth has a period of 

54 minutes, while that of the moon has a period of 15 minutes. Thus, the Earth 

and Moon should selectively pick out the 54-minute and 15-minute components of 

any passing wave train. Section 37.7 will analyze quantitatively the interaction 

between the wave and solid-body vibrations. By comparing that analysis with 

seismometer studies of the Earth’s vibrations, Weber (1967) put the first observational 

limit ever on the cosmic flux of gravitational waves: 

/„ = f flux — < 3 X 10 7 erg cm' 2 sec -1 Hz -1 at v = 3.1 X 10~ 4 Hz. (37.8) 

* a frequency 7 

3. Oscillations in the Earth's Crust 

If the neutron star in a pulsar is slightly deformed from axial symmetry, its rotation (3) Earth's crust 

will produce gravitational waves. The period of the waves is half the period of the 

pulsar (rotation of star through 180° produces one period of waves)—i.e., it should 

range from 0.017 sec for NP0532 (Crab Pulsar) to 1.87 sec. for NP0527. Such a 

wave train cannot excite the 54-minute quadrupole vibration or any of the other 

normal, low-frequency modes of vibration of the Earth. The kind of vibrations it 

can excite allow themselves in principle to be described in the language of normal 

modes. However, they are clearly and more conveniently envisaged as vibrations 

of localized regions of the Earth; or, more particularly, vibrations of the Earth’s 

crust. 

Dyson (1969) has analyzed the response of an elastic solid, such as the Earth, 
to an incident, off-resonance gravitational wave. He shows that the response depends 
on irregularities in the elastic modulus for shear waves, and that it is strongest at 
a free surface [Figure 37.2(c)]. For the fraction of gravitational-wave energy crossing 
a flat surface that is converted into energy of elastic motion of the solid, he finds 
the expression 

(fraction) = ( &7tGp/u: 2 )(s/c) 3 X sin 2 0|cos 0| -1 [1 + cos 2 0 + (^/t?) sin 2 ^]. (37.9) 

Here s and v are the velocities of shear waves and compressive waves, respectively, 
and 9 is the angle between the direction of propagation of the waves and the normal 
to the surface. Considering a flux of 2 x 10 -5 erg/cm 2 sec (an optimistic but con¬ 
ceivable value for waves from a pulsar) incident horizontally (9 = t n/2\ “divergent” 
factor |cos 0]” 1 cancels out in calculation!), and taking s to be 4.5 X 10 5 cm/sec and 
oj to be 6 rad/sec, he calculates that the 1-Hz horizontal displacement produced 
in the surface has an amplitude of | 0 ~~ 2 X 10 -17 cm, too small by a factor of the 
order of 10 5 to be detected against background seismic noise. He points to the 
possibilities of improvements, especially via resonance (elastic waves reflected back 
and forth between two surfaces; Antarctic ice sheet). 
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(4) elastic bar 


4. Normal-Mode Vibrations of an Elastic Bar 

As of 1972, the most often-discussed type of detector is the aluminum bar invented 
by Joseph Weber (1960, 1961) [see Figures 37.1 and 37.2(d)]. Weber’s bars are 
cylindrical in shape, with length 153 cm, diameter 66 cm, and weight 1.4 X 10 6 g. 
Each bar is suspended by a wire in vacuum and is mechanically decoupled from 
its surroundings. Around its middle are attached piezoelectric strain transducers, 
which couple into electronic circuits that are sensitive to the bar’s fundamental 
end-to-end mode of oscillation (frequency v = 1,660 Hz). When a gravitational wave 
hits the bar broadside, as shown in Figure 37.2(d), the relative accelerations carried 
by the wave will excite the fundamental mode of the bar. As of 1972, Weber observes 
sudden, simultaneous excitations in two such bars, one at the University of Maryland, 
near Washington, D.C.; the other at Argonne National Laboratory, near Chicago 
[see Weber (1969, 1970a,b)]. No one has yet come forward with a workable explana¬ 
tion for Weber’s coincidences other than gravitational waves from outer space. 
However, the history of physics is rich with instances where supposedly new effects 
had to be attributed in the end to long familiar phenomena. Therefore it would 
seem difficult to rate the observed events as “battle-tested.” To achieve that con¬ 
fidence rating would seem to require confirmation with different equipment, or under 
different circumstances, or both; experiments to provide that confirmation are now 
(1972) underway. If one makes this tentative assessment, one can be excused for 
expressing at the same time the greatest admiration for the experimental ingenuity, 
energy, and magnificent persistence that Joseph Weber has shown in his more than 
decade-long search for the most elusive radiation on the books of physics. 

Mechanical detectors of the above four types represent systems on which meas¬ 
urements have been made; so practical difficulties and realizable noise levels can be 
estimated properly. In the continuing search for improved methods, more elaborate 
detectors are being studied, and in 1972 one can list a number of interesting pro¬ 
posals, as below. For these it is hard to know how jjiuch development would be 
required in order to achieve the desired performance. 


5. Normal-Mode Vibrations of Elastic Bodies of Other Shapes 

(5) elastic bodies of other The “bar” of a Weber detector need not be cylindrical in shape. For a discussion 
sha P es of a detector with the shape of a hollow square, a hoop, or a tuning fork, see Douglass 

(1971); such a detector might allow its fundamental frequency to be adjusted for 
the most favorable response, with given mass, or given maximum dimension, or both. 
Sections 37.4 and 37.7 and exercises 37.9 to 37.12 analyze in detail the operation of 
a “vibrating-bar” detector of arbitrary shape. See also Douglass and Tyson (1971). 

6. Angular Accelerations of Rotating Bars 

All the potential detectors described thus far respond in the most obvious of manners 
to the tidal accelerations of a gravitational wave: relative distances oscillate in and 


(6) rotating bars 

("heterodyne detector") 
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out. But the tidal accelerations contain, in addition to a length-changing component, 
also a tangential, rotation-producing component. In picture (a) of Box 37.2, the 
length-changing component dominates near the x and^p axes, whereas the rotation- 
producing component dominates half-way between the axes. Vladimir B. Braginsky 
was the first to propose a detector that responds to the rotation-producing accelera¬ 
tions [see Braginsky, Zel’dovich, and Rudenko (1969); Braginsky and Nazarenko 
(1971)]. It consists of two metal rods, oriented perpendicular to each other, and 
rotating freely with angular velocity co 0 in their common plane [see Fig. 37.2(f)]. 
(The rotation is relative to the gyroscopes of the proper reference frame of the 
detector; equivalently, it is relative to the Lorentz frame local to the detector.) Let 
monochromatic gravitational waves of angular frequency co = 2co 0 (change of phase 
per unit of time equals twice the angular velocity at which the pattern of lines of 
force turns) impinge broadside on the rotating rods. The righthand circularly polar¬ 
ized component of the waves will then rotate with the rods; so their orientation in 
its lines-of-force diagram will remain forever fixed. With the orientation of Fig. 
37.2(f), rod A will undergo angular acceleration, while rod B will decelerate. The 
experimenter can search for the constant relative angular acceleration of the two 
rods (constant so long as the angle between them does not depart significantly from 
90°). Better yet, the experimenter can (all too easily) adjust the rotation rate w 0 
so it does not quite match the waves’ frequency to. Then for |to 0 /|co — 2to 0 | rotations, 
rod A will accelerate and B will decelerate; then will follow |to 0 /|to — 2to 0 | rotations 
in which A decelerates and B accelerates, and so on (frequency beating). The 
experimenter can search for oscillations in the relative orientation of the rods. One 
need not worry about the lefthand polarized waves marring the experiment. Since 
they do not rotate with the rods, their angular accelerations average out over one 
cycle. 

Such a device is called a “heterodyne detector” by Braginsky. He envisages that 
such detectors might be placed in free-fall orbits about the Earth late in the 1970’s. 
Heterodyne detectors would work most efficiently for long monochromatic wave 
trains such as those from pulsars; but even for short bursts of waves they may be 
more sensitive than vibrating bars [see Braginsky and Nazarenko (1971)]. 


7. Angular Accelerations of Driven Oscillators 

Andrei D. Sakharov (1969) has proposed a different type of detector for the angular (7) rotation of driven 

accelerations of a gravitational wave. Instead of two rotating bars, it consists of two oscillators 

identical, driven oscillators, initially parallel and nonrotating, but oscillating out of 

phase with each other. Each oscillator experiences angular accelerations in one 

direction at one phase of a passing wave, and in the opposite direction at the next 

phase, but the torques do not cancel out. When the oscillator is maximally distended, 

it experiences a greater torque (acceleration oc length; torque oc length 2 ) than when 

it is maximally contracted. Consequently, if the driven oscillations have the same 

angular frequency as a passing, monochromatic wave, and if the phases are as shown 

in Figure 37.2(g), then oscillator A will receive an angular acceleration in the right- 

hand direction, while B receives an angular acceleration in the lefthand direction. 
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(8) fluid in pipe 


EXERCISES 


8. Pumping of Fluid in a Rotating Loop of Pipe 

A third type of detector that responds to angular accelerations has been described 
by William Press (1970). This detector would presumably be far less sensitive than 
others, and therefore not worth constructing; but it is intriguing in its novel design; 
and it illustrates features of gravitational waves ignored by other detectors. Press’s 
detector consists of a loop of rotating pipe, containing a superfluid. The shape of 
the pipe and its constant rotation rate are chosen so that the gravitational waves 
will pump the fluid around inside the pipe. One conceivable pipe design (a bad 
one to build in practice, but an easy one to analyze) is shown in Fig. 37.2(h). Note 
that use is made of the variation in tidal acceleration along the direction of propaga¬ 
tion of the wave as well as perpendicular to that direction. To analyze the response 
of the fluid to a righthand circularly polarized wave, one can mentally place the 
rotating pipe in the three-dimensional line-of-force diagram of Box 37.2(c). 


Exercise 37.6. RELATIVE MOTION OF FREELY FALLING BODIES AS A 
DETECTOR OF GRAVITATIONAL WAVES 
[see Figures 37.2(a) and 37.3.] 

Consider two test bodies initially at rest with respect to each other in flat, empty spacetime. 
(The case where other, gravitating bodies are nearby can be treated without too much more 
difficulty; but this exercise concerns only the simplest example!) A plane, nearly monochro¬ 
matic gravitational wave, with angular frequency « and polarization e + , impinges on the 
bodies, coming from the — z direction. As shown in exercise 35.5, the bodies remain forever 
at rest in those TT coordinates that constituted the bodies’ global inertial frame before the 
wave arrived. Calculate, for arbitrary separations ( Ax, Ay , Az) of the test bodies, the redshift 
and the round-trip travel time of photons going back and forth between them. Compare 
the answer, for large Ax , Ay, Az, with the answer one would have obtained by using (without 
justification!) the equation of geodesic deviation. Physically, why does the correct answer 
oscillate with increasing separation? Discuss the feasibility and the potential sensitivity of 
such a detector using current technology. * 

Exercise 37.7. EARTH-MOON SEPARATION AS A 
GRAVITATIONAL-WAVE DETECTOR 

In the early 1970’s one can monitor the Earth-moon separation using laser ranging to a 
precision of 10 cm, with successive observations separated by at least one round-trip travel 
time. Suppose that no oscillations in round-trip travel time are observed except those (of 
rather long periods) to be expected from the Earth-moon-sun-planets gravitational interac¬ 
tion. What limits can one then place on the energy flux of gravitational waves that pass 
the Earth? The mathematical formula for the answer should yield numerically 5 

Flux < 10 18 erg/cm 2 sec for 0.3 cycle/sec < v < 1 cycle/day, (37.10a) 

corresponding to a limit on the mass density in gravitational waves of 

Density < 10“ 13 g/cm 3 . (37.10b) 


Why is this an uninteresting limit? 
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§37.4. VIBRATING, MECHANICAL DETECTORS: 

INTRODUCTORY REMARKS 

The remainder of this chapter (except for §37.9) gives a detailed analysis of vibrating, 
mechanical detectors (Earth; Weber bar; “bars” with complex shapes; and so on). 

The details of the analysis and its applications depend in a crucial way on the 
values of two dimensionless numbers: (1) the ratio t gw /t 0 , where 

( characteristic time scale for changes in 
gravitational-wave amplitude and spectrum 

( e-folding time for detector vibrations (in\ 
normal mode of interest) to die out as ; 
a result of internal damping / 


(37.11a) 

(37.11b) 



-N 

The rest of this chapter is 
Track 2. No earlier track-2 
material is needed as 
preparation for it, nor is it 
needed for any later chapter. 

V___/ 


and (2) the ratio E^^JkT, where 


( mean value of detector’s vibration energy (in\ 
normal mode of interest) while waves are , (37.12a) 
passing and driving detector i 


( Boltzman’s\ /detector’s \ 

constant / Vtemperature/ 


( Mean energy in normal mode\ 
of interest when gravitational ]. 
waves are not exciting it 


/ 


(37.12b) 


When t gw > r 0 , the detector views the radiation as having a “ steady flux and it 
responds with steady-state vibrations; when r GW < r 0 (short burst of waves), the 
waves deal a “hammer blow ” to the detector. When E^^^ > kT, the driving force 
of the waves dominates over the detector’s random, internal, Brownian-noise forces 
(“ wave-dominated detector ”); when £' vibration < kT, the driving force of the waves 
must compete with the detector’s random, internal, Brownian-noise forces (“noisy 
detector”). 

Sections 37.5 to 37.7 deal with wave-dominated detectors (E^ ation > kT). The 
key results of those sections are summarized in Box 37.3, which appears here as 
a quick preview (though it may not be fully understandable in advance). Section 
37.8 treats noisy detectors. 

Warning: Throughout the rest of this chapter prime attention focuses on the 
concept of cross section. This is fine for a first introduction to the theory of detectors. 
But cross section is not the entire story, especially when one wishes to study the 
detailed wave-form of the radiation. And sometimes (e.g., for the detector of Figure 
37.2a), it is none of the story. A first-rate experimenter designing a new detector 
will not deal primarily in cross sections any more than a radio engineer will in 
liesighihg~a newTadio telescope. Attention will also focus heavily on the bandwidth 

{continued on page 1022) 


Definitions: "steady flux," 
"hammer-blow waves," 
"wave-dominated detector," 
"noisy detector" 


Design of detectors requires 
much more than the concept 
of cross section 
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Box 37.3 WAYS TO USE CROSS SECTION FOR WAVE-DOMINATED DETECTORS 


A. To Calculate Rate at which 
Detector Extracts Energy from 
a Steady Flux of Radiation 

( t gw ^ T «) 

1. Frequency distribution of radiation arbitrary: 

/steady rate at which detector extracts\ 
\energy from gravitational waves / 



erg/cm 2 sec Hz cm 2 Hz 


2. Frequency spread of radiation small 
compared to line width of detector: 




1 Steady rate at which ^ 
detector extracts energy 
^from gravitational waves y 


: = ®(^source) f fc) * = oF 


3. Frequency spread of.radiation large 
compared to line width of detector: 

/steady rate a.t* which detector extracts^ 
\energy from gravitational waves > 

/"V * r 

^detector) J ^ 

erg/cm 2 sec Hz “resonance 

integral”, cm 2 Hz 
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erg/cm 2 Hz cm 2 Hz, 

‘‘resonance 
integral” 
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waves 


Figure 37.4. 

An idealized detector (vibrator) responding to 
linearly polarized gravitational waves. 


of the antenna, and on other, more detailed characteristics of its response, on 
coupling of the antenna to the displacement sensor, on response characteristics of 
the sensor, on antenna noise, on sensor noise, and so on. For an overview of these 
issues, and for discussions of detectors for which the concept of cross section is useless, 
see, e.g., Press and Thorne (1972). 


Idealized detector: oscillator 
driven by a steady flux of 
monochromatic waves: 

(1) derivation of equation of 
motion 


§37.5. IDEALIZED WAVE-DOMINATED DETECTOR, 

EXCITED BY STEADY FLUX OF 
MONOCHROMATIC WAVES 

Begin with the case of a wave-dominated detector ^ kT) being driven by 

a steady flux of radiation (r GW > r 0 ). Deal at first, not with a solid bar of arbitrary 
shape, but rather with the idealized detector of Figure 37.4: an oscillator made of 
two masses M on the ends of a spring of equilibrium length 2 L. Let the detector 
have a natural frequency of vibration co 0 and a damping time r 0 > l/w 0 , so that 
its equation of motion (in the detector’s proper reference frame) is 


£ -h £/t 0 + driving acceleration. 


(37.13) 


Let gravitational waves of polarization e + and angular frequency w impinge on the 
detector from the —i direction; and let the polar angles of the detector relative to 
the wave-determined i, jp, z-axes be 8 and <f>. 

The incoming waves are described by equations (37.1) with the amplitude 

A x =0, A + = <f + e~ ia(t ~ z) . (37.14) 

(Here and throughout one must take the real part of all complex expressions.) 
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Assume that the detector is much smaller than a wavelength, so that one can set 
zzz z = 0 throughout it. Then the tidal accelerations produced by the wave 


(<Px\ 

Vdt 2 ), 


due to wave 


m. 


due to wave 


= — R io]6 x ’ = 

= - R m x ’ = 


-ju 2 a + e~ ia % 
+ ~cw 2 *? + e“ ia, *j>, 


have as their component along the oscillator 

d 2 i x d 2 x y d 2 y z d 2 z _ 

It* = T OF + T It 2 + T ~dt 2 = 

= —\-^ 2 d,Le ~ iu>i ' sin 2 # cos 2<#>. 
2 + 


—o3 2 a.Le~ iut - 
2 + 



Consequently, the equation of motion for the oscillator is 

| -j- i/r 0 + to 0 2 £ = — y w 2 ^ + Le _lw * sin 2 0 cos 2<|>. (37.15) 


The driving force varies as cos2<j> because of the “spin-2” nature of gravitational 
waves: a rotation through 180° in the transverse plane leaves the waves unchanged; 
a rotation through 90° reverses the phase. The sin 2 0 term results from the transverse 
nature of the waves [one factor of sin 6 to account for projection onto the detector's 
direction], plus their tidal-force nature [another factor of sin 0 to account for (relative 
force) ct (distance in transverse plane)]. 

The straightforward steady-state solution of the equation of motion (37.15) is 


~o^ 2 (f + L sin 2 0 cos 2<j> 

£ — 

to 2 — to 0 2 + /co/t 0 


( 2 ) 

(37.16) 


When the incoming waves are near resonance with the detector, |co ± to 0 | < l/r 0 , 
the oscillator is excited to large amplitude. Otherwise the excitation is small. Focus 
attention henceforth on near-resonance excitations; then equation (37.16) can be 
simplified (note: to 0 is positive, but to may be negative or positive): 

—t o 0 tf + L sin 2 0 cos 2<f> 

£ = -i-(37.16') 

M - w 0 + ySgn(w)//r 0 


One measure of the detector’s usefulness is its cross section for absorbing gravita¬ 
tional-wave energy. The steady-state vibrational energy in a detector with the above 
amplitude and with 2 masses M is 

J- ML 2 u 0 4 a + 2 sin 4 # cos 2 2<f> 

F — 2 • — • A/* (P 2 ) ~ —_ (37 171 

^vibration Z 2 (M - <O 0 ) 2 + (l/2r 0 ) 2 ' 1 * * 


oscillator amplitude as 
function of frequency and 
orientation 
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(3) cross sections for 
polarized radiation 


(4) cross sections for 
unpolarized radiation 


This energy is being dissipated internally at a rate ^vibrationAo- If one ignores 
reradiation of energy as gravitational waves (a negligible process!), one can equate 
the dissipation rate to the rate at which the detector absorbs energy from the 
incoming waves—which in turn equals the “cross section” a times the incoming flux: 


^vibration/ 7 *) ~ 


Jdt = oT 0ziGW) = 


ou 2 a + 2 . 
3277 + 


Consequently, near resonance (|« =t w 0 | < w 0 ), the cross section for interception of 
gravitational-wave energy is 


2t 7ML 2 K 2 A 0 ) sin 4 # cos 2 2(f> 
(M - *o) 2 + (1/2To) 2 


for polarized radiation. (37.18) 


This expression applies to monochromatic radiation. However, experience with many 
other kinds of waves has taught that one often has to deal with a broad continuum 
of frequencies, with the “bandwidth” of the incident radiation far greater than the 
width of the detector resonance (see Box 37.3). Under these conditions, the relevant 
quantity is not the cross section itself, but the “resonance integral” of the cross section, 

J o dv — J o(dco/2'7r) = 277A/L 2 to 0 2 sin 4 # cos 2 2<J>, 
resonance (37.19) 

for polarized radiation. 


Before examining the magnitude of this cross section, scrutinize its directionality 
(the “antenna-beam pattern”). The factor of sin 4 # cos 2 2<J> refers to linearly polarized, 
e + radiation (see Figure 37.4). For the orthogonal mode of polarization, e x , cos 2 2<J> 
is to be replaced by sin 2 2<J>; and for unpolarized (incoherent mixture) radiation or 
circularly polarized radiation, the cross section is the average of these two expres¬ 
sions; thus 


77ML 2 (co 0 2 /t 0 ) sin 4 # 
(M - c 0 ) 2 + (1/2 t 


for unpolarized radiation. 


(37.20) 


Notice that this unpolarized cross section is peaked, with half-width 33°, about the 
equatorial plane of the detector. Averaged over all possible directions of incoming 
waves, the cross section is 


< 0 >all directions = J 0 sin 9 d0 = 


15 


(877/15)A/L 2 ( w 0 2 /r 0 ) 


(37.21) 


for unpolarized radiation. 


(M - « 0 ) 2 + 0 / 2 t 0 ) 2 

One can rewrite the above cross sections in several suggestive forms. For example, 
on resonance, the cross section (37.21) reads 

A >all directions = ^ 2 ^ KToXW 
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Recall that co 0 r 0 defines the “(?” of a detector, l/Q = (fraction of energy dissipated 
per radian of oscillation). Note that 2 t:/w 0 is the wavelength X 0 of resonant radiation. 

Finally, denote by r g = 4Mthe gravitational radius of the detector. In terms of these 
three' familiar quantities, find for the cross section the formula 

directions (cross section for absorbing waves on resonance) 

(2 L) 2 (“geometric” cross section of detector) 

= (4*715)(r /X 0 )Q for un P 0 larized radiation 

g " on resonance. 

This relation holds in order of magnitude for any resonant detector. It shows starkly 
that gravitational-wave astronomy must be a difficult enterprise. How large could 
you make the factor r g /\ 0 , given a reasonable budget? Weber’s 1970 detectors have 
2L elle J ve ss 1 meter, r g ~ (0.74 X 10“ 28 cm/g) X (10 6 g) ~ 10“ 22 cm, v Q = w 0 /2tt = 

1,660 Hz, X 0 a; 200 km, r g /X Q ~ \ x 10" 29 , r 0 ss 20 sec, Q~ 2 X 10 5 ; so that 

^Weber - 3 X 10“ 20 cm 2 on resonance. (3723) 

What flux of gravitational-wave energy would have to be incident to excite a cold Flux required to excite a 
detector (^0° K) into roughly steady-state vibrations with a vibration energy of We ber-type detector 
(Boltzmann’s constant) X (room temperature) ~ 4 X 10“ 14 erg? The vibrator, if 
cooled enough to be wave-dominated, dissipates its energy at the rate £' vi bration/ T o 
^2 X 10" 15 erg/sec. The incident flux has to make up this loss, at the rate 

r[£ w) a - 2 x 10- 15 erg/sec, (37.24a) 

implying an incident flux of the order of 2 X 10~ 15 /3 X 10~ 20 ~ 10 5 erg/cm 2 sec. 

Moreover, this flux has to be concentrated in the narrow range of resonance 

v s *>0 ± 1/4t7t 0 = (1660 ± 0.004) Hz. (37.24b) 

By anybody’s standards, this is a very high flux of gravitational radiation for such 
a small bandwidth (~~ 10 7 erg/cm 2 sec Hz, as compared to the flux of blackbody 
gravitational radiation, §irv 2 kT/c 2 = 3 X 10" 27 erg/cm 2 sec Hz, that would corre¬ 
spond to Planck equilibrium at the same temperature; the large factor of difference 
is a direct reflection of the difference in rate of damping of the oscillator by friction 
and by gravitational radiation). 

Equation (37.22) makes it seem that an optimal detector must have a large Q. A large Q is not necessarily 
This is not necessarily so. Recall that the bandwidth, Aco ~ u 0 /Q, over which the optimal 
cross section is large, decreases with increasing Q. When an incoming steady flux 
of waves of bandwidth Aco co 0 /Q = 1/t 0 and of specific flux 

f/erg/cm 2 sec Hz) 

drives the detector, it deposits energy at the rate 


Magnitude of cross sections 
for any resonant detector 

(3722) 
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( rate of deposit 
of energy 


)-■£-/ 

' resonance 


F v o dv = F„(v 0 ) J a dv. 

-f resonance 


4 


for radiation with 
bandwidth Av > l/r 0 


Consequently, the relevant measure of detector effectiveness will be the integral of 
the cross section over the resonance, fo dv (37.19). (See next section.) This frequency- 
integrated cross section is independent of the detector’s Q , so one must use more 
sophisticated reasoning (e.g., signal-to-noise theory) in deciding whether a large Q 
is desirable. (See §37.8). 


§37.6. IDEALIZED, WAVE-DOMINATED DETECTOR, 

EXCITED BY ARBITRARY FLUX OF RADIATION 


Response of idealized 
detector to an arbitrary, 
non-monochromatic flux: 

(1) derivation 


Let plane-polarized waves of polarization e + but arbitrary spectrum [equation (37.1) 
with A x = 0] impinge on the idealized detector of Figure 37.4. Then the equation 
of motion for the detector is the same as for monochromatic waves [equation (37.15)], 
but with — co 2 ^ + e _ia3( replaced by A + : 


I + £/t 0 + to 0 2 £ = y A + L sin 2 6 cos 2 <j>. 


(37.26) 


[By now one is fully accustomed to the fact that all analyses of detectors (when the 
detector is much smaller than the wavelength of the waves) are performed in the 
proper reference frame, with coordinates t, x,y, z. Henceforth, for ease of eyesight, 
abandon the “hats” on these “proper coordinates,” and denote them as merely t, x, 

y> 

Fourier-analyze the waves and the detector displacement, 


and conclude from 


A+(t) = (27T)- 1 / 2 
ttO = (2 tt) _1/2 
equation (37.26) that 




y u 2 A+L sin 2 6 cos 2<p 

to 2 - w 0 2 + jw/t 0 


(37.27a) 

(37.27b) 


This Fourier amplitude is negligible unless |<o Az w 0 | w 0 ; consequently, without loss 

of accuracy, one can rewrite it as 
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— c JqA+L sin 2 0 cos 2<f> 

? = -j-. (37.28) 

M - + j s S n (^W r o 


[Compare with the steady-state amplitude (37.16').] 

Ask how much total energy is deposited in the, detector by the gravitational waves. 
Do not seek an answer by examining the amplitude of the vibrations, £(/), directly; 
since that amplitude is governed by both internal damping and the driving force 
of the waves, it does not reflect directly the energy deposited. To get the total energy 
deposited, integrate over time the force acting on each mass multiplied by its velocity: 

/total^energyX = f + ” J 1 MA L ^ CQS 2 \ £ d[ 

\ deposited / J_ x j \2 + _7j 

[2 masses]- > r „ T , , L[ velocity of each mass] 

[force on each mass] 

Use Parseval’s theorem (one of the most powerful tools of mathematical physics!) 
to replace the time integral by a frequency integral 

(depos^tTd^) = S f ( ML sin20 cos 2 <p)(-u 2 A + *)(-iul) du. 

% 

Then use equation (37.28) to rewrite this entirely in terms of the wave amplitude 
/total energy\ _ p+” [2tt(u 0 2 /t 0 )ML 2 sin 4 0 cos 2 2<#>~| [ u 2 \A + \ 2 ] A nn ^ 

Vdeposited J ~ J_„ L (|w| - u 0 ) 2 + (l/2r 0 ) 2 JL 16tt J ( } 


The first term in this expression is precisely the cross section for monochromatic 
waves, derived in the last section (37.18). The second term has an equally simple 
interpretation: the total energy that the gravitational waves carry past a unit surface 
area of detector is 


^ (ergs/cm 2 ) = f P™> dt = f j^A + 2 dt 


r ^ 2 Kfl 2 
J 16 IT 



« 2 \A + \ 2 
8 


dv 


(37.30) 


(ParsevaFs theorem again!). Consequently, the energy per unit frequency interval, 
per unit area carried by the waves is 

^(ergs/cm 2 Hz) = — u 2 \A+\ 2 (37.31) 

[for — oo < v < +oo; double this for 0 < v < + oo, a convention we use for the 
rest of this chapter]. This is 2tt times the second term in (37.29). 

Combining equations (37.18), (3729), and (37.31), then, one finds 

(2) answer— 

CTO-/<**>* < 37 - 3j > tasJ-/•*"’ 
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How one can measure 
energy deposited 


This is the total energy deposited, regardless of the spectrum of the waves, and regardless 
of whether they come in a steady flux for a long time, or in a short burst, or in 
any other form. It is perfectly general—so long as the detector is wave-dominated 

(^vibration > kT ) whiIe the waves are driving it. 

How can an experimenter measure the total energy deposited? He cannot measure 
it directly, in general, but he can measure a quantity equal to it: the total energy 
that goes into internal damping, i.e., into “friction.” Energy is removed by “friction” 
at a rate £ vib rauo n / T 0 ' when the vibration energy is much greater than kT (during 
period of wave-dominance). Therefore, the experimenter can measure 

(,o,al energy) = i f in general. (37.33) 

\ deposited / r 0 J 

I—[integrate over the period that bration > kTf 

In the special case of “hammer-blow waves” (r GW = duration of waves < r 0 ), the 
vibration energy is driven “instantaneously from ~~kT to a peak value, ^^bration 
> kT\ and then decays exponentially back to ~~kT\ thus 


/total energy\ = J_ f“ dt = £^ ation (37.34) 

\ deposited / r 0 


for hammer-blow waves. 

When the waves are steady for a long period of time (r GW > r 0 ), with specific 
flux 


£„ = f„/T GW (ergs/cm 2 sec Hz), 

then the energy will be deposited at a constant rate 

( dE/dt ) = (total energy deposited)/r GW ; 

and equation (37.32) can be rewritten » 

(rate of deposit) _ f ^ f or stea dy waves (r GW > r 0 ). (37.35) 
\of energy / J 

Equations (37.32) and (37.35) are the key equations for application of the concept 
of cross section to realistic situations. They are applicable not only to polarized 
radiation, but also to unpolarized radiation and to radiation coming in from all 
directions, if one merely makes sure to use the appropriate cross section {equation 
(37.20) or (37.21) instead of (37.18)]. For examples of their application, see Box 37.3. 


§37.7. GENERAL WAVE-DOMINATED DETECTOR, 

EXCITED BY ARBITRARY FLUX OF RADIATION 

The cross sections of the idealized spring-plus-mass detector can be put into a form 
more elegant than equations (37.18) to (37.21)—a form that makes contact with many 
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branches of physics, and is valid for any vibrating resonant detector whatsoever. 

Introduce the “Einstein ^-coefficients,” which describe the rate at which a unit 
amount of detector energy is lost to internal damping and to reradiation of gravita¬ 
tional waves: 


^diss ; 


f rate at which energy is dissipated internally ^ 
^ energy in oscillations of detector 


^gw — 


_/ rate at which energy is reradiated \ 


=v 


energy in oscillations 




= —, (37.36a) 

> T o 

(37.36b) 


For the idealized detector of Figure 37.4, the standard formula (36.1) for the emission 
of gravitational waves yields 


(power reradiated) = — (37.37) 


(see exercise 37.8). Consequently 

A GW = -}f MLW. (37.38) 

One can use these relations ^to rewrite the detector cross sections in terms of A^ ss , 
A gw , and the reduced wavelength 


A = 1/co 


(37.39) 


of the radiation. For example, the cross section (37.21)—now with co > 0—is 


<*> 


all directions 


= -7 ZA 2 - 


^GW^diss 


(OJ - OJ 0 ) 2 + (A di^/2) 2 


for unpolarized (37.40) 
radiation 


(recall the assumption |w — w 0 | 4. w 0 in all cross-section formulas) and the corre¬ 
sponding integral over the resonance is 


i 


< ° > ail directions for polarized radiation. 


(37.41) 


These expressions for the cross section are comprehensive in their application. They 
apply to any vibrating , resonant , gravitational-wave detector whatsoever , as one sees 
from the “detailed balance” calculation of exercise 37.9, and from the dynamic 
calculations of exercise 37.10. They also apply, with obvious changes in statistical 
factors and notation, to compound-nucleus reactions in nuclear physics (“Breit-Wig- 
ner formula”; see Blatt and Weisskopf, pp. 392-94,408-10,555-59), to the absorption 
of photons by atoms and molecules, to reception of electromagnetic waves by a 
television antenna, etc. Equation (37.41) says in effect, “Calculate the rate at which 
the oscillator is damped by emission of gravitational radiation; multiply that rate 
by the geometric factor familiar in all work with antennas, |t 7A 0 2 , and immediately 
obtain the resonance integral of the cross section. The result is expressed in geometric 


Cross sections reexpressed in 
terms of "Einstein 
4-coefficients" 


Generality of the 4-coefficient 
formalism 
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Scattering of radiation by 
detector 


EXERCISES 


units (cm). To get the resonance integral in conventional units, multiply by the 
conversion factor c = 3 X 10 10 cm Hz. 

The ‘dynamic analysis’ of the idealized masses-on-spring detector, as developed 
in the last section, is readily extended to a vibrating detector of arbitrary shape 
(Earth; Weber’s bar; an automobile fender; and so on). The extension is carried 
out in exercise 37.10 and its main results are summarized in Box 37.4. 

Part of the energy that goes into a detector is reradiated as scattered gravitational 
radiation. For any detector of laboratory dimensions with laboratory damping 
coefficients, this fraction is fantastically small. However, in principle one can envisage 
a larger system and conditions where the reradiation is not at all negligible. In such 
an instance one is dealing with scattering. No attempt is made here to analyze such 
scattering processes. For a simple order-of-magnitude treatment, one can use the same 
type of Breit-Wigner scattering formula that one employs to calculate the scattering 
of neutrons at a nuclear resonance or photons at an optical resonance. A still more 
detailed account will analyze the correlation between the polarization of the scattered 
radiation and the polarization of the incident radiation. The kind of formalism useful 
here for gravitational radiation with its tensor character will be very much like that 
now used to treat polarization of radiation with a spin-1 character. Here notice 
especially the “Madison Convention” [Barschall and Haeberli (1971)] developed by 
the collaborative efforts of many workers after experience during many years with 
a variety of conflicting notations. Considering the way in which the best notation 
that is available today for spin-1 radiation was evolved, one can only feel that it 
is too early to canonize any one notation for describing the scattering parameters 
for an object that is scattering gravitational radiation. 


Exercise 37.8. POWER RERADIATED 

The idealized gravitational wave detector of Figure 37.4*vibrates with angular frequency 
«. Show that the power it radiates as gravitational waves is given by equation (37.37). 

Exercise 37.9. CROSS SECTIONS CALCULATED 
BY DETAILED BALANCE 

Use the principle of detailed balance to derive the cross sections (37.41) for a vibrating, 
resonant detector of any size, shape, or mass (e.g., for the vibrating Earth, or Weber’s vibrating 
cylinder, or the idealized detector of Figure 37.4). [Hints: Let the detector be in thermal 
equilibrium with a bath of blackbody gravitational waves. Then it must be losing energy 
by reradiation as rapidly as it is absorbing it from the waves. (Internal damping can b£ ignored 
because, in true thermal equilibrium, energy loss by internal damping will match energy 
gain from random internal Brownian forces.) In detail, the balance of energy in and out 
reads [with I v = “specific intensity,” equation (22.48)] 

[4ff I v (v ^o)3blackbody ^ all directions 

— 4 ow X (Energy in normal mode of detector). 

Solve for J<a) dv , using the familiar form of the Planck spectrum and the fact that gravita¬ 
tional waves have two independent states of polarization.] Note: Because detailed balance 
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' \ 

Box 37.4 VIBRATING, RESONANT DETECTOR OF ARBITRARY SHAPE 
A. Physical Characteristics of Detector 


1. Detector is a solid object (Earth, Weber bar, automobile fender, . . .) with 
density distribution p(x) and total mass M = Jp d 3 x. 

2. Detector has normal modes of vibration. The / 2 th normal mode is character¬ 
ized by: 

to n = angular frequency; 

( e-folding time for vibration energy \ 
to decay as result of internal damping/ ^ n ’ ^ ' 

u n (x) = eigenfunction (defined here to be dimensionless and real). 

The eigenfunctions u n are orthonormalized, so that 

f pu n -u m d 3 x = MS nm . (2) 


3. During a normal-mode vibration with jE , vibra ti OIl > kT, a mass element origi¬ 
nally at x receives the displacement 

Sx = | = i/ tt (x)9 n e’ iw ^ t/T », (3a) 

^constant amplitude] 

the density at fixed x changes by 

8p = — V • (p« n )$ n e _iw "*“ f/T ", (3b) 

and the moment of inertia tensor oscillates 

8J jk = 4ofk®»r Kf ~ ,/rn - ( 3c ) 

Here I (n)jk is the “ moment of inertia factor for the nth normal mode”: 


AnW ft = f (P«n l ),lX i X* d 3 X 

= f p(ufx k + u n k x>) d 3 x 


(4) 


[dimensions: mass x length, multiply by $ n (length) to get I jk ], 

The corresponding '‘‘‘reduced quadrupole factor for the nth normal mode" is 
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Box 37.4 (continued) 

B. Cross Sections for Detector (exercise 37.10) 

1. For polarized radiation with propagation direction n and polarization ten¬ 
sor e: 

h ik = A(t - n- x)e jk , 

e ik n k = °. e ,j = o. e ik e ik = 2 ; ( 6 ) 

4 Af (|w| - w B )- + (1/2 t b )- 

/ <V* = < 7b ) 

resonance J 1 

2. For unpolarized radiation (random mixture of polarizations) with propaga¬ 
tion direction u, cross sections are 


, x , « X * 

°n( v ) = °„( w /2w) = —-77— 


“>„ 2 A» 


4 M 0«l - “n) 2 + (1/2 t„) 2 ’ 

„ _ i<a>L 2 


O dv = - v (n) ^-co 2 

. " 4 M 


Here is the transverse-traceless part of i (n)jk (transverse and traceless 
relative to the propagation direction n ): 

= P^P ~\P trace (P* w ), P* = 6 ; . fc - (9) 


(See Box 35.1) 

3. Cross sections for unpolarized radiation , averaged over all directions , are 


/ . vv _ J_ *2 _ ^GW^diss _ 

all directions ” 2 (Jco| - 0 > n ? + 


f (°n) all directions ^ — 2 77 ^^ GW ’ 


where the Einstein A coefficients are 


^diss ^/ T n’ 

A — ^ (4i);fc) 2 4 

GW ” 5 A# " * 
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C. Spectrum Radiated by an Aperiodic Source (exercise 37.11) 

It is instructive to compare these formulas with expressions for the radiation 
emitted by an aperiodic source. 

1. Fourier-analyze the reduced quadrupole factor of the source 

*ik (0 = (2w)- 1/2 f 7 jk (u)e~ iut dio. 

— 00 

2. Then the total energy per unit frequency (v > 0) radiated over all time, into 
a unit solid angle about the direction n, and with polarization tensor e, is 

dvdtt = J 2 \*jk e ik\ 2 “ 6 (13a) 

W — ±2*77 V 



[compare with equations (7)]. Summed over polarizations, this is 


* 


IQ 2 2 ^ 

j,k 


dE 
dv dS2 2 


TT|2 w 6 


(13b) 


[compare with equations (8)], Here v > 0. 

3. The total energy radiated per unit frequency, integrated over all directions, 
still with v > 0, is 

dE/dv = ^ 2 l^l 2w6 (14) 

i,k 


^_[compare with equations (10)—(12)]. 




can be applied to any kind of resonant system in interaction with any kind of thermal bath 
of radiation or particles, equations (37.40) and (37.41), with appropriate changes of statistical 
factors, have wide generality. 

Exercise 37.10. NORMAL-MODE ANALYSIS OF VIBRATING, 

RESONANT DETECTORS 

Derive all the results for vibrating, resonant detectors quoted in Box 37.4. Pattern the 
derivation after the treatment of the idealized detector in §37.6. [Guidelines: (a) Let the 
detector be driven by the polarized waves of equation (6), Box 37.4; and let it be wave-dom¬ 
inated (F^ ibralion > kT). Show that the displacements Sx = £(*, 0 of its mass elements are 
described by 


£ = S B n(‘)u n (x), 


(37.42a) 
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where the time-dependent amplitude for the nth mode satisfies the driven-oscillator equation 


K + Khn + «.**„ - *„('), 


(37.42b) 


and where the curvature-induced driving term is 


rt,(0 = - Rji)k<> f (p/M)uJx k d 3 x 


(37.42c) 


(See Box 37.4 for notation.) 

(b) Fourier-analyze the amplitudes of the detector and waves, 


B n (t) = (2*r w °-f + du, 

— X 

r +ac - 

A(t) = (2 tt)- 1/2 J 


(37.42d) 

(37.42e) 


and solve the equation of motion (37.42b,c) to obtain, in the neighborhood of resonance, 

\°>nA* {n)jk e jk )/M 

B„ =- T - for |<o ± o) | < <o . (37.42f) 


l«l - + J l/ T n 


(c) Calculate the total energy deposited in the detector by integrating 

(deposited) = / ^ orce P er un ^ v0 ^ ume ) * (Velocity) d 3 x dt . 

Thereby obtain 

/energy deposited in\ = 1 (/ } f dl 

\rtth normal mode I 4 v ( ; ) ; J 

(d) Apply Parseval’s theorem and combine with expression (37.42f) to obtain 

/energy deposited in\ = r (37.43) 

\«th normal mode / J n 


where o n is given by equation (7a) of Box 37.4, and (for — oo < w < +oo) 

l J v (v) — f^,(w/277) = ^-w 2 |^| 2 . § (37.44) 

(e) Show that ^(v) is the total energy per unit area per unit frequency carried by the 
waves past the detector. 

(f) Obtain all the remaining cross sections quoted in Box 37.4 by appropriate manipula¬ 
tions of this cross section. Use the mathematical tools for projecting out and integrating 
“transverse-traceless parts,” which were developed in Box 35.1 and exercise 36.9. 

Exercise 37.11. SPECTRUM OF ENERGY RADIATED BY A SOURCE 

Derive the results quoted in the last section of Box 37.4. 
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Exercise 37.12. PATTERNS OF EMISSION AND ABSORPTION 

The elementary dumbbell oscillator of Figure 37.4, initially unexcited, has a cross section 
for absorption of unpolarized gravitational radiation proportional to sin 4 #, and when excited 
radiates with an intensity also proportional to sin 4 # (Chapter 36). The patterns of emission 
and absorption are identical. Any other dumbbell oscillator gives the same pattern, apart 
from a possible difference of orientation. Consider a nonrotating oscillator of general shape 
undergoing free vibrations in a single nondegenerate (and therefore nonrotatory) mode, or 
excited from outside by unpolarized radiation. 

(a) Show that its pattern of emission is identical with its pattern of absorption. [Hint: Make 
the comparisons suggested in the last few parts of Box 37.4.] 

(b) Show that this emission pattern (= absorption pattern), apart from three Euler angles 
that describe the orientation of this pattern in space, and apart from a fourth parameter 
that determines total intensity, is uniquely fixed by a single (“fifth"’) parameter. 

(c) Construct diagrams for the pattern of intensity for the two extreme values of this 
parameter and for a natural choice of parameter intermediate between these two extremes. 

(d) Define the parameter in question in terms of a certain dimensionless combination of 
the principal moments of the reduced quadrupole tensor. 


Exercise 37.13. MULTIMODE DETECTOR 

Consider a cylindrical bar of length very long compared to its diameter. Designate the 
fundamental mode of end-to-end vibration of the bar as “w = 1,” and call the mode with 
n — 1 nodes in its eigenfunction the “wth mode.” Show that the cross section for the inter¬ 
ception of unpolarized gravitational waves at the nth resonance, integrated over that reso¬ 
nance, and averaged over direction, is given by the formula [Ruffini and Wheeler (1971b)] 


37 / > 2 M 

o(v)dv = ‘ c 2 n 2 f° r n °dd ( zer0 f° r even n )> (37.45) 

resonance; 

random 



where v is the speed of sound in the bar expressed in the same units as the speed of light, 
c; and M is the mass of-the bar (geometric units; multiply the righthand side by the factor 
G/c = 2.22 x 10“ 18 cm 2 Hz/g when employing conventional units). Show that this expression 
gives Jc dv = 1.0 X 10 -21 cm 2 Hz for the lowest mode of Weber’s bar. Multimode detectors 
are (1973) under construction by William Fairbank and William Hamilton, and by David 
Douglass and John A. Tyson. 


Exercise 37.14. CROSS SECTION OF IDEALIZED MODEL OF EARTH FOR 
ABSORPTION OF GRAVITATIONAL RADIATION 

The observed period of quadrupole vibration of the earth is 54 minutes [see, e.g., Bolt (1964) 
or Press (1965) for survey and bibliography]. To analyze that mode of vibration, with all 
due allowance for elasticity and the variation of density in the earth, is a major enterprise. 
Therefore, for a first estimate of the cross section of the 'earth for the absorption of quadrupole 
radiation, treat it as a globe of fluid of uniform density held in the shape of a sphere by 
gravitational forces alone (zero rigidity). Let the surface be displaced from r = a to 


r — a + aaP 2 ( cos #), 


(37.46a) 


where # is polar angle measured from the North Pole and a is the fractional elongation 
of the principal axis. The motion of lowest energy compatible with this change of shape 
is described by the velocity field 


1 

i = - -rax, 

(zero divergence, zero curl). 



V = oz 


(37.46b) 
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(a) Show that the sum of the kinetic energy and the gravitational potential energy is 

E = -(3/5)(A* 2 A0(l - « 2 /5) + (3/20)A/tf 2 a 2 . (37.46c) 

(b) Show that the angular frequency of the free quadrupole vibration is 

« = (16tt/15) 1/2 p 1/2 . (37.46d) 

(c) Show that the reduced quadrupole moments are 

i xx = i yy = — Ma 2 <x/5, i zz = 2Ma 2 a/5. (37.46e) 

(d) Show that the rate of emission of vibrational energy, averaged over a period, is 

-< dE/dt > = (3/125)M 2 a 4 w 6 a peak 2 . (37.46f) 

(e) Show that the exponential rate of decay of energy by reason of gravitational wave 
damping, or “gravitational radiation line broadening,” is 

A q „ = (4/25)A/a 2 a> 4 . (37.46g) 

(f) Show that the resonance integral of the absorption cross section for radiation incident 
from random directions with random polarization is 

J <ct(j')> dv = (v/2)r-A GW = (2v/25)Ma 2 /A 2 . (37.46h) 

resonance 

(g) Evaluate this resonance integral. Note: This model of a globe of fluid of uniform 
density would imply for the earth, with average density 5.517 g/cm 3 , a quadrupole vibration 
period of 94 min, as compared to the observed 54 min; and a moment of inertia ( 2/5)Ma 2 
as compared to the observed 0.33 A/a 2 . Ruffini and Wheeler (1971b) have estimated correction 
factors for both effects and give for the final resonance integral ^5 cm 2 Hz. 


§37.8. NOISY DETECTORS 


Extraction of small signal 
from large noise—general 
remarks 


When the bandwidth of the incoming waves is large compared to the resonance 
width of the detector, the waves deposit a total energy in the detector given by 

(total energy deposited) = f <3& v dv = sr„o = "o) / o dv. 

[ergs]J [erg cm -2 Hz -1 }^ [cm 2 Hz}^ 

At least, this is so if the detector is wave-dominated (i.e., if £^^011 ^ W w ^ e 
waves act; i.e., if initial amplitude of oscillation, produced by Brownian forces, is 
too small to interfere constructively or destructively with the amplitude due to waves). 

Unfortunately, all experiments today (1973) are faced with noisy detectors. Nobody 
has yet found waves so strong, or constructed a detector so sensitive, that the detector 
is wave-dominated. Consequently, a key experimental task today is to pick a small 
signal out of large noise. Many techniques for doing this have been developed and 
used in a variety of fields of physics, as well as in astronomy, psychology and 
engineering [see, e.g., Davenport and Root (1958), Blackman and Tukey (1959), and 
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Figure 37.5. 

Detection of hammer-blow gravitational waves with a noisy detector. Detection of 
even a weak pulse is possible if the time of the pulse is short enough. The amplitude 
$ n of the detector’s vibrations changes by an amount ~~$ r n ms {At/r n ) U2 during a 
time interval At, due to thermal fluctuations (random-walk, Brownian-noise forces). 
Depicted in the inset is a change in amplitude produced by a burst of waves of 
duration r GW arriving out of phase with the detector’s thermal motions (energy 
extracted by waves!). The waves are detectable because 

4«Jdu.towav.. > (WO' /2 . 

even though 4|J8J 4 . S8™ s . 


references given there]. The key point is always to find some feature of the signal 
that is statistically more prominent than the same feature of the noise, plus a 
correlation to show that it arises from the expected signal source and not from 
elsewhere (“protection from systematic error”). Thus to detect steady gravitational 
waves from a pulsar, one might seek to define very precisely two numbers {N 2 ) 
and <( N 4* S ) 2 > = (N 2 ) + ( S 2 >, where N and S are the noise and signal amplitudes 
respectively. A long series of observations (with the pulsar out of the antenna beam) 
gives one value of (N 2 ). Another equally long series of observations, interspersed 
with the first series, will be expected in zeroth approximation to give the same value 
of (N 2 ). In the next approximation one recognizes and calculates the influence of 
normal statistical fluctuations. In an illustrative example, theory, confirmed by 
statistical tests of other parameters drawn from the same data, guarantees that the 
fluctuations are less than lO -5 ^ 2 ) with 95 per cent confidence (only 5 per cent 
chance of exceeding lO -5 ^ 2 ); this limit is set by time and money, not by absolute 
limitations of physics). Let the second series of observations be carried out only at 
times when the pulsar is in the antenna beam. Let it give 


{<V2> + <S 2 >}, dseries = (1 + 7.3 X 10- 5 ){<V 2 >} lstserie , 


JThen in first approximation one can say that (S 2 ) lies with 95 per cent confidence 
in the limits (7.3 ±: 1.0) X lO -5 ^ 2 ). 

Many conceivable sources of gravitational radiation produce bursts rather than 
a steady signal strength (Figure 37.5). Thus one is led to ask in what features 
“hammer-blow radiation” (t Gw < t 0 ) differs from noise. The “Brownian motion” 
noise in the detector may be thought of as arising from large numbers of small 


Rate-of-change of detector 
amplitude as a tool for 
extracting burst signals from 
thermal noise 
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(molecular) energy exchanges with a heat bath. The calculations below estimate the 
typical rate of change of amplitude that a series of such molecular “knocks” can 
produce in a detector, and compare it with the rapid amplitude change produced 
by a “hammer-blow” pulse of radiation. The calculations show that sudden thermally 
induced changes, even of very small amplitude, are rare. Thus sudden changes are 
a suitable feature for the observations to focus on. The actual detection of pulses 
requires a more extended analysis, however, which goes beyond the estimates made 
below. Such an analysis would calculate the probabilities that rare events (sudden 
changes in amplitude) occur by chance (i.e., due to thermal fluctuations) in specified 
periods of time, the still smaller probabilities that they occur in coincidence between 
two or more detectors, and the correlations with postulated sources. 

Consider a realistic detector of the type described in Box 37.4. But examine it 
at a time when it is not radiation-dominated. Then its motions are being driven 
by internal Brownian forces (thermal fluctuations), and perhaps also by an occasional 
burst of gravitational waves. Focus attention on a particular normal mode (mode 
“h”), and describe that mode’s contribution to the vibration of the detector by the 
vector field 


&c = { = ^ n {t)e-^u n {xy (37.47) 

Since u n is dimensionless with mean value unity ( fpu n 2 d 3 x = M ), the complex 
number c $ n (t) is the mass-weighted average of the amplitudes of motion of the 
Description of thermal noise detector’s mass elements. This amplitude changes slowly with time (rate < co n ) as 
in resonant detector a resu it of driving by Brownian forces; but averaged over time it has a magnitude 

corresponding to a vibration energy of kT: 

< ^vibration) = 2 (j M 2 d*x) = \ Mo„ 2 < 1«J 2 > = kT, (37.48) 


i.e., 


= <l$ n l 2 > 1/2 = (2kT/Mu n 2 ) 1/2 . (37.49) 

Example: for Weber’s detector (M ~~ 10 3 kg, co 0 — 10 4 /sec), the fundamental mode 
at room temperature has 


= I 


2 X 1.38 X 10~ 16 X 300 e, 8 y/» 
10 6 g X 10 8 sec -2 / 


(37.50) 

I 


One’s hope for detecting weak hammer-blow radiation lies not in an examination 
of the detector’s vibration amplitude (or energy), but in an examination of its rate 
of change (Figure 37.5). The time-scale for large Brownian fluctuations in amplitude 
(\A$ n \ — 38™), when the detector is noisy, is the same as the time scale r n for internal 
forces to damp the detector, when it is driven to E^ VbeBtian ^ Thus, the amplitude 
<$ n does a “random walk ” under the influence of Brownian forces , with the mean time 
for “large walks ” (|4#J — ®£ ms ) being At s v The change in over shorter times 
At is smaller by the “1 /y/N factor,” which always enters into random-walk processes: 
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_ / number of vibration cycles in time r n \ 1/z _ / r n \ 1/2 

\ number of vibration cycles in time At) ~ \ At) ’ ^ ’ 

< |^(thermal)| > ~ <grms l&\ n = /_2kT_V /2 /At\^ 2 during _ (3? 52) * 

\ r n / \ Mo n 2 / \ r n / time At 

Now suppose that “hammer-blow” radiation (burst of duration At = r GW < r n ) 
strikes the detector, producing a change in the detector’s amplitude. This 

change in amplitude, because it comes so quickly, (1) superposes linearly on any 
change in amplitude produced in the same time interval by the action of Brownian- 
motion forces; and (2) is therefore independent in value of the presence or absence 
of Brownian-motion forces, i.e., independent of all thermal agitation. Therefore 
A$\ p w) (a quantity with both magnitude and phase!) is identical to what it would 
have been if the detector were at zero temperature: 


|mo 2 |AB<?w>|2 

energy that would 
be deposited if 
detector were at 
zero temperature 


/ 


a n(")^0) dv = ^„(to n /2w) f a n 0) dr, 

_ i_ _ 

For hammer-blow radiation, bandwidth of 
radiation is always > bandwidth of detector; 
see Box 37,4 


i 


i 


i.e., 


|4g«3W>| _ dv \ /2 

" v Mu n 2 ) 


(37.53) 


This wave-induced change in amplitude will be distinguishable from thermal changes 
only if it is significantly bigger than the thermal changes (37.52) expected during 
the same length of time t gw : 


| A0^ w) | > < | A$ { £ herm ^ | ) during time t gw 


equivalently: F v (u n /2m) > 



criteria for 
detectability 


(37.54) 


Of course, if one is equipped only to measure the magnitude of the detector’s 
amplitude or energy, and not its phase, these criteria for detectability are not quite 
sufficient. The wave-induced change in squared amplitude (proportional to change 
in energy) will depend on the relative phases of the initial amplitude and amplitude 
change 


* For a fuller derivation and discussion of this formula, see, e.g., Braginsky (1970). Two key points 
covered there are: (1) a statistical version of the formula, which describes the probability that in time 
At the amplitude will change by a given amount, from a given initial value; and (2) quantum-mechanical 
corrections, which come into play in the limit as r n —+• oc, but which are unimportant for detectors 
of the early 1970’s. 


Effect of a burst of waves 
a noisy, resonant detector 


Criteria for detectability of 
burst 
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Ways to improve sensitivity 
of detector 


Non-mechanical detectors 


The future of 

gravitational-wave astronomy 


zl |$ n | 2 = |# ( n initial) 4 - 

zr 2|fy?5j nitial) || Jf’/^ GW) | 

~0 

— -2|f/^ Ilitial) P^ ( n GW> | 

Thus, only a burst that arrives in phase with the initial motion of the detector or 
with reversed phase will be measurable. But for such a burst, the criteria (37.54) 
are sufficient. 

Equations (37.54) make it clear that there are three ways to improve the sensitivity 
of vibratory detectors to hammer-blow radiation: (7) increase the detector's integrated 
cross-section [which can be done only by increasing the rate ^ Gw at which it reradiates 
gravitational waves; see equations (10b) and (lib) of Box 37.4]; (2) cool the detector ; 
(3) increase the detector's damping time. 

Box 37.5 applies the above detectability criteria to some detectors that seem 
feasible in the 1970’s, and to some bursts of waves predicted by theory. The conclu¬ 
sions of that comparison give one hope! 

To be complete, the above discussion should have analyzed not only noise in the 
detector, but also the noise in the sensor which one uses to measure the amplitude 
of the detector’s displacements. However, the theory of displacement sensors is 
beyond the scope of this book. For a brief discussion and for references, see Press 
and Thorne (1972). 

§37.9. NON-MECHANICAL DETECTORS 

When gravitational waves flow through matter, they excite it into motion. Such 
excitations are the basis for all detectors described thus far. But gravitational waves 
interact not only with matter; they also interact with electromagnetic fields; and 
those interactions can also be exploited in detectors. One of the most promising 
detectors that may be built in the future, one designed by Braginsky and Menskii 
(1971), relies on a resonant interaction between gravitational waves and electro¬ 
magnetic waves. It is described in Box 37.6. 

§37.10. LOOKING TOWARD THE FUTURE 

As this book is being written, it is not at all clear whether the experimental results 
of Joseph Weber constitute a genuine detection of gravitational waves. (See §37.4, 
part 4.) But whether they do or not, gravitational-wave astronomy has begun, and 
seems to have a bright future. The technology of 1973 appears sufficient for the 
construction of detectors that will register waves from a star that collapses to form 
a black hole anywhere in our galaxy (Box 37.5); and detectors of the late 1970’s 
and early 1980’s may well register waves from pulsars and from supernovae in other 
galaxies. The technical difficulties to be surmounted in constructing such detectors 
are enormous. But physicists are ingenious; and with the impetus provided by Joseph 
Weber’s pioneering work, and with the support of a broad lay public sincerely 
interested in pioneering in science, all obstacles will surely be overcome. 


_ |^(initial)j2 

if in phase 

if phase difference is ±tt/ 2 
if phase difference is t n. 


(37.55) 
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Box 37.5 DETECTABILITY OF HAMMER-BLOW WAVES 

FROM ASTROPHYSICAL SOURCES: TWO EXAMPLES 
(The following calculations are accurate only to 
within an order of magnitude or so) 

A. Waves from a Star of Ten Solar Masses Collapsing to Form 
a Black Hole; 1972 Detector with 1975 (?) Sensor 

1. Predicted characteristics of radiation: 


(intensity at Earth) = W v • 




47r(distance) 2 y 

~(2 x 10 5 ergs/cm 2 Hz) [(distance to center of galaxy)/(distance)] 2 , 
(frequency of waves) = v ~~ 10 3 Hz, 

(bandwidth of waves) = Av ~ 10 3 Hz, 

(duration of burst) = r GW — 10~ 3 sec to 10“ 1 sec. 

2. Detector properties: A Weber bar, vibrating in its fundamental mode, with 

i 

M = 10 6 g, Jo dv = 10“ 21 cm 2 Hz (exercise 37.13), 

v 0 = u>J2ni — 1,660 Hz, f=3K (liquid Helium temperature), 
r 0 — 20 seconds, 

rms = ^ x 1.37 X 10- 16 X 3 erg \ 1/2 _ 




3 x 10 15 cm, 


10 6 g x 10 s sec -2 / 

|j^(thermaD| = (3 x 10' 15 cm)( 10 ” 3 / 20) 1/2 = 2 x 10" 17 cm, 

during At = 10 -3 sec, 

ij^Uhermai*! _ 2 x 10“ 16 cm, during At = 0.1 sec. 

3. Effect of waves [equation (37.53)]: 


|Jsjj ( Gw)| _ ^ x 2 X 10 5 X 10 21 ergsy /2 [ center of Galaxy 


10 6 x 10 8 sec - 


7 


distance 


= 2 x 10“ 15 cm 


distance to 
center of Galaxy 
\ distance 


4. Conclusion: Gravitational waves from a massive star collapsing to form a 
black hole anywhere in our galaxy are readily detectable, if one can construct 
a “ sensor ” to measure changes in vibration amplitudes of magnitude 
<10“ 15 cm on time scales <0.1 seconds. This does appear to be feasible 
with 1972 technology; see Press and Thorne (1972). 
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Box 37.5 (continued) 


B. Waves from a Supernova Explosion in the Virgo Cluster of 

Galaxies; a Detector that might be constructable by late 1970's 
or early 1980's 

1. Predicted characteristics of radiation: 

0.03 

(intensity at Earth) = F v ~— 

-4 X 10 -3 ergs/cm 2 Hz, 

(frequency of waves) = v ~ 10 3 Hz, 

(bandwidth of waves) — v — 10 3 Hz, 

(duration of burst) = t gw — 0.3 sec, or t gw 2 X 10 3 sec.* 

2. Detector: A Weber-type bar made not of metal, but of a 1,000-kg monocrys¬ 
tal of quartz, cooled to a temperature of 3 x 10 -3 K. (For such a monocrys¬ 
tal, it is thought that the damping time would increase in inverse proportion 
to temperature, r 0 cc 1/F.) Estimated properties of such a detector: 

M ~ 10 6 g, j odv = 10" 21 cm 2 Hz (same as for Weber bar), 
v 0 = co 0 /2tt - 1,500 Hz, T= 3 X 10~ 3 K; 


r 0 — 10 6 sec, 


^rms _ ^ 


2 X 1.37 X 10" 16 X 3 X 10~ 3 erg 


10 6 g X 10 8 sec' 


| ^(thermal) | = (} X IQ -16 cm) | 


>.3 or 2 X 10 3 


y/2 


y/2 


1 X 10“ 16 cm, 


6 x 10 -20 cm, 


5 X 10“ 21 cm. 


3. Effect of waves [equation (37.53)]: 


1 « ( 0 GW) | = ( 


2 X 4 X 10" 3 X 10~ 21 ergs 
10 6 X 10 8 seer 2 


y/2 


3 X 10- 19 cm. 


4. Conclusion: Gravitational waves are detectable from a supernova in the 
Virgo cluster, if one can construct a sensor to measure changes in vibration 
amplitudes of magnitude < 10~ 19 cm on time scales of <0.1 seconds; and 
if one can construct a detector with the above characteristics. 

*For the duration of waves from a supernova explosion, two time scales appear to be relevant: (1) the 
time required for the final stages of the collapse of the white-dwarf core to a neutron star or a neutron-star 
pancake, t — (dimensions of neutron star)/(speed of sound in nuclear matter) —■ 2 x 10 3 sec ( pulse 
of gravitational radiation”); and (2) the time required for a vibrating neutron star to lose its energy 
of vibration by gravitational radiation (“damped train of waves”), t — 0.3 sec. 
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Box 37.6 A NONMECHANICAL DETECTOR OF GRAVITATIONAL WAVES 
[Braginsky and Menskii (1971)] 


The Idea in Brief 

(see diagram at right) 

A toroidal waveguide contains a monochromatic 
train of electromagnetic waves, traveling around 
and around it. Gravitational waves propagate per¬ 
pendicular to the plane of the torus. If the circuit 
time for the EM waves is twice the period of the 
gravitational waves, then one circularly polarized 
component of the gravitational waves will stay 
always in phase with the traveling EM waves. 
Result: a resonance develops. In one region of the 
EM wave train, gravitational tidal forces always 
“push” the waves forward {blue shift!) in another 
region the tidal forces “push” backward {red 
shift!). An EM frequency difference builds up 
linearly with time; a phase difference builds up 
quadratically. 


Outline of Quantitative Analysis 

1. Let waveguide fall freely in an Earth orbit. 
Orient axes of waveguide’s proper reference 
frame (= local Lorentz frame) so (1) wave¬ 
guide lies in i,^-plane, and (2) gravitational 
waves propagate in z direction. 

2. Let gravitational waves have amplitudes 

A + - iA x = (1) 

[Recall: t ~ r, z zr z; i.e., proper frame and TT 
coordinates almost agree.] Then in plane of 
waveguide (z = 0), 

R im = ~ R iw = 4 « 2 ^ G0S<cjr) 
-— ( 2 ) 

RxOyO — RyOxb S * n 

3. Consider two neighboring parts of the 
EM wave, one at <p = a + \ut\ the other at 
$ = a -f* 8a -f* Treat them as photons. 
Each moves along a null geodesic, except for 



w = (angular frequency of gravitational waves) = (rate 
of change of phase of waves with time) = (two times 
angular velocity with which pattern of “lines of force” 
rotates) 


r = (radius of torus), is adjusted so the speed of propa¬ 
gation of EM waves in waveguide is v = Jcor. 




[EM waves propagate counterclockwise; gravitational 
line-of-force diagram rotates counterclockwise; they 
stay in phase.] 
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Box 37.6 (continued) 

the deflective guidance of the wave guide. Thus, 
their wave vectors k satisfy 

y k _ /deflective “acceleration”^. 
k Vof waveguide / 

and the difference 8k = between the wave 
vectors of the two parts of the wave (difference 
measured via parallel transport) satisfies the 
equation 

V k dk = V k V n k = [V k ,V n ]k + V n \k (4) 
= Riemann (... , Ac, Ac, n) + V„ V k k 

[deflective acceleration of wave guide}-^ 

The waveguide influences the direction of 
propagation of the waves, but not their fre¬ 
quency. Thus only Riemann enters into the 0 
component of the above equation: 

**«**,& = R\fiyk*k*ny. (5) 

4. Let k® = be the angular frequency of the 
electromagnetic wave. The direction of the 
space component k of the propagation 4-vector 
is along the purely spatial vector n; so 

k ® k = ( vu e /r8a)n , n® = 0. (6) 

Use these relations to rewrite equation (5) as 

(d moving with photons 

= (vuJr8a)R b i$n l nk (7) 

5. Combine the expression for n in the spacetime 
diagram with equations (2) and (7), and with 
the world line <p = a + fat for the photons, to 
obtain 


(d 8u e /dt) 


moving with photons 

= — vco e cc 2 dr (cos 2a) 6a. (8) 


6. Integrate over time and over a to obtain 
= " eo [ 1 - J (sin 2a)(ur)(at) j. 




PART 


EXPERIMENTAL TESTS 

OF GENERAL RELATIVITY 

* 

Wherein the reader is tempted by a harem of charming 
gravitation theories {and some not so charming ), is saved from 
his foolish passions by an army of experiments , cleaves unto his 
faithful spouse, Geometrodynamics , vows to lead an honest life 
hereafter, and becomes a True Believer. 




CHAPTER 


TESTING THE FOUNDATIONS 

OF RELATIVITY 

Provan do e riprovando 
(Verify the one and disprove the other) 

GALILEO 


§38.1. TESTING IS EASIER IN THE SOLAR SYSTEM 
THAN IN REMOTE SPACE 

For the first half-century of its life, general relativity was a theorist’s paradise, but 
an experimentalist s hell. No theory was thought more beautiful, and none was more 
difficult to test. 

The situation has changed. In the last few years general relativity has become 
one of the most exciting and fruitful branches of experimental physics. A half-century 
late, the march of technology has finally caught up with Einstein’s genius—not only 
on the astronomical front, but also in laboratory experiments. 

On the astronomical front, observers search for phenomena in which relativity Clean tests of general 
is important, and study them: cosmology, pulsars, quasars, gravitational waves, black relativity are currently 
holes. Unfortunately, in pulsars and quasars, and in the sources of cosmological con,ined ,0 solar system 
radiation and gravity waves, gravitational effects are tightly interwoven with the local 
hydrodynamics and local plasma physics. There is little hope of separating the several 
effects sufficiently sharply to get clean tests of the nature of gravity. Instead, astro¬ 
physicists must put the laws of gravity into their calculations along with all the other 
laws of physics and the observational data; and they must then seek, as output, 
information about the doings of matter and fields “way out there.” 

Thus, for clean tests of general relativity one turns to the laboratory—but to a 
laboratory that is much larger today than formerly: a laboratory that includes the 
entire solar system. 
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Capabilities of technology in In the solar system all relativistic effects are tiny. Nonetheless, some of them are 
1970 s measurable with a precision, in the 1970’s, of one part in 1,000 of their whole 

magnitude or better (see Box 38.1). 


§38.2. THEORETICAL FRAMEWORKS FOR ANALYZING TESTS 
OF GENERAL RELATIVITY 

There are now possible many experiments for testing general relativity. But most 
of them are expensive; very expensive. They involve atomic clocks flown on space- 


f Box 38.1 TECHNOLOGY OF THE 1970's CONFRONTED WITH RELATIVISTIC PHENOMENA 


A 


Quantity to 
be measured 

Magnitude of 
relativistic effects 

Precision of a one-day 
measurement in the early 1970's 

Angular separation of 
two sources on the sky 

Solar deflection of starlight 

(1) if light ray grazes edge of Sun, 

1".75 

(2) if light ray comes in perpendicular 
to Earth-sun line, 

(a) With optical telescope, — 1" 

(b) Angular separation of two quasars with 
radio telescope (differential 
measurement from day to day, not 
absolute measurement) 


0".004 

in 1970, -0'M 
in mid 1970’s, —0".001 

Distance between two 
bodies in solar system 

(a) Perihelion shift per Earth year 

(1) for Mercury, 120 km 

(2) for Mars, 15 km 

(b) Relativistic time delay for radio waves 
from Earth, past limb of sun, to Venus 
(one way), 

1 x 10 -4 sec = 30 km 

(a) Separation of another planet (Mercury, 
Venus, Mars) from Earth, by bouncing 
radar signals off it, 

—0.3 km 

(b) Separation of a radio transponder (on 
another planet or in a space craft) from 
Earth, by measuring round-trip radio 
travel time, 



—3 x 10" 8 sec = 10 m = 0.01 km 


(c) Periodic relativistic effects in 

Earth-moon separation 
(1) in general relativity, 

(c) Earth-moon separation by laser ranging, 
— 10 cm 


100 cm 



(2) in Jordan-Brans-Dicke theory, 

2 


100 cm; (840 cm)/(2 + u) 


Difference in lapse of 
proper time between 
two world lines in solar 
system 


(a) Clock on Earth vs. clock in 
synchronous Earth orbit, 

At/t - 6 x 10~ 10 

(b) Clock on Earth vs. clock in orbit about 
sun, 


Stability of a hydrogen maser clock, 

At/t — 10" 13 for t up to 
one year 


At/t - 1CT 8 
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craft; radar signals bounced off planets; radio beacons and transponders landed on 
planets or orbited about them; etc. Because of the expense, it is crucial to have as 
good a theoretical framework as possible for comparing the relative values of various 
experiments—and for proposing new ones, which might have been overlooked. 

Such a framework must lie outside general relativity. It must scrutinize the foun¬ 
dations of Einstein’s theory. It must compare Einstein’s theory with other viable 
theories of gravity to see which experiments can distinguish between them. It must 
be a “theory of theories.” 

At present, in 1973, there are two different frameworks in broad use. One, devised 
largely by Dicke (1964b),* assumes almost nothing about the nature of gravity. It is 
used to design and discuss experiments for testing, at a very fundamental level, the 
nature of spacetime and gravity. Within it, one asks such questions as: Do all bodies 
respond to gravity with the same acceleration? Is space locally isotropic in its intrinsic 
properties? What are the theoretical implications of local isotropy? What types of 
fields, if any, are associated with gravity: scalar fields, vector fields, tensor fields, 
affine fields? Although some of the experiments that tackle these questions will be 
discussed below, this book will not attempt a detailed exposition of the Dicke 
framework. 

The second framework in broad use is the “parametrized post-Newtonian (PPN) 
formalism.” It has been developed to higher and higher levels of sophistication by 
Eddington (1922), Robertson^ 1962), Schiff (1962, 1967), Nordtvedt (1968b, 1969), 
Will (1971c), and Will and Nordtvedt (1972). 

The PPN formalism is an approximation to general relativity, and also to a variety 
of other contemporary theories of gravity, called “metric theories.” It is a good 
approximation whenever, as in the solar system, the sources of the field gravitate 
weakly {\<P\/c 2 1) and move slowly ( v 2 /c 2 1). The PPN formalism contains a 

set of ten parameters whose values differ from one theory to another. Solar-system 
experiments (measurements of perihelion shift, light deflection, etc.) can be regarded 
as attempts to measure some of these PPN parameters, and thereby to determine 
which metric theory of gravity is correct—general relativity, Brans-Dicke (1961)- 
Jordan (1959) theory, one of Bergmann’s (1968) scalar-tensor theories, one of 
Nordstrom’s theories, Whitehead’s (1922) theory, or something else. [For reviews of 
Nordstrom and Whitehead, see Whitrow and Morduch (1965), Will (1971b), and Ni 
(1972). For a significant nonmetric theory, see Cartan (1920) and Trautman (1972).] 

Chapter 39 will discuss the concept of a metric theory of gravity and will construct 
the PPN formalism; and then Chapter 40 will use the PPN formalism to analyze 
the systematics of the solar system, and to discuss a variety of past and future 
experiments that distinguish between various metric theories of gravity. But first, 
as a prelude to those topics, this chapter will examine experiments that test the 
foundations of general relativity—foundations on which most other metric theories 
also rest. For a more detailed discussion of most of these experiments, see Dicke 
(1964b). 


Theoretical frameworks for 
analyzing gravitation 
experiments: 


(1) Dicke framework 


(2) PPN framework 


*See Thome and Will (1971), or Will (1972), for expositions of both frameworks and a comparison 
of them. 
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- \ 

The rest of this chapter is 

Track 2. 

No earlier Track-2 material is 
needed as preparation for it, 
but Chapter 7 
(incompatibility of gravity 
and special relativity) will be 
helpful. 

This chapter is not needed as 
preparation for any later 
chapter, but it will be helpful 
in Chapters 39 and 40 (other 
theories; PPN formalism; 
experimental tests). 

_ ) 

Eotvos-Dicke experiment to 
test uniqueness of free fall 


§38.3. TESTS OF THE PRINCIPLE OF THE UNIQUENESS 
OF FREE FALL: EOTVOS-DICKE EXPERIMENT 

One fundamental building block common to Einstein’s theory of gravity and to 
almost all other modern theories is the principle of “uniqueness offree fall”:* “The 
world line of a freely falling test body is independent of its composition or structure.” 
By “test body” is meant an electrically neutral body, small enough that (1) its 
self-gravitational energy, as calculated using standard Newtonian theory, can be 
neglected compared to its rest mass ( M/R 1), and (2) the coupling of its multipole 
moments to inhomogeneities of the gravitational field can be neglected.f 

The uniqueness of free fall permits one to regard spacetime as filled with a set 
of curves, the test-body trajectories, which are unique aside from parametrization. 
Through each event, along each timelike or null direction in spacetime, there passes 
one and only one test-body trajectory. Describe these trajectories mathematically: 
that is a central imperative of any theory of gravity. 

When translated into Newtonian language, the uniqueness of free fall states that 
any two test bodies must fall with the same acceleration in a given external gravita¬ 
tional field. Experimental tests of this principle search for differences in acceleration 
from one body to another. The most precise experiments to date are of a type devised 
by Baron Lorand von Eotvos (Box 38.2), redesigned and pushed to much higher 
precision by the Princeton group of Robert H. Dicke (Box 38.3), and extended with 
modifications by the Moscow group of Vladimir B. Braginsky. (See Figure 1.6 and 
Box 1.2 for experimental details.) 

These Eotvos-Dicke experiments are “null experiments.” They balance the accel¬ 
eration of one body against the acceleration of another, and look for tiny departures 
from equilibrium. The reason is simple. Null experiments typically have much higher 
precision than experiments measuring the value of a nonzero quantity. 

Eotvos, Pekar, and Fekete (1922) checked to an accuracy of 5 parts in 10 9 that 
the Earth imparts the same acceleration to wood, platinum, copper, asbestos, water, 
magnalium (90% Al, 10% Mg), copper sulphate, and tallow. Renner (1935) checked, 
to 7 parts in 10 10 , the Earth’s acceleration of platinum, copper, bizmuth, brass, glass, 
ammonium fluoride, and an alloy of 30% Mg, 70% Cu. Dicke, and later Braginsky, 
chose to use the sun’s gravitational acceleration rather than the Earth’s, since the 
alternation in the direction of the sun’s pull every 12 hours lends itself to amplifica¬ 
tion by resonance. (See Figure 1.6.) Roll, Krotkov, and Dicke (1964) reported an 


*R. H. Dicke calls this principle “The weak equivalence principle.” We prefer to avoid^ confusion 
with the equivalence principle (Chapter 16). 

fin general relativity, one often uses an alternative definition of test body, which places no constraint 
on the self-gravitational energy [abandon condition (1) while retaining (2)]. Such a definition is preferable, 
in principle, because the theory of matter has not been developed sufficiently to decide whether (and 
no objective test has ever been proposed to decide whether), gravitational energy at the subnuclear scale 
is a small fraction, a large fraction, or the entirety of the rest mass. But for present purposes a definition 
constraining test bodies to have M/R 1 is preferable for two reasons. First, most theories of gravity 
that currently “compete” with Einstein’s (a) agree with the principle of uniqueness of free fall when 
the macroscopic, Newtonian, self-gravitational energy is neglected ( M/R < 1), but (b) violate that 
principle when macroscopic, Newtonian self-gravitational energy is taken into account See §40.9 for 
details. Second, the test bodies used in the EStvSs-Dicke experiment have M/R so small that their 
macroscopic, Newtonian, self-gravitational energies are, in fact negligible ( M/R ~~ E^ v /M ~~ 10~ 27 ). 




§38.3. EOTVOS-OICKE EXPERIMENT 


1051 


agreement of 1 part in 10 n between the sun’s acceleration of aluminum and gold, 
while Braginsky and Panov (1971) reported agreement to 1 part in 10 12 for aluminum 
and platinum. 

From this agreement, one can infer the response of neutrons, protons, electrons, Theoretical implications of 
virtual electron-positron pairs, nuclear binding energy, and electrostatic energy to Eotvos-Dicke experiment 
the sun’s gravity. Gold is 60% neutrons, while aluminum is only 50% neutrons. 

Therefore even from the 1964 results one could conclude that neutrons and protons 
must have the same acceleration to within [0.6 - 0.5 = 0.1] -1 parts in 10 11 = 1 part 
in 10 10 . Similarly, electrons must accelerate the same as nucleons to 2 parts in 10 7 ; 
virtual pairs (being more abundant in gold than in aluminum) must accelerate the 
same to 1 part in 10 4 ; nuclear binding energy, to 1 part in 10 7 ; and electrostatic 
energy to 3 parts in 10 9 . 

This accuracy of testing gives one confidence in the principle of the uniqueness 
offree-fall. 

(continued on page 1054 ) 

( ---'s 

Box 38.2 BARON LORAND VON EOTVOS 

Budapest, July 27, 1848—Budapest, April 8, 1919 

Eotvos (pronounced ut’vush) studied at Heidelberg with Kirchhoff, Helmholtz, and 
Bunsen and at Konigsberg with Neumann and Richelot. His 1870 Heidelberg Ph.D. 
thesis dealt with an issue of relativity: can the motion of a light source relative to 
an “ether” be detected by comparing the light intensities in the direction of the 
motion and in the opposite direction? 

Studies of his at the same time resulted in the Eotvos law of capillarity, (surface 
tension) 2.12 (T crit — T)/(specific volume) 273 . Eotvos, made professor of physics 
at Budapest in 1872, concentrated on gravity from 1886 onward. He developed and 
extended the original Michell-Cavendish torsion balance, which measured not only 
0 yXX and 0 xy (where 0 is the gravitational potential) but also 0 xz and 0 all to 
a precision destined to be unexcelled for decades. He showed that the so-called “ratio 
between gravitational mass and inertial mass” cannot vary from material to material 
by more than 5 parts in 10 9 . He investigated the paleomagnetism of bricks and other 
ceramic objects, and studied the shape of the earth. He served (June 1894—January 
1895) as minister of public instruction and religious affairs (a cabinet position held 
in earlier years by his father). He founded a school which trained high-school 
teachers, to whose leavening influence one can give some of the credit for such 
outstanding scientists as von Karman, von Neuman, Teller, and Wigner. He served 
one year as rector of the University of Budapest. 

"I can never forget the moment when my train rushed into the railroad station of 
Heidelberg along the banks of the Neckar. . . / cannot forget my happiness that 
now I could breathe the same air as those men of science whose fame attracted 

me there. " 

[EOTVOS IN 1887, AS QUOTED IN FEJER AND MIKOLA (1918), P. 259.] 

V_ J 
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'Insofar as it is permitted on the basis of a few experiments f we can therefore 
declare that \i, that is, the weakening of the Earth's attraction through the 
intervening compensator quadrants, is less than one part in 5 X 10 lQ . . . . the 
absorption (of gravity) by the entire earth along a diameter fs less than about one 

part in 800. 

'We have carried out a series of observations which surpassed all previous ones in 
precision, but in no case could we discover any detectable deviation from the law 

of proportionality of gravitation and inertia. 

[EOTVOS, PEKAR, AND FEKETE (1922).] 


"Science shall never find that formula by which its necessary character could be 
proved. Actually science itself might cease if we were to find the due to the 

secret ." 

[EOTVOS, PRESIDENTIAL ADDRESS TO THE HUNGARIAN ACADEMY OF SCIENCES, 189D, 

AS QUOTED IN FEJE:R AND MIKOLA (1918), P. 280.] 


"We should consider it as one of the most astonishing errors of the present pge 
thet so many people listen to the words of pseudoprophets who, in place of the 
dogmas of religion, offer scientific dogmas with medieval impatience but without 

historical justification." 

[EOTVOS, 1877, AS QUOTED IN FEJER AND MIKOLA (1918), P. 280.] 
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Box 38.3 ROBERT HENRY DICKE May 6, 1916, St. Louis, Missouri 

Cyrus Fogg Brackett Professor of Physics at Princeton University 


During 1941-1946, Dicke was a leader in replacing 
the outmoded concept of lumped circuit elements 
by a new microwave analysis based on symmetry 
considerations, conservation laws, reciprocity rela¬ 
tions, and the scattering matrix—concepts that led, 
among others, to the lock-in amplifier and the 
microwave radiometer. Searching for a means to 
reduce the Doppler width of spectral lines for 
precision measurements, Dicke discovered recoil¬ 
less radiation in atomic systems held in a box or 
in a buffer gas. This development led to (1) the 
discovery of the basic idea of the gas-cell atomic 
clock and (2) a much more precise measurement 
of the gyromagnetic ratio of electrons in the Is and 
2s levels of hydrogen and of the hyperfine struc¬ 
ture of atomic hydrogen. 

A fundamental paper by Dicke in 1954 set forth 
the theory of coherent fediation processes and of 
the superradiant state, and laid the foundation for 
the future development of the laser and the maser, 
to which he also contributed. His patent no. 
2,851,652 (filed May 21, 1956) was the first disclo¬ 
sure of a device for the generation of infrared 
radiation by a coherent process, and supplied the 
first suggestion for combining the use of an etalon 
resonator with an amplifying gas. 

Beginning in the 1960’s, Dicke brought his talent 
for precision measurement to the service of experi¬ 
mental cosmology, and with his collaborators: (1) 



checked the equivalence principle with the up- 
to-then unprecedented accuracy of 1 part in 10 11 ; 
(2) determined the solar oblateness; and (3) sug¬ 
gested that the primordial cosmic-fireball radia¬ 
tion, a tool for seeing deeper into the past history 
of the universe than has ever before been possible, 
should be observable, and therefore should be 
hunted down and found. 


"For want of a better term , a gas which is radiating strongly because of coherence 
will be called ' superradiant .'... As the system radiates it passes to states of 
lower m with r unchanged—to the 'superradiant' region m ~~ 0" 

(1954) 

"Possibilities are examined for the excitation of optical 
'superradiant' states of gas" 


( 1957 ) 



1054 


38. TESTING THE FOUNDATIONS OF RELATIVITY 



Box 38.3 ( continued ) 




Experimental evidence for 
existence of a metric 



"A 'gravitational oblateness' of [the sun of ] 5 X 10 5 would require the 
abandonment of Einstein's purely geometrical theory of gravitation . , . . Such a 
flattening [of the sun ] could be understood as the effect of a rather rapidly rotating 
interior. . . . The answer appears to be that in the past, and to this day, the solar 
corona with its magnetic field has acted as a brake on the surface of the sun 

(1964a) 

"New measurements of the solar oblateness have given a value for the fractional 
difference of equatorial and polar radii of (5.0 ± 0.7) X 10 0 

[DICKE AND GOLDENBERG (1967)] 

"[The universe must ] have aged sufficiently for there to exist elements other than 
hydrogen. It is well-known that carbon is required to make physicists" 

(1961) 

"The question of the constancy of such dimensionless numbers is to be settled 

not by definition but by measurements" 

[BRANS AND DICKE (1961)] 

"The geophysical data lead to an upper limit of 3 parts in 10 xz per year on the 

rate of change of the fine-structure constant" 

[DICKE AND PEEBLES (1962)] 

_ J 


§38.4. TESTS FOR THE EXISTENCE OF A METRIC GOVERNING 
LENGTH AND TIME MEASUREMENTS, AND 
PARTICLE KINEMATICS 

Special relativity, general relativity, and all other metric theories of gravity assume 
the existence of a metric field and predict that this field determines the rates of ticking 
of atomic clocks and the lengths of laboratory rods by the familiar relation — dr 2 = 
ds 2 = g a p dx a dxP. 

The experimental evidence for a metric comes largely from elementary particle 
physics. It is of two types: first, experiments that measure time intervals directly, 
e.g., measurements of the time dilation of the decay times of unstable particles;* 
second, experiments that reveal the fundamental role played by the Lorentz group 
in particle kinematics and elsewhere in particle physics.f To cast out the metric tensor 
entirely would leave one with no theoretical framework adequate for interpreting 
such experiments. 

*For a 2 per cent test of time dilation with muons of (1 - ir 2 )" 172 — 12 in a storage ring, see Farley, 
Bailey, Brown, Giesch, Jostlein, van der Meer, Picasso, and Tannenbaum (1966). For earlier time-dilation 
experiments see Frisch and Smith (1963); Durbin, Loar, and Havens (1952); and Rossi and Hall (1941). 

fSee p. 18 of Lichtenberg (1965) for a discussion of Lorentz invariance, spin and statistics, the TCP 
theorem, and relevant experiments. 
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Notice what particle-physics experiments do and do not tell one about the metric 
tensor, g. First, they do not guarantee that there exist global Lorentz frames, i.e., 
coordinate systems extending throughout all of spacetime, in which g a ^ — r\ a ^. 
However, they do suggest that at each event 9 there exist orthonormal frames with 
e-Jf) ’ e $(&) = Vaf}, which are related to each other by Lorentz transformations. 
These orthonormal frames provide one with a definition of the inner product between 
any two vectors at a given event—and, thereby, they define the metric field. 

Second, particle experiments do not guarantee that freely falling particles move 
along geodesics of the metric field, i.e., along straight lines in local Lorentz frames. 
(Here, in §§38.4 and 38.5, the phrase “local Lorentz frame” means a “normal” co- 
ordinate system at an event 9, in which g a0 (9) = Va/3 and g af} y (9) = 0. The term 
“inertial frame” is avoided because no assertions are made, yet, about test-body 
motion.) In particular, one does not know from elementary-particle experiments 
whether the local Lorentz frames in the laboratory are freely falling (so they fly 
up from the center of the earth and then fall back with Newtonian acceleration 
g = 980 cm/sec 2 ), whether they are forever at rest relative to the laboratory walls, 
or whether they undergo some other type of motion. All one is led to believe is 
that a metric determines the nature of the spacetime intervals (dr 2 = —g^ dx* dx v ) 
measured by atomic clocks, that the various local Lorentz frames in the laboratory 
therefore move with uniform velocity relative to each other (they are connected by 
Lorentz transformations), and that electric and magnetic fields and the energies and 
momenta of particles undergo Lorentz transformations in the passage from one local 
Lorentz frame to another. 

Third, elementary particle experiments do suggest that the times measured by 
atomic clocks depend only on velocity, not on acceleration. The measured squared 
interval is ds 2 = g a/3 dx a dx 13 , independently of acceleration (until the acceleration 
becomes so great it disturbs the structure of the clock; see §16.4 and Box 16.3). 
Equivalently, but more physically, the time interval measured by a clock moving 
with velocity v’ relative to a local Lorentz frame is 


Particle experiments do not 
guarantee existence of global 
Lorentz frames, or geodesic 
motion for test particles 


Particle experiments do 
suggest proper tim'e is 
independent of acceleration 


dr = ( Va /3 dx a dxP) 1/2 = [1 - (v*) 2 - (u*) 2 - (p*) 2 ]"2 dt, (38.1) 

independently of the clock’s acceleration d 2 x’/dt 2 . If this were not so, then particles 
moving in circular orbits in strong magnetic fields would exhibit different decay rates 
than freely moving particles—which they do not [Farley et al. (1966)].* 


§38.5. TESTS OF GEODESIC MOTION: 

GRAVITATIONAL REDSHIFT EXPERIMENTS 

The uniqueness of free fall, as tested by the Dicke-Eotvos experiments, implies that 
spacetime is filled with a family of preferred curves, the test-body trajectories. There 


•The experiment of Farley et al. is a 2 percent check of acceleration-independence of the muon decay 
rate for energies E/m = (1 — ir)~ l/2 ~ 12 and for accelerations, as measured in the muon rest frame, 
of a = 5 x 10 20 cm/sec 2 — 0.6 cm -1 . 
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Physical meaning of a 
comparison between 
test-body trajectories and 
geodesics of metric 


Pound-Rebka-Snider redshift 
experiment as a test of 
geodesic motion 


is also another family of preferred curves, the geodesics of the metric g . It is tempting 
to identify these geodesics with the test-body trajectories. Einstein’s geometric theory 
of gravity makes this identification (“equivalence principle”). One might conceive 
of theories that reject this identification. What is the experimental evidence on this 
point? 

In order to see what kinds of experiments are relevant, it is helpful to elucidate 
the physical significance of the geodesics. 

A geodesic of g is most readily identified locally by the fact that it is a straight 
line in the local Lorentz frames. Put differently, a body’s motion is unaccelerated 
as measured in a local Lorentz frame if and only if the body moves along a geodesic 
of g. Hence, to determine whether test-body trajectories are geodesics, one must 
compare experimentally the motion of the spatial origin of a local Lorentz frame 
(as defined by atomic-clock readings) with the motion of a test body (material 
particle). 

It is easy to study experimentally the motions of test bodies; relative to an earth- 
bound laboratory, they accelerate downward with g = 980 cm/sec 2 ; and this accel¬ 
eration can be measured at a given location on the Earth to a precision of 1 part 
in.10 6 . 

Unfortunately, it is much more difficult to measure the motion of a local Lorentz 
frame, once again as defined by atomic-clock readings. The only direct experimental 
handle one has on this today, with sufficient precision to be interesting, is gravita¬ 
tional redshift experiments. (See §§7.2-7.5 and §25.4 for theoretical discussions of 
the gravitational redshift in the framework of general relativity.) 

The redshift experiment of highest precision is that of Pound and Rebka (1960), 
as improved by Pound and Snider (1965); see Figure 38.1. It used the Mossbauer 
effect to measure the redshift of 14.4 keV gamma rays from Fe 57 . The emitter and 
absorber of the gamma rays were placed at rest at the bottom and top of a tower 
at Harvard University, separated by a height h = 74 feet = 22.5 meters. The meas¬ 
ured redshift agreed, to 1 percent precision, with the general relativistic prediction 
of 


AX/X = gh = 2.5 X 10‘ 15 . (38.2) 

This result tells one that the local Lorentz frames are not at rest relative to the 
Earth’s surface; rather, they are accelerating downward with the same acceleration, 
g , as acts on a free particle (to within 1 percent precision). To arrive at this conclusion, 
one analyzes the experiment in the laboratory reference frame, where everything 
(the experimental apparatus, the Earth, the Earth’s gravitational field) is static. 
Relative to the laboratory a local Lorentz frame, momentarily at rest, accelerates 
downward (horizontal accelerations being ruled out by symmetry) with some un¬ 
known acceleration a. Equivalently, the laboratory accelerates upward (in +z direc¬ 
tion) with acceleration a relative to the local Lorentz frame. Consequently, the 
spacetime metric in the laboratory frame has the standard form 

ds 2 = -(1 + 2az) dt 2 + dx 2 + df + dz 2 + 0( |xf) dx a dx (38.3) 




Figure 38.1. 

The experiment of Pound and Rebka (1959) and Pound and Snider (1965) on the gravitational redshift 
of photons rising 22.5 meters against gravity through a helium-filled tube in a shaft in the Jefferson 
Physical Laboratory of Harvard University. The source of Co 57 had an initial strength greater than a 
curie. The 14.4 keV gamma rays had to pass in through an absorber enriched in Fe 57 to reach the 
large-window proportional counters. Both source and absorber were placed in temperature-regulated 
ovens. The velocity of the source consisted of two parts: one steddy (v M ), to put the center of the emission 
line on the part of the transmission curve that is nearly straight; and the other alternating between + Vj 
and — Vj. to sweep the transmission curve in this straight region; similarly when the steady velocity was 
-v M . The departure from symmetry between the two cases + v u and -v if allows one to determine the 
offset v D (effect of gravitational redshift) from the zero-gravity case of stationary emitter and stationary 
absorber. The final result for the redshift was (0.9990 ± 0.0076) times the value 4.905 x 10~ 13 of 2^/c 2 
predicted from the principle of equivalence (difference between “up" experiment and “down" experi¬ 
ment). Diagrams adapted from Pound and Snider (1965). 













1058 


38. TESTING THE FOUNDATIONS OF RELATIVITY 


which Track-2 readers have met in §§6.6 and 13.6; and Track-1 readers have met 
and used in Box 16.2. Moreover, in the laboratory frame the metric is static, gravity 
is static, and the experimental apparatus is static. Therefore the crest of each electro¬ 
magnetic wave that climbs upward must follow a world line t(z) identical in form 
to the world lines of the crests before and after it; thus, 

wave crest #0: t = t 0 (z), 
wave crest #1: t = t 0 (z) + At, 

wave crest #w: t — t 0 (z) + n At. 

[Here, as in Schild’s argument (§7.3) that redshift implies spacetime curvature, no 
assumption is made about the form of the wave-crest world lines see Figure 
7.1.] Hence, expressed in coordinate time, the interval between reception of successive 
wave crests is the same as the interval between emission. Both are At. But the atomic 
clocks of the experiment (Fe 57 nuclei) are assumed to measure proper time At = 
(—g a p Ax a Ax^) 1/2 , not coordinate time. Thus 

^received _ ^received _ (1 + ^received) 

^emitted ^emitted 0 + a2 emitted) ^ 

— 1 ^( 2 received ^emitted)’ 


i.e., 

theoretical prediction based on assumptions 
(/) that atomic clocks measure At = (~g a pAx a Ax#) 

(ii) that electromagnetic radiation has the form of a 
wave train; 

(i/7) that local Lorentz frames accelerate downward 
with acceleration a relative to the laboratory. 

Direct comparison with the experimental result (38.2) reveals that local Lorentz 
frames in an Earthbound laboratory accelerate downward with the same acceleration 
g as acts on a test particle (to within 1 per cent precision). 

[The above discussion is basically a reworked version of Schild’s proof (§7.2) that 
the redshift experiment implies spacetime is curved. After all, how could ipacetime 
possibly be flat if Lorentz frames in Washington, Moscow, and Peking all accelerate 
toward the Earth’s center with g — 980 cm/sec 2 ?] 

Other redshift experiments Of all redshift experiments, the Pound-Rebka-Snider experiment is the easiest to 

interpret theoretically, because it was performed in a uniform gravitational field. 
Complementary to it is the experiment by Brault (1962), which measured the redshift 
of the sodium D x line emitted on the surface of the sun and received at Earth (Figure 
38.2). To a precision of 5 per cent, he found a redshift of GM e /R e c 2 , where M 0 and 
Rq are the mass and radius of the sun. This is just the redshift to be expected if 
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Figure 38.2. 

The measurement by Brault (1962) of the redshift of the line of sodium gives 1.05 zt. 0.05 of the 
gravitational redshift predicted by general relativity. This strong line, in contrast to the weak lines used 
by earlier investigators (1) is emitted high in the sun’s atmosphere, above the regions strongly disturbed 
by the pressure and convective shifts, and yet lower than the chromosphere, and (2) comes closer to 
standing up cleanly above the background than any other line in the visible spectrum. Brault built a 
new photoelectric spectrometer (upper diagram), with its slit vibrated mechanically back and forth across 
a narrow region of the spectrum, to define the position of the line peak (1) electronically, (2) independently 
of subjective judgment, and (3) with a precision greater by a factor of the order of ten than that afforded 
by conventional visual methods. The slit is considered set on a line when its mean position is such that 
the photomultiplier current contains no signal at the frequency of the modulation. The redshift measured 
in this way is corrected for orbital motion and for rotation of the sun and the Earth to give the points 
in circles and triangles in the lower diagram. Extrapolation to zero vibration of the slit gives the cited 
number for the redshift. Figure adapted from thesis of Brault (1962). 
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the local Lorentz frames, at each point along the photon trajectory, fall in step with 
freely falling test bodies.* 

In summary, redshift experiments reveal that, to a precision of several percent, 
the local Lorentz frames at the Earth’s^ surface and near the sun are unaccelerated 
relative to freely falling test bodies. Equivalently, test bodies move along straight 
lines in the local Lorentz frames. Equivalently, the test-body trajectories are geodesics 
of the metric g . 


§38.6. TESTS OF THE EQUIVALENCE PRINCIPLE 


Tests of the equivalence 
principle: 


(1) geodesic motion 


Of all the principles at work in gravitation, none is more central than the equivalence 
principle. As enunciated in §16.2, it states: “In any and every local Lorentz frame, 
anywhere and anytime in the universe, all the ( nongravitational ) laws of physics must 
take on their familiar special-relativistic forms . ” 

That test bodies move along straight lines in local Lorentz frames (geodesic 
motion) is one aspect of the equivalence principle. Other aspects are the universality 
of Maxwell’s equations 

= 4w/“ and F a0<y + F fiy>a + F yaJ = 0 (38.5) 

in all local Lorentz frames; the universality of the law of local energy-momentum 
conservation 

= 0; (38.6) 

and the universality of the values of the dimensionless constants that enter into the 
local laws of physics: 


_ e 2 _ _1__ /electromagnetic fine-\. 

c tic — 137.0360 ... \structure constant / 


tYi 

Deutr0D = 1.00138 

^proton 


' ‘electron 
^proton 


l 

1836.12 777 ’ 


etc. 


(38.7) 


(2) physical laws are locally 
Lorentz-invariant 


(3) laws do not vary from 
event to event 


(Attention here is confined to dimensionless constants, since only they are independ¬ 
ent of one’s arbitrary choice of units of measure.) 

If one focuses attention on a given event and asks about invariance of the form 
of the physical laws [equations (38.5), (38.6), etc.] from one Lorentz frame to another, 
one is then in the province of special relativity. Here a multitude of experiments 
verify the equivalence principle (see §38.4). 

If one asks about variations in the form of the laws from one event to another, 
one opens up a Pandora’s box of possibilities that one hardly dares to contemplate. 
However, no experimental evidence has ever given the slightest warrant to consider 
any such “departure from democracy” in the action of the laws of physics. Moreover, 
astronomical observations provide strong evidence that the laws of physics are the 


For a review of other, less-precise redshift experiments, see Bertotti, Brill, and Krotkov (1962). 
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same in distant stellar systems as in the solar system, and the same in distant galaxies 
as in our own Galaxy. (See, in Box 29.5, Edwin Hubble’s expressions of joy upon 
discovering this.) 

Constancy of the dimensionless “constants” from event to event can be tested to 
high precision, if one assumes constancy of the physical laws. Dirac (1937, 1938), 
Teller (1948), Jordan (1955, 1959), Gamow (1967), and others have proposed that 
the fine-structure “constant” a e might be a slowly varying scalar field, perhaps 
governed by a cosmological equation. However, rather stringent limits on such 
variations follow from data on the fine-structure splitting of the spectral lines of 
quasars and radio galaxies. For the quasar 3C 191 with redshift z = 1.95, Bahcall, 
Sargent, and Schmidt (1967) find a e (3C 191)/a e (Earth) = 0.97 ± 0.5. With a cos¬ 
mological interpretation of the quasar redshift, this corresponds to a limit 
(\/a e ){d(x e /dt) < 1/10 11 years. An even tighter limit has been obtained from radio¬ 
galaxy data, where there is no question about the interpretation of the redshift. 
Bahcall and Schmidt (1967) measured fine-structure splitting in five radio gal¬ 
axies with z 0.20, corresponding to an emission of light 2 X 10 9 years ago. 
They obtained a e (z = 0.20)/a c (Earth) = 1.001 ± 0.002, which yields the limit 
\(l/a e )(da e /dt)\ < 1/10 12 years. 

Dyson (1972) points out that comparison of the rate of beta decay of Re 187 in 
times past (via osmium-rhenium abundance ratios in old ores) with the rate of 
beta-decay today provides a means to check on any possible variation of a e with 
time more sensitive than redshift data and more sensitive than any changes in rates 
of alpha decay and fission between early times and now. He summarizes the available 
data on Re 187 and arrives at the limit 

- - \(\/a e )(da e /dt)\ < 5/10 15 years. 

For further evidence of the constancy of the fundamental constants see Minkowski 
and Wilson (1956), Dicke (1959a,b), Dicke and Peebles (1962b). 

Spatial variations of a e , w neutron /m protoI1 , anc * other “constants” in the solar system 
can be sought by means of Eotvos-type experiments. The reasoning [by Dicke (1969)] 
leading from such experiments to limits on any spatial variation of the constants 
is indirect. It recalls the reasoning used in standard treatises on polar molecules to 
deduce the acceleration of a polarizable molecule pulled on by an inhomogeneous 
electric field. It proceeds as follows. 

Suppose one of the dimensionless “constants,” “a,” depends on position. This will 
lead to a position-dependence of the total mass-energy of a laboratory test body. 
For example, if a c depends on position, then the coulomb energy of an atomic 
nucleus will also (E coul oc e 4 oc a 2 ; 8M/E conl = 2 8a e /a e ). One can calculate the 
change in a test body’s mass-energy when it is moved from x* to x* + 8x^ by 
assuming no change at all in the body’s structure during its displacement: 

6M = (9A//aa) fixedstructure (0a/0x^) 8x (38.8) 
After the displacement, a weakening of internal forces (due, e.g., to a decrease of a) 


(4) fundamental constants do 
not vary from event to 
event 


Eotvos-type experiments as 
tests for spatial variation of 
fundamental constants 
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may cause a change in structure, but that change will be accompanied by a con¬ 
version of internal potential energy into internal kinetic energy, which conserves M. 

Now consider the following thought experiment [an elaboration of the argument 
by which Einstein first derived the gravitational redshift (§7.2)]: Take n particles, 
each with mass-energy jli. Make the particles with a structure such that a negligible 
fraction of jl l is associated with the “constant” of interest, a: 

(1 /ji)(dji/da) = 0. (38.9) 

Place these particles at a height h in a (locally) uniform Newtonian field. Combine 
them together there, releasing binding energy E B (h ), to form a composite body of 
mass 

Af = nix - E B (h) (38.10) 

which depends in a significant manner on the “constant” a, 

(1 /M)(dM/da) 0. (38.11) 

Lower this body, and the released binding energy tied up in a little bag, a distance 
8h. The total force acting is (in Newtonian language) 

F= Ma + E B (h)g. (38.12) 

Here g is acceleration experienced by the type of mass-energy that is independent 
of a when it is in free fall. In contrast, “free” fall of the assembled body M is not 
really free fall, because of the supplementary “polarization force” pulling on this 
object. Hence the assembled body in “free” fall experiences an acceleration, a, a 
little different from g . However, the mass that is accelerated is precisely M , and 
therefore the force required to produce this acceleration is given by the product Ma. 
The energy gained in lowering the body and the bag is 

£(down) = F8h = Ma8h + E B {h)g8h . 

Put this energy in the bag. 

At h — 8h use some of the energy from the bag to pull the body apart into its 
component particles. The energy required is 

EJh — 8h) = n\i — M(h — Sh) = nfi — M(h) + 

da ah 

= E ‘« t > + Wfk Sh ’ 


so an energy 

£ bag = E is ( h ) + £(down) - E B (h - 8h) 

= [ Ma + E tWs - d -£^ 


(38.13) 


is left in the bag. Use this energy to raise the n particles and the bag back up to 
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height h. Assume total energy conservation, so that there will be no extra energy and 
no deficit when the n particles and bag have returned to the original state back 
at height h. This means that £ bag must be precisely the right amount of energy to 
do the raising: 

^bag = m Sh = [M+ E B {h)]g 8h. (38.14) 

Combining expressions (38.13) and (38.14) for £ bag , discover that 


_ 1 dMda 
M da dh 


(38.15) 


Thus, under the assumption of total energy conservation (no perpetual-motion 
machines!), a spatial dependence of a physical “constant” a will lead to the anomaly 
(38.15) in the acceleration of a body whose mass depends on a. 

Coulomb energy, which is proportional to a e 2 , amounts in a gold nucleus to 0.4 
per cent of the mass, and to 0.1 per cent in an aluminum nucleus. Hence, a spatial 
variation in a e should lead to a fractional difference in the gravitational accelerations 
of these two nuclei equal to 


g 


1 „ 0.003 da. 

-2 - -f< 1 x lO" 11 ; 

g a e dh ~ 


i.e.. 


da. 


dh 


< 1 X 10~ 9 g ~ 1 X 10~ 9 cm/sec 2 = 1 x 10- 30 /cm 


at the Earth due to the sun. 


(38.16) 


Here use is made of the limit (1 X 10“ n ) from Dicke’s experiment (§38.3), and the 
acceleration g = 0.6 cm/sec 2 due to the sun at Earth. 

Notice that this says the gradient of In a e is less than 1 X 10“ 9 the gradient of 
the Newtonian potential! 


§38.7. TESTS FOR THE EXISTENCE OF UNKNOWN 
LONG-RANGE FIELDS 

Whether or not one accepts the assumption that test bodies move on geodesics of 
the metric, it remains conceivable that previously unknown long-range fields (fields 
with “1/r” fall-off at large distances) are somehow associated with gravity. 

If “new” long-range fields (not metric, not electromagnetic) do exist, waiting to 
be discovered, then there are two ways by which they could influence matter. First, 
they could couple directly to matter, producing, for example, slight deviations from 
geodesic motion (deviations smaller than the limits of §38.5), or slight dependences 
of masses of particles on position (dependences smaller than the limits of §38.6). 
Second (and harder to detect), they could couple indirectly to matter by being mere 


Possible existence of new 
long-range fields associated 
with gravity 


Direct vs. indirect coupling 
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Experimental limits on 
direct-coupling fields: 

(1) Hughes-Drever 
experiment 


(2) ether-drift experiments 


participants in field equations that determine the geometry of spacetime. This section 
will describe tests for direct-coupling effects. Theories with fields that couple indi¬ 
rectly will be described in Box 39.1, and tests for such fields will be discussed in 
Chapter 40. 

Dicke (1964b), using his framework for analyzing tests of gravitation theories 
(§38.2), has shown that several null experiments place stringent limits on unknown, 
direct-coupling, long-range fields. 

One of these experiments is the “Hughes-Drever Experiment” [Hughes, Robinson, 
and Beltran-Lopez (1960); Drever (1961)]. It can be thought of as a search for a 
symmetric second-rank tensor field h a0 that produces slight deviations of test-body 
trajectories from geodesics of the metric g a/3 . Unless one’s experiments happen to 
be made in a region of spacetime where h afi is a constant multiple of g afi (“mere 
rescaling of all lengths and times by a constant factor”), this tensor field must produce 
anisotropies in the properties of spacetime—which, in turn, will cause anisotropies 
in the inertial mass of a nucleon, and in turn will cause in an atomic nucleus relative 
shifts of degenerate energy levels with different magnetic quantum numbers. The 
Hughes-Drever experiment places stringent limits on such shifts, and thereby on 
a possible tensor field h aj8 . To quote Dicke (1964, p. 186), “If two [tensor] fields are 
present with the one strongly anisotropic in a coordinate system chosen to make 
the other isotropic, the strength of [direct] coupling to one must be only of the order 
of 10“ 22 that of the other.... [Moreover], on the moving Earth with ever-changing 
velocity, anisotropy would be expected at some season.” From the experiments of 
Hughes and Drever, then, one concludes that there is not the slightest evidence for 
the presence of a second tensor field. For further details see Dicke and Peebles 
(1962a). 

Another series of experiments, called “ ether-drift experiments places stringent 
limits on any unknown, long-range vector field that couples directly to mass-energy. 
One can imagine such a field of cosmological origin. Being cosmological, the 4-vector 
would most naturally be expected to point in the same direction as the 4-vector 
u of the “cosmological fluid” (identical with the time direction e 0 of a frame in 
which the cosmic microwave radiation is isotropic). The 4-vector of the new field 
would then have spatial components in any other frame. In principle an observer 
could use them to discern his direction of motion and speed relative to the mean 
rest frame of the universe. The ether-drift experiments search for effects of such 
a field. 

For example, the experiment of Turner and Hill (1964) searches for a dependence 
of clock rates on such a vector field, by examining the transverse Doppler’shift as 
a function of direction for an emitter on the rim of a centrifuge and a receiver at 
its center (Figure 38.3). If there is any effect, it would most naturally be expected 
to have the form 


rate of clock moving relative 
to universe with speed /J 

( rate of clock at rest\ 
relative to universe / 


= l + yP 2 , 


y a small constant. (38.17) 
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Figure 38.3. 

The experiment of Turner and Hill (1964) looks for a dependence of proper 
clock rate (the clock being a Co 57 source placed near the rim of the centrifuge) 
on velocity relative to the distant matter of the universe; or, in operational terms, 
relative to a “new local field” described by a 4-vector. The 14.4 keV gamma 
rays from the Co 57 already experience a second-order Doppler shift of 1.3 parts 
in 10 13 . One searches for an additional shift y /? 2 where = u 4 - v(e x cos wf 4- 
e y sin uf) is the velocity relative to the frame in which the scalar field is purely 
timelike. The transmission of the gamma rays through the Fe 57 absorber will 
drop linearly with any such additional shift, and will be noted as a drop in 
the counting rate of the Nal crystal. The source was 10 cm from the axis of 
rotation and the centrifuge turned at 15,000 rpm. The value of y deduced from 
the experiment was (1 ±. 4) x 10" 5 . 


A clock at the center of the centrifuge has /3 = u = ue x , whereas one on the rim 
has /3 = u 4* v(e x cos ut + e y sin cot). Thus, the shift between rim and disk should 
vary with position 

A\/X = — Av/v — — lyuveoscot 4 - usual transverse shift. 

The data of Turner and Hill, using the Mossbauer effect, show that 

IyI < 4 X 10" 5 ’ (38.18) 

Hence, a cosmological vector field, if present, has only a weak direct coupling to 
matter. 

For further discussion of these experiments and references on others like them, 
see Dicke (1964b). 



CHAPTER 


OTHER THEORIES OF GRAVITY 
AND THE POST-NEWTONIAN 
APPROXIMATION 




§39.1. OTHER THEORIES 


Role of alternative gravitation 
theories as foils for 
experimental tests 


Criteria for viability of a 
theory: 

(1) self-consistency 


Among all bodies of physical law none has ever been found that is simpler or more 
beautiful than Einstein’s geometric theory of gravity (Chapters 16 and 17); nor has 
any theory of gravity ever been discovered that is more compelling. 

As experiment after experiment has been performed, and one theory of gravity 
after another has fallen by the wayside a victim of the observations, Einstein’s theory 
has stood firm. No purported inconsistency between experiment and Einstein’s laws 
of gravity has ever surmounted the test of time. 

Query: Why then bother to examine alternative theories of gravity? Reply: To 
have “foils” against which to test Einstein’s theory. 

To say that Einstein’s geometrodynamics is “battle-tested” is to say it has won 
every time it has been tried against a theory that makes a different prediction. How 
then does one select new antagonists for decisive new trials by combat? 

Not all theories of gravity are created equal. Very few, among the multitude in 
the literature, are sufficiently viable to be worth comparison with general relativity 
or with future experiments. The “worthy” theories are those which satisfy three 
criteria for viability: self consistency, completeness, and agreement with past experi¬ 
ment . 

Self-consistency is best illustrated by describing several theories that fail this test. 
The classic example of an internally inconsistent theory is the spin-two field theory 
of gravity [Fierz and Pauli (1939); Box 7.1 here], which is equivalent to linearized 
general relativity (Chapter 18). The field equations of the spin-two theory imply 
that all gravitating bodies move along straight fines in global Lorentz reference 
frames, whereas the equations of motion of the theory insist that gravity deflects 
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bodies away from straight-line motion. (When one tries to remedy this inconsistency, 
one finds oneself being “bootstrapped” up to general relativity; see route 5 of Box 
17.2.) Another self-inconsistent theory is that of Kustaanheimo (1966). It predicts 
zero gravitational redshift when the wave version of light (Maxwell theory) is used, 
and nonzero redshift when the particle version (photon) is used. 

Completeness: To be complete a theory of gravity must be capable of analyzing 
from “first principles” the outcome of every experiment of interest. It must therefore 
mesh with and incorporate a consistent set of laws for electromagnetism, quantum 
mechanics, and all other physics. No theory is complete if it postulates that atomic 
clocks measure the “interval” dr = {—g a p dx a dx&) 1/2 constructed from a particular 
metric. Atomic clocks are complex systems whose behavior must be calculated from 
the fundamental laws of quantum theory and electromagnetism. No theory is com¬ 
plete if it postulates that planets move on geodesics. Planets are complex systems 
whose* motion must be calculated from fundamental laws for the response of stressed 
matter to gravity. For further discussion see §§16.4, 20.6, and 40.9. 

Agreement with past experiment: The necessity that a theory agree, to within several 
standard deviations, with the “four standard tests” (gravitational redshift, perihelion 
shift, electromagnetic-wave deflection, and radar time-delay) is obvious. Equally 
obvious but often forgotten is the need to agree with the expansion of the universe 
(historically the ace among all aces of general relativity) and with observations at 
the more everyday, Newtonian level. Example: Birkhoff’s (1943) theory predicts the 
same redshift, perihelion shift, deflection, and time-delay as general relativity. But 
it requires that the pressure inside gravitating bodies equal the total density of 
mass-energy, p = p; and, as a consequence, it demands that sound waves travel with 
the speed of light. Of course, this prediction disagrees violently with experiment. 
Therefore, Birkhoff’s theory is not viable. Another example: Whitehead’s (1922) 
theory of gravity was long considered a viable alternative to Einstein’s theory, 
because it makes exactly the same prediction as Einstein for the “four standard tests.” 
Not until the work of Will (1971b) was it realized that Whitehead’s theory predicts 
a time-dependence for the ebb and flow of ocean tides that is completely contradicted 
by everyday experience (see §40.8). 


§39.2. METRIC THEORIES OF GRAVITY 

Two lines of argument narrow attention to a restricted class of gravitation theories, 
called metric theories. 

The first line of argument constitutes the theme of the preceding chapter. It 
examined experiment after experiment, and reached two conclusions: (1) spacetime 
possesses a metric; and (2) that metric satisfies the equivalence principle (the standard 
special relativistic laws of physics are valid in each local Lorentz frame). Theories 
of gravity that incorporate these two principles are called metric theories.* In brief, 
Chapter 38 says, “For any adequate description of gravity, look to a metric theory.” 


(2) completeness 


(3) agreement with past 
experiment 


Why attention focuses on 
metric theories of gravity 


*For a slightly narrower definition of metric theories, see Thorne and Will (1971). 
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How metric theories differ 


Weak-field, slow-motion 
expansion of a metric theory 


Exception: Cartan’s (1922b, 1923) theory [“general relativity plus torsion”; see 
Trautman (1972)] is nonmetric, but agrees with experiment and is experimentally 
indistinguishable from general relativity with the technology of the 1970 s. 

The second line of argument pointing to metric theories begins with the issue of 
completeness (preceding section). To be complete, a theory must incorporate a 
self-consistent version of all the nongravitational laws of physics. No one has found 
a way to incorporate the rest of physics with ease except to introduce a metric, and 
then invoke the principle of equivalence. Other approaches lead to dismaying 
complexity, and usually to failure of the theory on one of the three counts of 
self-consistency, completeness, and agreement with past experiment. All the theories 
known to be viable in 1973 are metric , except Cartan’s. [See Ni (1972b); Will (1972).] 

In only one significant way do metric theories of gravity differ from each other: 
their laws for the generation of the metric. In general relativity theory, the metric 
is generated directly by the stress-energy of matter and of nongravitational fields. 
In Dicke-Brans-Jordan theory (Box 39.1, p. 1070), matter and nongravitational fields 
generate a scalar field <J>; then <J> acts together with the matter and other fields to 
generate the metric. Expressed in the language of §38.7, </> is a “new long-range 
field” that couples indirectly to matter. As another example, a theory devised by 
Ni (1970, 1972) (Box 39.1) possesses a flat-space metric n and a universal time 
coordinate t (“prior geometry”; see §17.6); n acts together with matter and nongrav¬ 
itational fields to generate a scalar field <J>; and then /7, t , and <J> combine to create 
the physical metric g that enters into the equivalence principle. 

All three of the above theories—Einstein, Dicke-Brans-Jordan, Ni—were viable 
in the summer of 1971, when this section was written. But in autumn 1971 Ni’s 
theory, and many other theories that had been regarded as viable, were proved by 
Nordtvedt and Will (1972) to disagree with experiment. This is an example of the 
rapidity of current progress in experimental tests of gravitation theory! 

Henceforth, in this chapter and the next, attention will be confined to metric 
theories of gravity and their comparison with experiment. 


§39.3. POST-NEWTONIAN LIMIT AND PPN FORMALISM 


The solar system, where experiments to distinguish between metric theories are 
performed, has weak gravity, 

\0\ = (Newtonian potential] < 10~ 6 ; | (39.1a) 

moreover, the matter that generates solar-system gravity moves slowly 

v 2 — (velocity relative to solar-system center of mass) 2 < 10“ 7 (39.1b) 

and has small stress and internal energies 

\T jk \/p 0 = (stress divided by baryon “mass” density) < 10" 6 , 


„ v , /internal energy density per\ . 1A - 

17 = (p- p 0 )/p 0 - ( unit baryon “ mass ” density) ~ 


(39.1c) 
(39. Id) 
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[Here the baryon “mass” density p a , despite its name, and despite the fact it is 
sometimes even more misleadingly called “density of rest mass-energy,” is actually 
a measure of the number density of baryons n, and nothing more. It is defined as 
the product of n with some standard figure for the mass per baryon, p 0 , in some 
well-defined standard state; thus, 

Po = «Mol (39. le) 


Consequently, the analysis of solar-system experiments using any metric theory of 
gravity can be simplified, without significant loss of accuracy, by a simultaneous 
expansion in the small parameters \4>\, v 2 , \T jk \/p 0 , and 77. Such a “weak-field, 
slow-motion expansion” gives: (1) flat, empty spacetime in “zero order”; (2) the 
Newtonian treatment of the solar system in “first order”; and (3) post-Newtonian 
corrections to the Newtonian treatment in “second order”. 

The formalism of Newtonian theory plus post-Newtonian corrections is called the 
“post-Newtonian approximation.” Each metric theory has its own post-Newtonian 
approximation. Despite the great differences between metric theories themselves, 
their post-Newtonian approximations are very similar. They are so similar, in fact, 
that one can construct a single post-Newtonian theory of gravity, devoid of any 
reference to indirectly coupling fields (<f> in Dicke-Brans-Jordan; //, t, and <f> in Ni; 
see Box 39.1), that contains the post-Newtonian approximation of every conceivable 
metric theory as a special case. This all-inclusive post-Newtonian theory is called 
the “Parametrized Post-Newtonian (PPN) Formalism.” It contains a set of parameters 
(called “PPN parameters”) that can be specified arbitrarily. One set of values for 
these parameters makes the PPN formalism identical to the post-Newtonian limi t 
of general relativity; another set of values makes it the post-Newtonian limit of 
Dicke-Brans-Jordan theory, etc. 

Subsequent sections of this chapter present a version of the PPN formalism devised 
by Clifford M. Will and Kenneth Nordtvedt, Jr. (1972). [See also Will (1972).] This 
version, containing ten PPN parameters, encompasses as special cases nearly every 
metric theory of gravity known to the authors. The few exceptions [Whitehead (1922) 
and theories reviewed by Will (1973)] all disagree with experiment. One can include 
them in the PPN formalism by adding additional terms and parameters. 

The ten parameters are described heuristically in Box 39.2, for the convenience 
of readers who would skip the full details of the formalism (§§39.4-39.12). 

How accurate is the PPN formalism? Or, stated more precisely, how accurately 
does the posi-Newtonian approximation agree with the metric theory from which 
it comes? In the solar system, where \4>\, v 2 , \T jk \/p 0 , and Id are all < 10 -6 , the 
post-Newtonian approximation makes fractional errors of < 10~ 6 in quantities of 
post-Newtonian order, and fractional errors of < 10“ 12 in quantities of Newtonian 
order. For example, it misrepresents the deflection of light by < 10~ 6 x (post-New¬ 
tonian deflection) ~ 10~ 6 seconds of arc. And it ignores relativistic deformations of 
the Earth’s orbit of magnitude <10“ 12 x (one astronomical unit) ~ 10 centimeters. 
Clearly, there is no need in the 1970’s to use higher-order corrections to the post- 
Newtonian approximation; and hence no need to construct a “parametrized post- 
post-Newtonian framework.” However, in the words of Shapiro (1971b): “If one 
projects from the achievements in the last decade, it is not unreasonable to predict 


Post-Newtonian 

approximation 


PPN formalism 


Accuracy of PPN formalism 
in solar system 


{continued on page 1072) 
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39. OTHER THEORIES OF GRAVITY AND POST-NEWTONIAN APPROXIMATION 


Box 39.1 THE THEORIES OF DICKE-BRANS-JORDAN AND OF Nl 


A. Dicke-Brans-Jordan 


References: Brans and Dicke (1961); Jordan (1959). [Notes: This is the special 
case Tj = — 1 of Jordan’s theory. An alternative mathematical representation 
of the theory is given by Dicke (1962).] 

Fields associated with gravity: 

<f>, a long-range scalar field; 

g, the metric of spacetime (from which are constructed the covariant derivative 
y and the curvature tensors, in the usual manner). 

Equations by which these fields are determined: 

The trace of the stress-energy tensor generates <j> via the curved-spacetime wave 


equation 


□<f> = 4>> a ;a 


877 

3 + 2to 


i ; 


where to is the dimensionless “Dicke coupling constant ” 

The stress-energy tensor and <j> together generate the metric (i.e., the spacetime 
curvature) via the field equations 

G a0 =~T afl + - &*□*), 

where G afi is the Einstein tensor. 

Variational principle for these equations: 

sf [<f>R - «(<#>,„<#>’“/<#>) + 16tt L](-g) 1/2 d 4 x = 0, 

where R is the scalar curvature and L is the matter Lagrangian. 

Equivalence principle is satisfied: 

The special-relativistic laws of physics are valid, without change, in the local 
Lorentz frames of the metric g. 

Consequence: the scalar field does not exert any direct influence on matter; 
its only role is that of participant in the field equations that determine the 
geometry of spacetime. It is an “indirectly coupling field” in the sense of §38.7. 
This theory is self-consistent, complete, and for to > 5 in “reasonable” accord (two 
standard deviations or better) with all pre-1973 experiments. 


B. Ni 

References: Ni (1970, 1972) 

Fields associated with gravity: 

/j, a flat “background metric” (“prior geometry” in sense of §17.6). There exist, 
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by assumption, coordinate systems (“background Lorentz frames”) in which 
everywhere at once Voo = - l, Voj = 0, and r, jk = 8 jk . 
t, a scalar field called the “universal time coordinate” (“prior geometry” in sense 
of §17.6), which is so “tuned” to the background metric that 

fia/3 71*^ = U 

where denotes covariant derivative with respect to rj. 

This means there exists a background Lorentz frame (the “rest frame of the 
- universe”) in which x° = t. 

<t>, a scalar field called the “scalar gravitational field”. 

g, the metric of spacetime (from which are constructed the covariant derivative 
V and the curvature tensors, in the usual manner). 

Equations by which these fields are determined: 

The stress-energy of spacetime generates the scalar gravitational field <#> via the 
wave equation 


□</> = <#>’%= -2 vT* 3g a/3 /0<|> 

= 4irT a * [v ap e- 2 * + (e 2 * + e~^)t a tj. 

<f>, n, and t together determine the metric of spacetime through the algebraic 
relation 

9 = e-^n + (e~ 2 ^ - e 2 *) dt < 8 ) dt. 

Note: In the “rest frame of the universe” that is presupposed in this theory, 
this metric reduces to 

* 2 = g a/ 3 dx a dxfi = —e 2 ’ t> dt 2 + e- 2 *{dx 2 + dy 2 + dz 2 ). 

Variational principle for the field equation for <f>: 

3 J (— 2 <#>’“<#> „ + I 67 rL)(—g) 1/2 d 4 x = 0 , 

where L is the matter Lagrangian. 

Equivalence principle is satisfied: 

The special-relativistic laws of physics are valid, without change, in the local 
Lorentz frames of the spacetime metric g. 

Consequence: <f>, n, and t do not exert any direct influence on matter; they are 
“indirectly coupling fields” in the sense of §38.7. 

This theory is self-consistent and complete. If the solar system were at rest in the 
rest frame of the universe”, the theory would agree with all experiments to 
date—except, possibly, the expansion of the universe. But the motion of the 
solar system through the universe leads to serious disagreement with experiment 
(Will and Nordtvedt 1972; §40.8). 
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Box 39.2 HEURISTIC DESCRIPTION OF THE TEN PPN PARAMETERS j 


Para¬ 

meter 

What it measures, relative to general relativity* 

Value in 
General 
Relativity 

Value in 
Dicke-Brans- 
Jordan Theory b 

Value in 
Ni's 
Theory b 

y 

How much space curvature (g ; - fc ) ls produced by unit rest mass? 

1 

1 -f to 

2 + u 

1 

p 

How much nonlinearity is there in the superposition law for 
gravity (£,„,)? 

1 

1 

1 

Pi 

How much gravity (go,,) is produced bv unit kinetic enerav 
(|p„u 2 )? 

1 

3 + 2 to 

4 -f 2to 

1 

P2 

How much gravity (g TO ) is produced by unit gravitational potential 
energy (p 0 t/)? 

1 

1 + 2a: 

4 -F 2o) 

1 

Pi 

How much gravity (g 00 ) is produced by unit internal energy ( p 0 /7)? 

1 

1 

1 

P< 

How much gravity (g^) is produced by unit pressure (/?)? 

1 

1 -F to 

2 + cj 

1 


How much more gravity (g ()0 ) is produced by radial kinetic energy 
[^p 0 (i*-r) 2 ]— i.e., kinetic energy of motion toward obser¬ 
ver—than by transverse kinetic energy? 

0 

0 

0 

V 

How much more gravity (g 00 ) is produced by radial stress 
than by transverse stress? 

0 

0 

0 

A 

How much dragging of inertial frames ( g oj ) is produced by unit 
momentum (p 0 u)? 

1 

10 + 7a: 

14 -F 7a: 

7 

4 

How much easier is it for momentum (p 0 u) to drag inertial frames 
radially (toward the observer) than in a transverse direction? 

1 

1 

1 


a These heuristic descriptions are based on equations (39.23), 

b For expositions of these theories see Box 39.1. For derivation of their PPN values and of PPN values for other theories, see Ni (1972). 


J 


that in the 1980’s techniques will be available to detect second-order effects of general 
relativity. At that point the ratio of theoretical to experimental relativists may take 
a sharp turn downwards.” 

Actually, there are a few exceptions to the claim that the post-Newtonian approxi¬ 
mation suffices for the 1970’s. These exceptions occur where the external universe 
impinges on and influences the solar system. For example, gravitational waves 
propagating into the solar system from distant sources (Chapters 35-37) are ignored 
by every post-Newtonian approximation and by the PPN framework. They must 
be treated using a full metric theory or a weak-field, “fast-motion” approximation 
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to such a theory. Similarly, time-dependence of the “gravitational constant” (§40.8), 
induced in some theories by expansion of the universe, is beyond the scope of the 
PPN formalism, as is the expansion itself. 

The PPN formalism is used not only in interpreting experimental tests of gravita¬ 
tion theories, but also as a powerful tool in theoretical astrophysics. By specializing 
all the PPN parameters to unity, except f = rj = 0, one obtains the post-Newtonian 
approximation to Einstein’s theory of gravity. This post-Newtonian approximation 
can then be used (and has been used extensively) to calculate general relativistic 
corrections to such phenomena as the structure and stability of stars.* 


Historical and Notational Notes 

The earliest parametrizations of the post-Newtonian approximation were performed, 
and used in interpreting solar system experiments, by Eddington (1922), Robertson 
(1962), and Schiff (1962, 1967). However, they dealt solely with the vacuum gravita¬ 
tional field outside an isolated, spherical body (the sun). Nordtvedt (1968b, 1969) 
devised the first full PPN formalism, capable of treating all aspects of the solar 
— system; he treated the sun, planets, and moon as made from “gases” of point-particles 
(atoms) that interact gravitationally and electromagnetically. Will (1971c) later used 
techniques devised by Chandrasekhar (1965a) to modify Nordtvedt’s formalism, so 
that it employs a stressed, continuous-matter description of celestial bodies. The 
version of the formalism presented here. devised __bv Will and Nordtvedt (1972), 
'geireranzefall previous versions to acquire “post-Galilean invariance” [see Chandra¬ 
sekhar and Contopolous (1967)]. The most detailed and up-to-date review article 
on the PPN formalism is Will (1972). 

In the literature of post-Newtonian physics and the PPN formalism, the Newtonian 
potential is described traditionally not by but by 

_ ^Z i '=- + J £ w=lf‘ < 39 - 2 > 

To avoid confusion, this chapter and the next will use £/, although the rest of the 
book uses 0. 

Turn now to a detailed, Track-2 exposition of the PPN formalism. 


§39.4. PPN COORDINATE -SYSTEM 


The PPN formalism covers the solar system (or whatever system is being analyzed) 
with coordinates (r, x } ) = (r, x*) that are as nearly globally Lorentz as possible: 

Sap ~ ^ap "b ^ap’ \^ap\ £ ^©/^O ^ 6 - (39.3) 


Applications of PPN 
formalism to astrophysics 


History and notation of PPN 
formalism 



EXPOSITION OF PPN 
FORMALISM: 

Coordinate system 


* See, e.g.. a long series of papers by Chandrasekhar and his associates in the Astrophysical Journal, 
beginning with Chandrasekhar (1965a.b,c). 
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- ^ 

The rest of this chapter is 
Track 2. No earlier Track-2 
material is needed as 
preparation for it, but the 
following will be helpful: 

(1) Chapter 7 (incompatibility 
of gravity and special 
relativity) 

(2) §17.6 (no prior geometry); 

(3) §§36.9-36.11 (generation 
of gravitational waves); and 

(4) Chapter 38 (tests of 
foundations). 

This chapter is not needed as 
preparation for any later 
chapter, but it will be helpful 
in Chapter 40 (solar-system 
tests) 

_ J 


(In this sense the PPN formalism is like linearized theory; see Chapter 18.) The 
velocity of the coordinate system (i.e., 4-velocity of its spatial origin) is so chosen 
that the solar system is approximately at rest in these coordinates. (Whether the 
center of mass of the solar system is precisely at rest, or is moving with some low 
velocity v < (M 0 /R 0 ) 1/2 ~~ 10“ 3 — 300 km/sec, is a matter for the user of the 
formalism to decide. For more on the options, see §§39.9 and 39.12.) 

The PPN coordinates provide one with a natural “3 -f 1” split of spacetime into 
space plus time. That split is conveniently treated using the notation of three-dimen¬ 
sional, flat-space vector analysis—even though spacetime and the three-dimensional 
hypersurfaces x° = constant are both curved. The resultant three-dimensional for¬ 
malism will look more like Newtonian theory than like general relativity—as, indeed, 
one wishes it to; after all, one’s goal is to study small relativistic corrections to 
Newtonian theory! 


§39.5. DESCRIPTION OF THE MATTER 
IN THE SOLAR SYSTEM 


Description of matter Relative to the PPN coordinates, the matter of the solar system (idealized as a 

stressed medium) has a coordinate-velocity field 

v j = dxj/dx 0 . (39.4) 

Choose an event i?, and in its neighborhood transform to an orthonormal frame 
that moves with the matter there. Orient the spatial axes ej of this comoving frame 
so that they coincide as accurately as possible with the PPN coordinate axes. (This 
requirement will be made more precise in §39.10.) In the orthonormal comoving 
frame, define the following quantities, which describe the state of the matter; 


(density of total mass-energy) = p; 

(baryon “mass” density) = p 0 

( number density \ /standard rest mass per baryon, p 0 , 
of baryons, n / X \for matter in some standard state 

(specific internal energy density) = II = (p — p 0 )/p 0 ; 

(components of stress tensor) = = e* • 7- ey, 

(pressure) =p=~ (t- + j + t-) 

== (average of stress over all directions). 


(39.5 a) 

(39.5 b) 

(39.5c) 
(39.5 d) 

(39.5 e) 


Anisotropies (i.e., shears) in the stress are important only in planets such as the 
Earth; and even there they are dominated by the isotropic pressure: 


t V) =p8 ij +px [corrections < 1]. 


(39.6) 
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For many purposes, especially inside the sun, one can ignore the anisotropies, thereby 
approximating the solar-system matter as a perfect fluid.* 

The isotropic part of the radiation field gives a significant contribution to the 
pressure, p 9 and the density of internal energy, p 0 77, inside the sun. However, the 
anisotropic radiation flux is ignored in the stress-energy tensor. This approximation 
is allowable because in the sun the outward energy flux carried by radiation is less 
than 10~ 15 of the internal energy density p 0 77; in planets it is even less. 


§39.6. NATURE OF THE POST-NEWTONIAN EXPANSION 

For any gravitationally bound configuration such as the solar system, the Newtonian 
approximation imposes limits on the sizes of various dimensionless physical quanti¬ 
ties (sfee exercise 39.1): 

€ 2 = maximum value of Newtonian potential U (39.7) 

^ Xwalues-anywhere of U, v 2 9 p/p Q9 \t Vj \/p 0 , 77. 

(The Newtonian potential at the center of the sun is e 2 ~~ 10" 5 . The values of p/p 0 , 
hj! Po) an d IJ there are also "^lO* 5 , and they are much smaller elsewhere. The orbital 
velocities of the planets are all less than 100 km/sec = 3 X 10~ 4 , so v 2 < 10" 7 .) 
Moreover, changes with time of all quantities at fixed x j are due primarily to the 
motion of the matter. As a result, time derivatives are small by 0(c) compared to 
space derivatives, 


dA/dt 

dA/dXj 


~~ \v-\ < c for any quantity A , 


(39.8) 


although not in the radiation zone, where outgoing gravitational waves flow (dis¬ 
tance > one light year from Sun). Consequently, the radiation zone must be excluded 
from the analysis when one makes a post-Newtonian expansion. To treat it requires 
different techniques, e.g., those of Chapter 36. 

Conditions 39.7 and 39.8 suggest that one expand the metric coefficients in powers 
of the small parameter e, treating {/, v 2 , p/p 0 , f i; /p 0 , and 77 as though they were 
all of 0(e 2 ) (often they are smaller!), and treating time derivatives as 0(e) smaller 
than space derivatives. 

In this “post-Newtonian” expansion, terms odd in e (i.e., terms such as 


r p 0 (x'> t)Vj(x\ 0 

J \x - x'\ 


d 3 x' 


M 

~R l 


(39.9) 


whose total number of i/s and (0/9r)’s is odd) change sign under time reversal, 


*In the solar system, post-Newtonian corrections due to anisotropic stresses are so much smaller than 
other post-Newtonian corrections that there is no hope of measuring them in the 1970’s. For this reason, 
elsewhere in the literature (but not in this book) the PPN formalism treats all stresses at the post-New¬ 
tonian level as isotropic pressures, thereby setting to zero the PPN parameter tj of §§39.8-39.11. 


Relative magnitudes of 
expansion parameters 


Rules of the expansion 
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Expanded form of metric 


EXERCISES 


whereas terms even in c do not. Time reversal (x° = —x°) also changes the sign 
of g 0j (goj = -g 0 ,X but * eaves goo and gjk unchanged. Therefore, g 0j must contain 
only terms odd in e; whereas g 00 and g jk must contain only even terms . (Actually, 
this ceases to be the case when radiation damping enters the picture. In the real 
world-one always insists on outgoing-wave boundary conditions. But time reversal 
converts outgoing waves to ingoing waves; so an extra sign change is required to 
convert back to out. Therefore, radiation damping terms in the near-zone metric 
are even in e for g 0j , but odd for g 00 and g jk . However, radiation damping does 
not come into play until order c 5 beyond Newtonian theory—see Chapter 36—so 
it will be ignored here.) 

The form of the expansion is already known through Newtonian order (see §17.4, 
with <P replaced by — U)\ Newtonian gravity is only obtained when one demands 
that 

goo = “ 1 + 2U + [terms < c 4 ], 

g 0 j = [terms < c 3 ], (3 9.10) 

gij = + [terms < c 2 ]. 

The stated limits on the higher-order corrections are dictated by demanding that 
the space components of the geodesic equation agree with the Newtonian equation 
of motion: 


d 2 Xj _ d 2 Xj dx a dx& _ dx a dx^ 

a0 dr dr ~ dt dt 

= —r’oo ~ 0k v k ~ r*kl v k v l 

= Uj + terms of order {eg 0k /, ^gkij)- 


(39.11) 


One would get the wrong Newtonian limit if g ok were 0(c) or greater, and if g kl — S kl 
were 0(1) or greater. 

The above pattern continues on to all orders in the expansion. Thus in the geodesic 
equation, and also in the law of local conservation of energy-momentum T a & = 0, 
g 00 always goes hand-in-hand with ig 0k and c 2 g jk (see exercise 39.2). Therefore, the 
post-Newtonian expansion has the form summarized in Box 39.3. 


3 

Exercise 39.1. ORDERS OF MAGNITUDE IN GRAVITATIONALLY 
BOUND SYSTEMS 

Use Newtonian theory to derive conditions (39.7) for any gravitationally bound system. [Hint: 
Such concepts as orbital velocities, the speeds of sound and shear waves, the virial theorem, 
and hydrostatic equilibrium are relevant.] 

Exercise 39.2. PATTERN OF TERMS IN POST-NEWTONIAN EXPANSION 

Verify the statements in the paragraph following equation (39.11). In particular, suppose 
that one wishes to evaluate the coordinate acceleration, d 2 x j /dt 2 , to accuracy for some 
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integer N. Show that this undertaking requires a knowledge of to accuracy c 2 * 4-2 , of 
gok t0 f 2N+1 y an d °f gjk t0 t2N ' Also suppose that one knows T 00 to accuracy p^ 2 *, T 0i to 
Po c2N+1 y an d T jk to p 0 ^ 2N+2 [see, e.g., equations (39.13) for N = 0 and (39.42) for N = 2]. 
Show that to calculate T 0a . a with accuracy t 2N+1 p oj and T ia . a with accuracy c 2N+2 p of , 
one must know g 00 to c 2JV + 2 , g Qk to c 2J ' r+1 , and g jk to c 2 ^. This dictates the pattern of Box 
39.3. 


§39.7. NEWTONIAN APPROXIMATION 

At Newtonian order the metric has the form (39.10); and the 4-velocity and stress- Newtonian approximation 
energy tensor have components, relative to the PPN coordinate system, 

r M° = 4-1 + 0(e 2 ), = v j + 0(€ 3 ); 

T 00 = p 0 + 0( Po € 2 ), T°> = p oVj + 0(p 0 e 3 ), 

T’ k = ty k + p 0 v j v k + O(p 0 € 4 ) 

(see exercise 39.3). Two sets of equations govern the structure and evolution of the 
solar system. (1) The Einstein field equations. As was shown in §18.4, and also in 
§17.4, in the Newtonian limit Einstein’s equations reduce to Laplace’s equation 

Uji = —4 *p 0 , (39.14a) 


(39.12) 

(39.13) 


which has the “action-at-a-distance” solution 

U{x, t)=f ^ 2 - dH\ (39.14b) 


Box 39.3 POST-NEWTONIAN EXPANSION OF THE METRIC COEFFICIENTS 


Level of approximation 
{and papers expanding general 
relativity to this level) 


Order or value of terms 
9oo 9 0i 


[Chandrasekhar and Esposito (1970)] 


9 * 


flat, empty spacetime 

-1 

0 

#* 

Newtonian approximation 

2U 

0 

0 

post-Newtonian approximation 
[(Fock (1959); Chandrasekhar (1965a)] 

4- terms — c 4 

4- terms — c 3 

4- terms — e 2 

post-post-Newtonian approximation 
[Chandrasekhar and Nutku (1969)] 

4- terms — e 6 

4- terms — c 5 

4- terms — e 4 

radiation damping 

4- terms — c 7 

4- terms — c 6 

4- terms — c 5 
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EXERCISES 


(2) The law of local energy-momentum conservation, T a & ;/8 = 0. The time compo¬ 
nent of this law reduces to the conservation of rest mass 

dp 0 /dt + d(p 0 Vj)/dXj — 0 + fractional errors of 0(c 2 ); (39.15a) 

and the space components reduce to Newton’s second law of motion, “F = ma 

p 0 dv j /dt = p 0 (dU/dXj) — dty k /dx k 4- fractional errors of 0(c 2 ), (39.15b) 
d/dt = (time derivative following the matter) = d/dt + v k d/dx k (39.16) 

(see exercise 39.3). 

Equations (39.14)-(39.16), together with equations of state describing the planetar)' 
and solar matter, are the foundations for all Newtonian calculations of the structure 
and motion of the sun and planets. Notice that the internal energy density p 0 TI 
nowhere enters into these equations. It is of no importance to Newtonian hydrody¬ 
namics. It matters for the sun’s thermal-energy balance; but that is irrelevant here. 


Exercise 39.3. NEWTONIAN APPROXIMATION 

(a) Derive equations (39.13) for the components of the stress-energy tensor in the PPN 
coordinate frame. [Hint: In the rest frame of the matter (“comoving orthonormal frame”) 
Tfa = p = p 0 + 0(€ 2 ), T 6] = 0, Tft = see equations (39.5). Lorentz-transform these 
components by a pure boost with ordinary velocity — Uj to obtain 
(b) Show that, in the PPN coordinate frame, T 0a ;a = 0 reduces to equation (39.15a), and 
T ja . a = 0, when combined with (39.15a), reduces to equation (39.15b).] 

Exercise 39.4. A USEFUL FORMULA 

Derive from equations (39.15) the following useful formula, valid for any function f(x y t): 

^ f Po( x > Of(x, 0 d 3 x = J pjx, t) — d 3 x (39.17) 

4* fractional errors of 0(c 2 ). 

Here both integrals are extended over all of space; and df/dt is the derivative following 
the matter (39.16). 

Exercise 39.5. STRESS TENSOR FOR NEWTONIAN GRAVITATIONAL FIELD 

Define a “stress tensor for the Newtonian gravitational field U ” as follows: 

< 39 - 18 ) 

Show that the equations of motion for the matter (39.15b) can be rewritten in the forms 

(t,* + l ik) + fractional errors of 0(c 2 ), (39.19) 

(Po v >),t + ( f j* + ‘jk + Po v j v k),k = 0 + fractional errors of 0(c 2 ). (39.19') 
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Exercise 39.6. NEWTONIAN VIRIAL THEOREMS 

(a) From equation (39.19') show that 


d 2 I jk /dt 2 = 2 J (t jk + + p 0 VjV k )d 3 x + fractional errors of 0(e 2 ), (39.20a) 

where I jk is the second moment of the system’s mass distribution, 

hk = f Po x i x k d 3 x. 

This is called the “time-dependent tensor virial theorem” 

(b) From this infer that, if { >i ongtiine denotes an average over a long period of time, 
then 

, (J«* + '» + Po V j V k) ^longtime = 0 (f V" *“*)• (39.20b) 

This is called the “ tensor virial theorem 

(c) By contraction of indices and use of equations (39.18), (39.14a), and (39.5e), derive 
the (ordinary) virial theorems: 

- jd 2 I/dt 2 = J p 0 v 2 d 3 x — J'ypot/ d 3 x + 3 Jp d 3 x 4* fractional errors of 0(e 2 ), (39.21a) 

where 1 is the trace of the second moment of the mass distribution 

'=4 = f Pa' 2 d 3 x; 

and 

(fpo^x - jjpo^x + 3fpd°‘x ) loagtime =o(fpo< 4 <t 3 *). (39.21b) 

2 x / kinetic \ + /gravitational\ + ^ /pressure \ 

Venergy/ \energy / X \integral/ 


Exercise 39.7. PULSATION FREQUENCY FOR NEWTONIAN STAR 

Use the ordinary, time-dependent virial theorem (39.21a) to derive the following equation 
for the fundamental angular frequency of pulsation of a nonrotating, Newtonian star: 


, 2 _ 


= (3 A - 4) 


I star’s self-gravitational energy| 


(trace of second moment of star’s mass distribution) 


( pressure-weighted average\ _ fr t p d 3 x 
\of adiabatic index / “ d 3 x ' 


(39.22a) 

(39.22b) 


In the derivation assume that the pulsations are “homologous”—i.e., that a fluid element 
with equilibrium position x> (relative to center of mass x’ = 0) gets displaced to x> + &(x, t\ 
where 


= (small constant)x>e _iw *. 


Assume nothing else. Notes: (1) The result (39.22) was derived differently in Box 26.2 
and used in §24.4. (2) The assumption of homologous pulsation is fully justified if 
l-Ti — 4/3) = constant < 1; see Box 26.2. (3) The result (39.22) is readily generalized to slowly 
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rotating Newtonian stars; see, e.g., Chandrasekhar and Lebovitz (1968). It can also be 
generalized to nonrotating post-Newtonian stars using general relativity (Box 26.2), or using 
the PPN formalism for any metric theory [Ni (1973)]. And it can be generalized to slowly 
rotating, post-Newtonian stars [see, e.g., Chandrasekhar and Lebovitz (1968)]. 


§39.8. PPN METRIC COEFFICIENTS 


Post-Newtonian corrections to 
metric: 


(1) rules governing forms 


(2) construction of 
corrections 


The post-Newtonian corrections k af3 to the metric coefficients g a p are calculated, 
in any metric theory of gravity, by lengthy manipulations of the field equations. 
(See, e.g., exercise 39.14 near the end of this chapter for general relativity.) But 
without ever picking some one theory, and without ever writing down any set of 
field equations, one can infer the forms of the post-Newtonian corrections k a p. Their 
forms are fixed by the following constraints: (1) They must be of post-Newtonian 
order (k 00 — € 4 , k 0j — e 3 , k {j — e 2 ). (2) They must be dimensionless. (3) k 00 must 
be a scalar under rotations, k 0j must be components of a 3-vector, and k jk must 
be components of a 3-tensor. (4) The corrections must die out at least as fast as 
1 /r far from the solar system, so that the coordinates become globally Lorentz and 
spacetime becomes flat at r = oo. (5) For simplicity, one can assume that the metric 
components are generated only by p 0 , p 0 /7, t% 9 p , products of these with the velocity 
Vp and time-derivatives of such quantities;* but not by their spatial gradients. [This 
assumption of simplicity is satisfied by all metric theories examined up to 1973, 
except Whitehead (1922) and theories reviewed by Will (1973)—which disagree with 
experiment.] Note the further justification for this assumption in exercise 39.8. 

Begin with the corrections to the spatial components, k xi — e 2 . There are only two 
functionals of p 0 , /?, 77, fj*, v j9 that die out at least as fast as 1/r, are dimensionless, 
are 0(e 2 ), and are second-rank, symmetric 3-tensors; they are 


t ); U^ix, 0 = f 


Po(x', t)(Xj - x i ')(x 1 - x/) 
|x-x '| 3 


d 3 x'. (39.23a) 


Thus, k t j must be k {j = 2y U + ir Uy, for some constant “PPN parameters” y and 
r. By an infinitesimal coordinate transformation [* iNEW = * i0 LD + r s X/9*i> with 
X(x, 0 = —fp 0 (x’, /)|jc — jc'| d 3 x'] one can set r = 0, thereby obtaining 

gii = « i# + A : i} = S v ( 1 + 2y U) + 0(c 4 ). j(39.23b) 


•One allows for time derivatives because retarded integrals contain such terms when expanded to 
post-Newtonian order; thus, 

p f- Ijc — JC'I) _ rf pA',0 g P.( J V2 + ...W. 
j (or — jc'i J L ]JC - JC'I dt J 

However, it turns out that, with a suitable choice of coordinates (“gauge”)* a U time-odd retarded terms 
[e.g., f(dp 0 /dt) <Px] vanish, except at “the post 5/2 -Newtonian order” and at higher orders of approxima¬ 
tion; there they lead to radiation damping (see Box 39.3). For example, f(dp 0 /dt) cPx = {d/dt)jp 0 cPx 
vanishes by virtue of the conservation of baryon number. 
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Next consider k 0j ~ e 3 . Trial and error yield only two vector functionals that die 
out as 1/r or faster, are dimensionless, and are 0(i 3 ). They are 

J \x - x'\ 

W M ,,) = (3 9.23d) 

J j JC — x'\ 6 


Thus, k 0 j must be a linear combination of these, involving unknown constants (PPN 
parameters) A 1 and A 2 \ 

go, = k 0i = - \ A V, - | A 2 w t + 0(e 3 ). (39.23 e) 


Finally consider k 00 — c 4 . Trial and error yields a variety of terms, which can all 
be combined together with the Newtonian part of g 00 to give 

g 00 = -1 + 2U + k 00 = -1 + 2U - 2pU 2 + 4* - la - i|®, (39.23f) 

where 




* = /^v 2 + /8 2 C7 + 1 0 3 77 + | P iP /p 0 


#(*, 0 = / 


(39.23g)* 


(39.23h) 


[ fjfc(x\ 0 - 4 Vu(*'* ol - x/)(x fc - x fc 0 

L J -</**'. (39.23i) 


JC — X 


Also, /?, /? l5 /? 2 , /? 3 , /? 4 , f, 7] are unknown constants (PPN parameters). Elsewhere 
in the literature the term — tj<2) in g 00 is ignored (see footnote on p. 1075). 

Yet another term is possible: one could have pet 


g 00 = [value in equation (39.23f)] 

' P 0 ( x ', 0 Po(*"’ 0[(x - X') • (*' 


--IF 


*")] 


\X - X Lr' 


r"|3 


a 3 x' d 3 x", (39.24) 


where S is another PPN parameter. [It can be shown, using the Newtonian equations 
(39.14)-(39.16), that this expression dies out as 1/r far from the solar system.] If 

* WARNING: Throughout the literature the notation <P is used where we use ¥ for the functional 
(39.23g), and <f> is used for our We are forced to violate the standard notation to avoid confusion 
with the Newtonian potential <P = —U. However, we urge that nobody else violate the standard notation! 


1082 


39. OTHER THEORIES OF GRAVITY AND POST-NEWTONIAN APPROXIMATION 


Rigidity of coordinate system 


Summary of PPN metric and 
parameters 


EXERCISES 


such a 2 term had been included, then one could have removed it by making the 
infinitesimal coordinate transformation 


^ ^ - y 2f P -^ *?/ ^ fe- (3925) 


(see exercise 39.9). Thus, there is no necessity to include the 2 term. 

The absence of the 2 term from g 00 means that the time coordinate has been 
fixed rigidly up through post-Newtonian order: 

x° has uncertainties only of 0(R Q t 5 ) — 10~ 14 seconds. (39.26a) 
The space coordinates are also fixed rigidly through post-Newtonian order: 

x j has uncertainties only of 0(R Q c 4 ) — 0.1 cm, (39.26b) 

because any transformation of the form 

x* new = x ? old + position-dependent terms of 0(z 2 R e ) 

would destroy the form (39.23b) of the space part of the metric. 

In summary , for almost every metric theory of gravity yet invented, accurate 
through post-Newtonian order the metric coefficients have the form (39,23). One 
theory is distinguished from another by the values of its ten “post-Newtonian 
parameters” ft, /? 2 , /? 3 , /? 4 , y, f, tj, A 1 and A 2 . These are determined by comparing 
the field equations of the given theory with the form (39.23) of the post-Newtonian 
metric. The parameter values for general relativity and for several other theories 
are given in Box 39.2, along with a heuristic description of each parameter. 


Exercise 39.8. ABSENCE OF "METRIC-GENERATES-METRIC" TERMS IN 
POST-NEWTONIAN LIMIT 

In writing down the post-Newtonian metric corrections, one might be tempted to include 
terms that are generated by the Newtonian potential acting alone, without any direct aid 
from the matter. After all, general relativity and other metric theories are nonlinear; so the 
two-step process (matter) —U —(post-Newtonian metric corrections) seems quite 
natural. Show that such terms are not needed, because the equations (39.14)-(39.16) of the 
Newtonian approximation enable one to reexpress them in terms of direct integrals over 
the matter distribution. In particular, show that g 


i 


a 2 U(x', t)/dXf dt 

I* “ *1 


d 3 x’ — 2ir[V f (x, t) — 


(3927) 


where V, and are defined by equations (39.23c,d); also show that 


I 


[9 U(x’, Q/dxjJdUjx^ Q/ dxT\ iV 

I* “ *1 * 


-2w [U{x, ()J* + 4v f - ° (X '’ t)U( y C) d 3 x'. (39.28) 
J \x- x'\ 
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Note that the terms on the righthand sides of (39.27) and (39.28) are already included in 
the expressions (39.23e,f) for g oj and g 00 . 

Exercise 39.9. REMOVAL OF 2 TERM FROM g M 

Show that the coordinate transformation (39.25) removes the 2 term from the metric 
coefficient g 00 of equation (39.24), as claimed in the text. 

Exercise 39.10. VERIFICATION OF FORMS OF POST-NEWTONIAN 
CORRECTIONS 

Verify the claims in the text immediately preceding equations (39.23a,b,c,f). 


§39.9. VELOCITY OF PPIM COORDINATES RELATIVE TO 
c "UNIVERSAL REST FRAME" 


Thus far it has been assumed tacitly that the center of mass of the solar system 
is at rest in the PPN coordinate system. Is this really a permissible assumption? Put 
differently, can one always so adjust the PPN coordinate system that its origin moves 
with any desired velocity (e.g., that of the solar system); or is the PPN coordinate 
system rigidly and irrevocably attached to some “universal rest frame”? 

In general relativity, the geometry of curved spacetime picks out no preferred 
coordinate frames (except in cases with special symmetry). Therefore, one expects 
the velocity of the PPN coordinate frame to be freely specifiable. Put differently, 
one expects the entire PPN formalism, for general relativity, to be invariant under 
Lorentz transformations of the PPN coordinates [combined, perhaps, with “in¬ 
finitesimal coordinate transformations” to maintain the gauge conditions that the 
“2” and terms of (39.24) and (39.23a) be absent]. By contrast, in Ni’s theory 
of gravity (Box 39.1) the geometry of spacetime always picks out a preferred coordi¬ 
nate frame: the “rest frame of the universe.” One would not be surprised, in this 
case, to find the PPN coordinate frame rigidly attached to the universal rest frame. 

The above intuition is correct, according to calculations by Will (197Id) and by 
Will and Nordtvedt (1972). When dealing with general relativity and other theories 
with little or no “prior geometry,” one can freely specify the velocity of the PPN 
coordinate system (at some initial instant of time). But for theories like Ni’s, with 
a preferred “universal rest frame” (“preferred-frame theories”), only in the preferred 
frame can the post-Newtonian metric assume the form derived in the last section 
[equations (39.23)]. This restriction on the PPN metric does not mean that one is 
confined, in preferred-frame theories, to perform all calculations in the universal 
rest frame. Rather, it means that for such theories the PPN metric requires generali¬ 
zation to take account of coordinate-frame motion relative to the universal rest frame. 

The required generalization can be achieved by subjecting the PPN metric (39.23) 
to (1) a Lorentz boost from the preferred frame {xg LD } to a new PPN frame {x“ Ew ), 
which moves with velocity »’, plus (2) a change of gauge designed to keep the metric 
coefficients as simple as possible. The boost-plus-gauge-change is [Will and Nordt¬ 
vedt (1972)] 


Preferred-frame theories of 
gravity 


Generalization of PPN metric 
to moving frames 
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Final form of metric 


x old = %ew + y( x NEw * H ’) w + + y w ’ 2 j H ’^NEw (39.29a) 

+ 0(c 5 f NEW + c4 %ewX 

^OLD = ^NEW + y w2 + y M ' 4 j + (l + y W ’ 2 j X NEW * M * 

+ (y ^2 + f “ 0 W >T7 ^^ 6 ^new f5 %Ew)’ (39.29b) 

/ ^NEW 

^—[gauge change] 

X(*NEW> %Ew) = f Po(* NEW’ Jt NEw)i X NEW ” X NEwl ^^NEW* (39.29c) 


[Note: One insists, in the spirit of the post-Newtonian approximation, that the velocity 
w of the new PPN frame relative to the universal rest frame be no larger than the 
characteristic internal velocities of the system: 

M < €.] (39.30) 


This change of coordinates produces corresponding changes in the velocity of the 
matter 


__ f^OLP 
^OLD 


V NEW^ “ W * V NEW — 2 

+ M ’( 1 - y w v NEW ^ + 0(e 5 ). 


(39.31) 


A long but straightforward calculation (exercise 39.11) yields the following compo¬ 
nents for the metric in the new PPN coordinates. [Note: The subscripts NEW are 
here and hereafter dropped from the notation.] 


So; 


g jk = S jk (l + 2y U)+ 0(€ 4 ), 

= - y Aj^Vf - y A 2 W f + (ol 2 -ja^WjU - a 2 w k U kj + 0 (t 5 ), 


(39.32a) 

(39.32b) 


goo = -1 + 2U - 2/3U 2 + 4* 

+ (a 2 + a 3 - a 1 )w 2 U + (2a 3 - aJWjVj - a 2 WjW k U jk + 0(« 6 ). (39.32c) 

5 

Here a 1( a 2 , and a 3 are certain combinations of PPN parameters 

a k = 7 A k +A 2 -4y-4, (39.33a) 

a 2 = A 2 + S — 1, (39.33b) 

ot 3 — 4fi k — 2y — 2 l (39.33c) 


The “gravitational potentials” U, V p W p T, a, and 9 appearing here are to be 
calculated in the new, “moving” PPN coordinate system by the same prescriptions 



§39.9. VELOCITY OF PPN COORDINATE FRAME 


1085 


as one used in the universal rest frame. Thus, their functional forms are the same 
as previously, but their values at any given event are different (see exercise 39.11): 


J |r- x | 

Vfx, ,, 

J |x — ar'l 

W{x t) = f P °^'’ ~ f21 v( - x ’’ ~ */) d3x ' . 

’ ’ J |r — r'l 3 


*(X, t) = f 


\X — X 

pjy, wx, o 

\x - JC'I 


d 3 x\ 


+ = Pi* 2 + P 2 U + ±f3 3 n + jP 4 p/ Po ; 


a{x, t) = f 


p 0 ( x '> OK* - *') • K*', OP 


\X — X 


'13 


d 3 x'; 


®(x, t) — J 


[*&(*'. 0 0 ( x i - */)(** - V) 


(39.34a) 

(39.34b) 

(39.34c) 


(39.34d) 

(39.34e) 

d 3 x\ (39.34f) 


\x — x'| 3 

f 

The quantity U jk is the gravitational potential defined in equation (39.23a); 


U (x A - f P ° (X '’ t)(X i ~ X P (X * Z <1 d 3y' 

M ’ ’ ~ J Ijc - jt'I 3 ' 


(39.34g) 


Notice that the velocity w of the PPN coordinate system relative to the universal 
rest frame appears explicitly in the PPN metric only if one or more of the coefficients 
a v a 2 , a 3 , is nonzero. Thus, theories with cq = a 2 = a 3 = 0 (e.g., general relativity) 
possess no preferred universal rest frame in the post-Newtonian limit; all their PPN 
frames are “created equal.” By contrast, theories with at least one of a v a 2 , a 3 , 
nonzero (e.g., Ni’s theory) do possess a preferred frame. 

The generalized form (39.32) of the PPN metric, by virtue of the process used 
to construct it, is invariant under a Lorentz boost plus a gauge adjustment [“Post- 
Galilean transformation”; see Chandrasekhar and Contopolous (1967)]: 


X OLD “ %EW + 2 ( X NEW ' P)P + + y/^ 2 j^ ? NEW 

+ 5 ^new + €4x new)’ (39.35) 

? OLD = + y + 1 " ^ 4 ) ?NEW + ( 1 + ^ 2 ) Xnew ’ ^ 

(y 4 + ^1 ~ ’ ^NEwX + 6 ^NEW + e °*NEw)* 


Post-Galilean invariance 


Of course, it is also invariant under spatial rotations. 
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EXERCISE 


Exercise 39.11. TRANSFORMATION TO MOVING FRAME 

Show that the change of coordinates (39.29) changes the PPN metric coefficients from the 
form (39.23) to the form (39.32). [Hints: (1) Keep firmly in mind the fact that the potentials 
U, Vp Wp d , and are not scalar fields. Each coordinate system possesses its own potentials. 
For example, by using equations (39.29) in the integral for U QLD , one finds 

^old( x old’ ^old) — §ff ° LD ’ v °/ LD ^i ^ 3 *old (39.36) 

i^old ” ^oldI 

= [^NEW — W j( ^'NEW ™ ^jNEw) + 2* H j + 0(0* 

x new/new 

(2) The law of baryon conservation (39.44) may be useful.] 


§39.10. PPN STRESS-ENERGY TENSOR 


The motion of the solar system is governed by the equations T a P.p = 0. Before 
studying them, one must calculate the post-Newtonian corrections to the stress-energy 
tensor in the PPN coordinate frame. This requires a transformation from the comov¬ 
ing, orthonormal frame o/ a , where 

r 66 = Po (l + 77), T 6 ’ = 0, 7* = t ]k , (39.37) 

to the coordinate frame. One can effect this transformation in two stages: stage 2 
is a transformation 


«/° = [l - U+ 0(e 4 )] dt (39.38a) 

+ [t AlV ’ + 7 AzW > + (j“* “ “ 2 ) w > u + u v + °( £5 )] dx ’> 

ufi = [(1 + yU) S jk + 0(£ 4 )] dx k + 0(c 5 ) dt, (39.38b) 


between the coordinate frame and an orthonormal frame attached to it; stage 1 is 
a pure Lorentz transformation (boost) between the two orthonormal frames cu“ and 
cu a . The 4-velocity of this boost is minus the 4-velocity of the matter, which has 
components j 

u j — i>jU°, u° = 1 + y v 2 + U + 0(0 in coord, frame; (39.39) 


J = vju*, = 1 + ± v 2 + 0(0, 

vj= ^[1 + (1 + y)U] 


in cu a frame. (39.40) 


Combining the boost, which has ordinary velocity = — vj, with the transformation 
(39.38), and then inverting, one obtains the result (exercise 39.12) 
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dx a _ ( w ^ = orthonormal comoving basis, 

l dx“ — PPN coordinate basis; 

= 1 +^v 2 + U+ 0(t% 

A °i = + y v 2 + (2 + Y )t/j - y -j^Wf 

+ («2 - Y«i) w,U - <x 2 w k u kj + 0(e 5 ), (39.41) 

a ’6 = o # [l + y v 2 + t/J + 0(e 5 ), 

A \ = (1 - yU) S jk + 1 Vj v k + 0 (£ 4 ). 

This transformation, when applied to the stress-energy tensor (39.37) yields, in the 
PPN coordinate frame, 


r 00 = Po ( i + n + v 2 + 2U) + o ( Po € 4 ), 

(39.42a) 

TOi = P.0 +n+v* + 2 U)Vj + t ]rn v m + O(p 0 e 5 ), 

(39.42b) 

T* = t- jh ( 1 - 2 yU) + p 0 ( 1 + n + v* + 2 U)v,v k 


+ y ( v i‘km v m + V k tfaVj + 0( P Q C 6 ). 

(39.42c) 


Exercise 39.12. THE TRANSFORMATION BETWEEN COMOVING FRAME 
AND PPN FRAME 

Carry out the details of the derivation of the transformation matrix (39.41); and in the process 
calculate the correction of 0(c 4 ) to A 0 g. 


§39.11. PPN EQUATIONS OF MOTION 

The post-Newtonian corrections to the Newtonian equations of motion (39.15) and 
(39.16) are derived from the law of conservation of baryon number ( p 0 u a ). a — 0, 
and from the law of conservation of local energy-momentum, T af3 . p = 6. The 
simplest of the equations of motion is the conservation of baryon number. Its exact 
expression is (p 0 u a ). a — (1/ V—g)(V— gP 0 ua ), a = 0- Define a new quantity 

P* = P„( 1 +yi> 2 + 3yt/) 

= + 0(P„* 4 ) 


Transformation from rest 
frame of matter to PPN 
coordinate frame 


Stress-energy tensor in 
coordinate frame 


EXERCISE 


(39.43) 
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Law of baryon conservation 


Law of energy conservation 


Post-Newtonian Euler 
equation 


[see (39.39) for u° , and (39.32) for the metric]. Then rest-mass conservation takes 
on the same form as at the Newtonian order (39.15a), except now it is more accurate: 

p* t + j = 0 + errors of 0(p o / 5 ). (39.44) 

The next simplest equation of motion is T 0a . a = 0. Straightforward evaluation, 
using the metric of equations (39.32) and the stress-energy tensor of equations (39.42), 
yields 

[p 0 (l + n + v 2 + 2 £/)],< + [p 0 (l +n+v 2 + 2U)v j + tfiUjj 

+ (3y - 2 )p 0 U t + (3y - 3 )p 0 v k U k = 0(p 0 / 5 ). (39.45) 

By subtracting equation (39.44) from this, and using the Newtonian equations of 
motion (39.15) and (39.16) to simplify several terms where the Newtonian approxi¬ 
mation is adequate, one obtains 


p 0 dll/dt + tftVj'k = 0 + errors of O(p oj e 5 ). (39.46) 

Notice that this is nothing but the first law of thermodynamics (local energy conser¬ 
vation) with energy flow through the matter being neglected. (Neglecting energy 
flow was justified in §39.5.) This first law of thermodynamics is actually a post-New¬ 
tonian equation in the context of hydrodynamics, rather than a Newtonian equation, 
because 77 does not affect the hydrodynamic motion at Newtonian order (see §39.7). 

The last of the equations of motion, T’ a . a = 0, reduces to the post-Newtonian 
Euler equation 


- p*t/,. + [t Tk { 1 + 3y U)\ k - t VKk (| v* + 77) - MMJ. 

+ P'j;[( 2 y + 2 Wv, - |(7 A x + d 2 )fj - Uw ,] - v jP *U t + v k t kh 

+ \ A 2 p*(Vj - W f ) t + yp*[(7 A x + A 2 )v k + («j - 2a 3 )w k ]V k j 

2 ' 2 (39.47) 

- p*[2^ - Isa - ~ j« 2 w iWk U ik + a 9 w t (V t - W t )] 

- p*t/j[yv 2 — v + y(a 2 + <*3 — «iV 2 — (2/3 — 2)U + 3yp/p*j 

+ ~ ( }m V m,kVk) + jlv m ( l miV k ),k ~ vfiuvj.tl = °- 


Partial derivatives are denoted by commas; djdt is the time-derivative following the 
matter [equation (39.16)]. 

Equations (39.44), (39.46), and (39.47) are a complete set of equations of motion 
at the post-Newtonian order. 
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Exercise 39.13. EQUATIONS OF MOTION 

Carry out the details of the derivation of the equations of motion (39.44), (39.46), and (39.47). 
As part of the derivation, calculate the following values of the Christoffel symbols in the 
PPN coordinate frame: 


II 

-U t + . 

r° 0 j =-u r 

h 0(Ujt 2 l 


r % = 

yu, t s ik -t 


(j a l - “ 2 ) 

w (i u ,k) 


+ a 2 w i u m,k ) + 0(Uji 3 ). 



r ’oo = 

-^ + [ 

0S + y)U 2 - 2* + jitr + ylj# + j 

(«1 - «2 - <X 3 ) w2U 

£ 

+ y( a i 

- 2 “ 3 )' v i V i + J a 2 W i W k U ik 


-Iaw (3948) 

t 2 S’* 


+ ( a 2 — 

J a i) w i u ,t - a 2 w i u n,t + 0(Ujt 4 ), 


II 

AJ 

O 

yu t s ik - 

~ (2 Al + 2 ^ 2 ) Vll ’ kl 2 

a i w u u ,k] + 

0(Us 3 ), 

r \t = 

-tWjS, 

m - 2 U ik S t)j ) + O(Uf). 




Here square brackets on tensor indices denote antisymmetrization, and round brackets denote 
symmetrization. As part of the derivation, it may be useful to prove and use the relations 


x(t,x) = -J p 0 (t,x')\x - x'\ d 3 x', 

(39.49a) 

X,jfc = —8j k U -h U jk , 

(39.49b) 

“ x,« = Vi - Wi + 0 (£ 5 ), 

(39.49c) 




IM- 


Here x is the function originally defined in equation (39.29c). 


(39.50) 


Exercise 39.14. POST-NEWTONIAN APPROXIMATION TO GENERAL RELATIVITY 

Perform a post-Newtonian expansion of Einstein’s field equations, thereby obtaining the 
values cited in Box 39.2 for the PPN parameters of general relativity. The calculations might 
best follow the approach of Chandrasekhar (1965a): Set g aj3 = rj aj3 + h aj3 , and assume 


*oo = 0(€*) + 0(t% h oj = 0(c 3 ), h ik = 0(c 2 ). (39.51) 


Choose the space and time coordinates so that the four “gauge conditions” 


J h .i= 0(< 4 /R 0 ) ' 

K Oh,k ~ ~2 ^kk.O = 0(t ''/ Rq) 


with h = h a pT) a P — 


— *00 + *ti 


(39.52) 


are satisfied. 

(a) Show that the spatial gauge conditions are the post-Newtonian approximations to those 
(35.1a) used in the study of weak gravitational waves, but that the temporal gauge condition 
is not. 


EXERCISES 
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(b) Use these gauge conditions and the post-Newtonian limit in equations (8.24) and (8.47) 
to obtain for the Ricci tensor, accurate to linearized order, 

*00 - - J * 00 ,™ + 0(c 7V). *» = - J V,mm + °( £ V V)> (39.53a) 

*0; = -J h 0i . nm ~ j *00,0; + 0(t 5 /R 0 2 ). (39.53b) 

(c) Combine these with the Newtonian form (39.13) of the stress-energy tensor, and with 
equation (39.27), to obtain the following metric coefficients, accurate to linearized order; 

*00 -2U+ k 00 + 0(i% *0i= 0( t % 

t_[unknown post-Newtonian correction] (39.54) 

h jk = 2 US jk + 0(e 4 ). 

Here [/, V j, and W i are to be regarded as defined by equations (39.34a,b,c). By comparing 
these metric coefficients with equations (39.32), discover that 

y=l, A x = 1, A 2 = 1 (39.55) 


for general relativity. 

(d) With this knowledge of the metric in linearized order, one can carry out the analysis 
of §39.10 (using 7 = A x = A 2 = 1 throughout), to obtain the post-Newtonian corrections to 
the stress-energy tensor [equation (39.42) with 7 = 1 ]. 

(e) Calculate, similarly, the post-Newtonian corrections to the Ricci tensor component R 00 , 
using g a 0 = 7] a/3 + h a p, using h a/3 as given in equations (39.54), and using the gauge conditions 


(39.52). The answer should be 

*00 = (- U - J *00 - U 2 ) + 4 UU mm + 0(< W). (39-56) 

(f) Evaluate the Einstein equation R 00 = %n(T 00 — accurate to post-Newtonian 

order, and solve it to obtain the post-Newtonian metric correction 

£ 00 = —2t / 2 + 4^, (39.57) 

where 'k is given by equation (39.43d) with = fi 2 — = 1. By comparing with 

equations (39.32c) and (39.34d), discover that . 

P = Pi = P 2 = ft = ft = 1. S = V = 0 (39.58) 


for general relativity. 

(g) Knowing the full post-Newtonian metric, and the full post-Newtonian stress-energy 
tensor, one can carry out the calculations of §39.11 (using 7 — p = = fi 2 = /? 3 = = 

— A 2 = 1, f = 17 = 0) to obtain the post-Newtonian equations of motion for the matter 
[equations (39.44), (39.46), and (39.47)]. 
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§39.12. RELATION OF PPN COORDINATES TO 
SURROUNDING UNIVERSE 

One crucial issue remains to be clarified: What is the orientation of the PPN coordi¬ 
nate system relative to the surrounding universe? More particularly: Does the PPN 
coordinate system rotate relative to the “fixed stars on the sky;” or is it “rigidly 
attached” to them, in some sense? In order to answer this question, imagine using 
the PPN formalism to analyze the solar system. Make no assumptions about the 
solar system’s velocity through the PPN coordinate frame. Then, as one moves 
outward from the Sun, past the Earth’s orbit, past Pluto’s orbit, and on out toward 
interstellar space, one sees the PPN coordinate frame become more and more 
Lorentz in its global properties [g a/3 = Va/} + 0(M 0 /r)\. Thus , far from the solar 
system the PPN coordinates become a “Lorentz frame moving through the galaxy.” 
This means, of course, that the spatial axes of the PPN coordinate frame behave 
as though they were attached to gyroscopes far outside the solar system. Equivalently: 
The PPN coordinate system Fermi-Walker-transports its spatial axes through the 
spacetime geometry of the galaxy and universe. 


§39.13. SUMMARY OF PPN FORMALISM 

f 

The PPN formalism, as constructed in this chapter, is summarized in Box 39.4. Much 
of the recent literature uses a different set of PPN parameters than are used in this 
book; for a translation from one parameter set to the other, see Box 39.5. 


Exercise 39.15. MANY-BODY SYSTEM IN POST-NEWTONIAN LIMIT 
OF GENERAL RELATIVITY 

Consider, in the post-Newtonian limit of general relativity, a system made up of many 
gravitationally interacting bodies with separations large compared to their sizes ( example: 
the solar system). Idealize each body to be spherically symmetric, to be free of internal 
motions, and to have isotropic internal stresses, t# = 8 jk p. Let the world line of the center 
of body A, in some chosen PPN coordinate frame, be Jt^(r); and let the (coordinate) velocity 
of the center of body A be 


v A (t) = dxjdt . (39.59a) 

The total mass-energy of body A as measured in its neighborhood (rest mass-energy plus 
internal energy plus self-gravitational energy) is given by 

m a = jf (l + n - y £/ self ^(rest mass) + 0 (M A e 4 ), (39.59b) 

where L/ self is the body’s own Newtonian potential (no contributions from other bodies), 
and T a is the interior of the body. 


Solar system's PPN 
coordinate frame is attached 
to a local Lorentz frame of 
Galaxy 


EXERCISE 


{continued on page 1094) 
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Box 39.4 SUMMARY OF THE PPN FORMALISM 


I. Variables 

p 0 (x, t): baryon “mass” density (§39.3), as measured in rest frame 
77(x, t): specific internal energy (dimensionless; §39.3), as measured in rest frame 
tft( x, t): components of stress referred to orthonormal axes of rest frame 
Vj(x, t): coordinate velocity of matter (i.e., rest frame) relative to PPN coordinates 
U(x , r), ^(x, /), tf{x, /), L D(x, r), Vj(x, /), W^(x, t), U jk (x, t): gravitational potentials 
y, /?, £ 3 , @ 4 ’ V : parameters whose values distinguish one theory 

from another (see Box 39.2) 

w: velocity of PPN coordinate frame relative to “universal rest frame” [relevant 
only for theories with nonzero a v a 2 , or a 3 ; see eq. (39.33)]. 

II. Equations governing evolution of these variables 

p 0 : conservation of rest mass, equation (39.44) 

17: first law of thermodynamics, equation (39.46) 

determined in terms of p 0 , 77, and other material variables (chemical composi¬ 
tion, strains, etc.) by equations of state and the usual theory of a stressed 
medium—which is not discussed here 
vf. equations of motion (“JF = ma”), equations (39.47) 

U , 'P, d, <$, V j9 Wj, U jk : source equations (39.34) - - 

III. Quantities to be calculated from these variables 

g 00 (x, 0* go/** 0» gjk( x > 0 : these components of metric in PPN coordinate frame 
are expressed in terms of gravitational potentials by equations (39.32) 
w°(x, /), u\x , t): these components of matter 4-velocity in PPN coordinate frame 
are given by equations (39.39) 

7 00 (x, r), T°’(x, t), T jk (x , t ): these components of stress-energy tensor in PPN 
coordinate frame are given by equations (39.42) 


IV. Relation between rest frame, PPN coordinates, and the universe 

1. Orthonormal basis oj a of rest frame, where t-% are defined, is related to PPN 
coordinate basis dx a by equations (39.41) 

2. Far from the sun, the PPN coordinates become asymptotically Lorentz; i.e., 
they form an inertial frame moving through the spacetime geometry of the 
galaxy and the universe. 

3. Gives no account of expansion of universe or of cosmic gravitational waves 
impinging on solar system. 
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Box 39.5 PPN PARAMETERS USED IN LITERATURE: A TRANSLATOR'S GUIDE 

The original “point-particle version” of the PPN formalism [Nordtvedt (1968b)], and 
the original “perfect-fluid version” [Will (1971c)] used different sets of PPN param¬ 
eters. This book has adopted Will’s set, and has added the parameter r\ character¬ 
izing effects of anisotropic stresses. More recently, Will and Nordtvedt have jointly 
adopted a revised set of parameters, described below. 


A. Translation Table 


Wilf-Nordtvedt 
r revised parameters a 

Revised parameters in 
notation of this book b 

Revised parameters in 
notation of Nordtvedt (1968b) c 

Y 

y 

y 

0 

0 

fi 


7 A 1 + 4 — 4y — 4 

8 A - 4y - 4 

a. 

1 

+ 

N 

a'" - 1 

«3 

4/8, - 2y - 2 - £ 

4a" - a"' — 2y — 1 

h 

f 

- X 

^ 2 

2/3 + 2/3 2 - 3y - 1 

2/3- a' - 1 


fis- 1 

absent 

t* 

fit-y 

absent 


a Revised parameters are used by Will and Nordtvedt (1972), Nordtvedt and Will (1972), Will (1972), and Ni (1973). 
b Notation of this book is used by Will (1971a,b,c,d), Ni (1972), and Thome, Ni, and Will (1971). 
c Nordtvedt’s original “point-particle” parameters were used by Nordtvedt (1968b, 1970, 1971a,b). 


B. Significance of Revised Parameters 

a v a 2 , a 3 measure the extent of and nature of “preferred-frame effects”; see §39.9. 
Any theory of gravity with at least one a nonzero is called a preferred-frame theory. 

£ 2 ’ ^ 4 > a 3 measure the extent of and nature of breakdowns in global conser¬ 

vation laws. A theory of gravity possesses, at the post-Newtonian level, all 10 global 
conservation laws (4 for energy-momentum, 6 for angular momentum; see Chapters 
19 and 20) if and only if ^ = f 4 = a 3 = 0. See Will (1971d), Will and 

Nordtvedt (1972), Will (1972), for proofs and discussion. Any theory with 
= = f 4 = « 3 = 0 is called a conservative theory. 

In general relativity and the Dicke-Brans-Jordan theory, all a’s and f’s vanish. 
Thus, general relativity and Dicke-Brans-Jordan are conservative theories with no 
preferred-frame effects. 


V 
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(a) Show that, when written in the chosen PPN coordinate frame, this expression for 
M a becomes 

= jf P 0 (1 _+ n .+ i V + 3 U - i t/ sel£ ) d 3 x + 0(M A i 4 ). (39.59c) 

Use equations (39.43), (39.44), and (39.46) to show that M A is conserved as the bodies move 
about, dMJdt — 0. 

(b) Pick an event (/, x) outside all the bodies, and at time / denote 

r A= x A~ x > r AB = x A - x B , r A = \r A \, r AB = \r AB \. (39.59d) 

Show that the general-relativistic, post-Newtonian metric (39.32) at the chosen event has 
the form 


Sik — &jk (1 2 2 ) + 6)(e 4 ), 

V A T A f 

(39.60a) 

+<*•>. 

(39.60b) 

A r A ' A T A ' A r A 


-2 2 2 ^+0(< 6 ). 

(39.60c) 


A B*A 'A'AB 


[Hint: From the Newtonian virial theorem (39.21a), applied to body A by itself in its own 
rest frame, conclude that 


X ( 3 P--J p o U ^)^x = 0(M A t% 


(39.61) 


where the integral is performed in the PPN frame.] 

(c) Perform an infinitesimal coordinate transformation, 


t _ , 1 'sr M A (r A * v a) „ _ 

*OLD — NEW y Zj I * ^OLD “ 


(39.62) 


to bring the metric (39.60) into the standard form originally devised by Einstein, Infeld, and 
Hoffman (1938), and by Eddington and Clark (1938): 


8jk 


= «4 1 + 2 2T L ) + 0(t4 >’ 

' A •A * 


(39.63a) 


M a 


Soj =- 4 St 1 ^+ °( £5 ) 

A r A 


goo 


= -l+2Sf 1 -2(Sf 1 ) 2 + 3 S^ 

A T A \ a r A f a ' A 


A 'A 


A B^A r A r AB 01 


(39.63b) 


(39.63c) 
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where x [equation (39.49a)] is given by 


X = 




(d) The equations of motion for the bodies can be obtained in either of two ways: by 
performing a volume integral of the Euler equation (39.48) over the interior of each body; 
or by invoking the general arguments of §20.6. The latter way is the easier. Use it to conclude 
that any chosen body K moves along a geodesic of the metric obtained by omitting the terms 
A = K from the sums in (39.63). Show that the geodesic equation for body K reduces to 



- 2 

A ^LK r AK 


(39.64) 


+ \ 2 2 r cA 

A^KC jtA 


MaMc 

*» *• 3 

'AK'CA 


Equations (39.63) and (39.64) are called the Einstein-Infeld-Hoffman (“EIH”) equations for 
the geometry and evolution of^a many-body system. They are used widely in analyses of 
planetary orbits in the solar system. For example, the Caltech Jet Propulsion Laboratory uses 
them, in modified form, to calculate ephemerides for high-precision tracking of planets and 
spacecraft. The above method of deriving the EIH equations and metric was devised by 
Fock (1959). For a similar calculation in the Dicke-Brans-Jordan theory, see Estabrook 
(1969); and for a derivation of the analogous many-body equations in the full PPN formalism, 
see Will (1972). 




r 


CHAPTER 



SOLAR-SYSTEM EXPERIMENTS 


V. 


§40.1. MANY EXPERIMENTS OPEN TO DISTINGUISH 
GENERAL RELATIVITY FROM PROPOSED 
METRIC THEORIES OF GRAVITY 


This chapter analyzes 
experiments using PPN 
formalism 


Complexity of solar system's 
spacetime geometry 


No audience will show up for a fight if in everyone's eyes the challenger has zero 
chance to win. No battle-hungry promoter desperately trying to finance the fight 
can afford to put into the ring against the champion'any but the best contender 
that he can find. Against Einstein’s metric theory of gravity, the judgment of the 
day (as §39.2 showed) leaves one no option except to put up another theory of gravity 
that is also metric (or metric plus torsion). 

To put on a contest, then, is to design and perform an experiment that distinguishes 
general relativity from some not completely implausible metric theory of gravity. 
This chapter describes such experiments—some already performed; some to be 
performed in the future—and analyses their significance using the PPN formalism 
of Chapter 39. 

In most of the experiments to be described, one investigates the motion of the 
moon, planets, spacecraft, light rays, or gyroscopes through the spacetime geometry 
of the solar system. That spacetime geometry is very complicated. It includes the 
spherical fields of the sun and all the planets, nonspherical fields due to their 
quadrupolar and higher-order deformations, and fields due to their momentum and 
angular momentum. Moreover, the spacetime geometry results—or at least in the 
post-Newtonian formalism it is viewed as resulting—from a nonlinear superposition 
of all these fields.* 


* Of course, from the point of view of Einstein’s full general relativity theory, all that legalistically 
counts is the one and only curved-spacetime geometry of the real physical world. All these “individual 
fields’’ are mere bookkeepers’ discourse, and they are best abandoned (they cease to be useful) when 
one passes from the post-Newtonian limit to the full Einstein theory. 
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Fortunately for this discussion, several of the most important experiments are free 
of almost all these complications. The effects they measure are associated entirely 
with the spherical part of the sun’s gravitational field. A description of these experi¬ 
ments will come first (§§40.2-40.5), and then attention will turn to experiments that 
are more complex in principle. 

To discuss central-field experiments, one needs an expression for the external 
gravitational field of an idealized, isolated, static, spherical sun. In general relativity, 
such a gravitational field is described by the Schwarzschild line element, 


Idealization of geometry to 
that of isolated, static, 
spherical sun: 


= _ (, _ Vk)* + T -^-- ++ si „w). 


(1) in Schwarzschild 
coordinates 


But this line element is not what one wants, for two reasons: (1) it is “too accurate”; 
(2) it is written in the “wrong” coordinate system. 

Why too accurate? Because it is simple only when unperturbed and unmodified; 
whereas some modified theories show up new effects that are so complex they are 
tractable only in the post-Newtonian approximation. Why wrong coordinate system? 
Because physicists, astronomers, and other celestial mechanics have adopted the 
fairly standard convention of using “isotropic coordinates” rather than “Schwarzs¬ 
child coordinates” when analyzing the solar system. Example: post-Newtonian 
expansions, including the PPN formalism of Chapter 39, almost always use isotropic 
coordinates. Another example: the relativistic ephemeris for the solar system, pre¬ 
pared by the Caltech Jet Propulsion Laboratory [Ohandley et al. (1969); Anderson 
(1973)] and used extensively throughout the world, employs isotropic coordinates. 

Modify the Schwarzschild line element, then. First transform to isotropic coordi¬ 
nates (Exercise 31.7); then expand the metric coefficients in powers of M 0 /r, to 
post-Newtonian accuracy. Thereby obtain 


ds 2 ~ — 


[l - + l(d^j^dt 2 + [l + 2^][Jr 2 + r 2 (d0 2 

[i-2^ + 2(^)y + [ l+2 k] [dx * + d? + 


+ sin 2 # d$ 2 )\ 

(40.1) 

dz 2 }. 


(2) in isotropic coordinates 


Here r, 6 , <f> are related to x, y, z in the usual manner: 

r = (x 2 + y 2 + z 2 ) 1/2 , 6 = tan -1 [z/(x 2 + y 2 ) 1/2 ], <> = tan ~\y/x)\ (40.2) 

and r is the new, “isotropic” radial coordinate, not to be confused with the 
Schwarzschild r. (The reader who has not studied §39.6 will discover in the next 
section why one keeps terms of order M 0 2 /r 2 in g 00 but not in g jk .) Note: this post- 
Newtonian expression for the metric is a special case of the result derived in 
exercise 19.3. 

If one calculates the gravitational field of the same source (the sun) in the same 
post-Newtonian approximation in other metric theories of gravity, one obtains a 
very similar result: 
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(3) in PPN formalism 

ds 2 — — 

L r \ r j 

f] 

dt 2 + 


= - 

1 - 2 ^ + 2 p 

r \ r / 

f] 

dt 2 + 

[■ 


(40.3) 


(see exercise 40.1). Here y and ft are two of the ten PPN parameters described in 
Box 39.2. Recall from that box that y measures “the amount of space curvature 
produced by unit rest mass,” while ft measures “the amount of nonlinearity in the 
superposition law for g 00 .” These heuristic descriptions find their mathematical 
counterparts in the above form for the idealized metric surrounding a spherically 
symmetric center of attraction. 

By measuring the parameter y to high precision, one can distinguish between 
general relativity (y = 1) and the Dicke-Brans-Jordan theory [y = (1 + co)/(2 + co), 
where co is the “Dicke coupling constant”]; see Box 39.2. But general relativity and 
Dicke-Brans-Jordan predict the same value for /?(/?= 1). This identity does not 
mean that /? is unworthy of measurement. A value ft ^ 1 is predicted by other 
theories [see Ni (1972)]; so measurements of ft are useful in distinguishing such 
theories from general relativity. 

Actually, the above form (40.3) for the sun’s metric is not fully general. In any 
theory with a preferred “universal rest frame” (e.g., Ni’s theory; Box 39.1), there 
are additional terms in the metric due to motion of the sun relative to that preferred 
frame (exercise 40.1): 


(4) including preferred-frame 
effects 


ds 2 = (expression 40.3) + (a 2 + n 3 — n x ) h > 2 dt 2 -I- 2 ^a 2 — — w j dx j dt 

— — -jr 2 w j dtQ. dx k + w k dt). (40.30 


In these “preferred-frame terms” 7 0 = I if = fpr 2 d 3 x is the trace of the second 
moment of the sun’s mass distribution; 


= 1A X + d 2 — 4y - 4, 
n2 = d 2 + f 1» 

a 3 = 4/?! - 2y - 2 - £ 

I 

are combinations of PPN parameters; and w is the sun’s velocity (= velocity of 
coordinate system) relative to the preferred frame. (Theories such as general relativity 
and Dicke-Brans-Jordan, which possess no preferred frame, have a x = a 2 = a 3 = 0, 
and therefore have no preferred-frame terms in the metric.) For ease of exposition, 
all equations and calculations in this chapter will ignore the preferred-frame terms; 
but the consequences of those terms will be discussed and references analyzing them 
will be cited. 
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Exercise 40.1. PPN METRIC FOR IDEALIZED SUN [Track 2] 

Show that for an isolated, static, spherical sun at rest at the origin of the PPN coordinate 
system, the PPN metric (39.32) reduces to expressions (40.3), (40.3'). As part of the reduction, 
show that the sun’s total mass-energy is given by 

R 

Mo = jf PoO + W 2 U + /? 3 // + 3/?4/)/p 0 )477r 2 dr. (40.4) 

( Warning: One must not look at this formula and immediately think: “The contribution of 
rest mass is fp 0 4rrr 2 dr, the contribution of gravitational energy is f2p 2 p 0 U4rrr 2 dr, etc.” 
Rather, in making any such interpretation one must remember that (1) spacetime is curved, 
so 4 mr 2 dr is not proper volume as measured by physical meter sticks; also (2) virial theorems 
(exercise 39.6) and other integral theorems can be used to change the form of the integrand. 
For further discussion see exercises 40.9 and 40.10 below.] 


§40.2. THE USE OF LIGHT RAYS AND RADIO WAVES 
TO TEST GRAVITY 

In the Newtonian limit, planetary and spacecraft orbits are strongly influenced by 
gravity; but light propagation and radio-wave propagation (at “infinite” velocity) 
are not influenced at all. For this reason, experimental studies of orbits are beset 
by the problem of separating the relativistic effects from much larger standard 
Newtonian effects. By contrast, experimental studies of light and radio-wave propa¬ 
gation do not contend with any such overpowering Newtonian background. Not 
surprisingly, they are to date (1973) the clearest and most definitive of the solar- 
system experiments. 

Mathematically, the parameter that distinguishes a light ray from a planet is its 
high speed. In the geodesic equation, the magnitude of the velocity determines which 
metric coefficients can influence the motion. Consider, for example, a weak, static 
Sap — Vap + h a/3 , an d a particle at (x,y, z) = (r, 0,0) moving with velocity 

( v x > v v’ v z) — (0> u >0); see Figure 40.1. Here the effect of gravity on the trajectory 

of the particle can be characterized by the quantity 

/curvature of trajectory in 3-dimensional,\ _ /radius of curvature)- 1 
\ nearly Euclidean, space / ~ \ of trajectory / 

_ dbc _ dr d ! dr dx \ _ _1_ d I u z \ _ 1 du*_ 

dy 2 dy dr\dy dr) u y dr \u y ) ~ (u y ) 2 dr 

(1 - v 2 ) dx a dxP 1 r;c dx a dxS 

v 2 a!i dr dr V 2 1 afi dt dt 

— — r x ~ I )~ 2 — IT 1 r* 

1 00 U OyV — 1 yy 

= J h oo,*v - 2 + (h 0VfX - h 0 XfV )v~ l + (jh yvx - h xv v y 


EXERCISES 


Light rays and radio waves 
give "clean" tests of 
relativity 


1100 


40. SOLAR-SYSTEM EXPERIMENTS 


Light rays are governed 
solely by Newtonian potential 
and PPN parameter y 



Reexpressed in spherical coordinates, in the terminology of the idealized solar line 
element (40.3), this formula says 

/ curvature of trajectory \ _ J. 2 + L (h /r 2 s 
Vin 3-space ) ~ 2 °°- r + 2 ] ' T 

S -(M 0 //-2)(u- 2 + y) (4 °' 5) 

for a particle at its point of closest approach to the sun. (Compare with exercise 
25.21.) Note that here y is a PPN parameter; it is not (1 — v 2 )~ 1/2 . 

Notice what happens as one boosts the velocity of the particle. For slow velocities 
[v 2 ~ (post-Newtonian expansion parameter c 2 ) ss M 0 /R 0 ], the Newtonian part of 
h 00 dominates completely; and the tiny post-Newtonian corrections come equally 
from the c 4 part of h 00 , the c 3 part of h oj , and the c 2 part of h jk . [This was the 
justification for expanding h 00 to 0(e% h 0j to 0(c 3 ), and h jk to 0(e 2 ) in the post- 
Newtonian hmit; see §39.6.] But as v increases, the ordering of the terms changes. 
In the high-u regime (u — 1 > c 2 ), the bending of the trajectory has become almost 
imperceptible because of the high forward momentum of the particle and the short 
time it receives transverse momentum from the sun. What bending is left is due 
to the c 2 (Newtonian) part of /z 00 , and the c 2 (post-Newtonian) part of h jk . Nothing 
else can have a significant influence. Notice, moreover, that—even when on^ allows 
for “preferred-frame” effects—these dominant terms, 

^oo = 2U= 2MJr and h jk = 2y U8 jk = 2y(M 0 /r) 8 jk , 

depend only on the Newtonian potential U = —<P and the PPN parameter y. 

This is a special case of a more general result: Aside from fractional corrections 
of c 2 < 10 -6 , relativistic effects on light and radio-wave propagation are governed 
entirely by the Newtonian potential U and the PPN parameter y. These relativistic 
effects include the gravitational redshift (discussed in the last chapter; independent 
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of y), the gravitational deflection of light and radio waves (discussed below; depend¬ 
ent on y), and the “relativistic time-delay” (discussed below; dependent on y). 


§40.3. "LIGHT" DEFLECTION 

Consider a light or radio ray coming into a telescope on Earth from a distant star 
or quasar. Do not assume, as in the usual discussion (exercises 18.6 and 25.24), that 
the ray passes near the sun. The deflection by the sun’s gravitational field will 
probably be measurable, in the middle or late 1970’s, even when the ray passes far 
from the sun! [The calculation that follows is due to Ward (1970), but Shapiro (1967) 
first derived the answer.] 

Orient the PPN spherical coordinates of equation (40.3) so that the ray lies in 
the “plane” 6 = tt/2. By symmetry, if it starts out in this plane far from the Earth, 
it must he in this plane always. Let the incoming ray enter the solar system along 
the line <f> = 0; and let the Earth be located at r = r E , <j> — <j> E when the ray reaches 
it. (See Figure 40.2.) One wishes to calculate the angle a between the incoming light 
ray and the center of the sun, as measured in the orthonormal frame (e*, e ^) of 
an observer on Earth. If the sun had zero mass (flat, Euclidean space), a would 
be 7r — <f> E (see Figure 40.2). However, the sun produces a deflection: a = m — <j> E 
*F Sa. The deflection angle 8a is the true objective of the calculation. 

In the calculation, ignore the Earth’s orbital and rotational motions. They lead 
to aberration, for which correction can be made by the usual formula of special 
relativity (Lorentz transformation in the neighborhood of the telescope.) Also ignore 
deflection of the light ray due to the Earth’s gravitational field (deflection angle 
2 M e /R e ~ 0".0003), which might be detectable in the late 1970’s. 



Figure 40.2. 

Coordinates used in the text for calculating the deflec¬ 
tion of light. Notice that in this diagram increases 
in the clockwise direction. 




Light deflection: 


(1) derivation 
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As the first step in calculating the deflection angle, determine the trajectory of 
the ray in the r, <f>-plane. This can be calculated either using the geodesic equation, 
or using the eikonal method of geometric optics (Hamilton-Jacobi method; §22.5 
and Box 25.4). The result of such a calculation (exercise 40.2) is an equation con¬ 
necting r with <f>; thus, 


— = sin <f> + — (l — cos <f>). (40.6) 

r b 

Notice that b has a simple geometric interpretation: far from the sun, the ray 
trajectory is <f> = b/r -j- 0(M o b/r 2 ). Consequently, b is the impact parameter in the 
usual sense of classical scattering theory (see Figure 40.2). The ray makes its closest 
approach to the sun (assuming it is not intercepted by the Earth first) at the PPN 
coordinate radius 


min 




b. 


(40.7) 


Thus, b can also be thought of as the radius of the ray’s “perihelion.” 

Notice that the ray returns to r = oo ; not at an angle <f> = 7r, but rather at 

<f>(r = oo) = 7 T + 2(1 + y)M e /b. (40.8a) 


Thus, the total deflection angle is 


(angle of total deflection) = 2(1 -f y)M Q /b 

— y (1 -f y)l".75 for a ray that 
2 just grazes the sun. 


(40.8b) 


But this is not the quantity of primary interest. Rather, one seeks the position 
of the star as seen by an astronomer on Earth. The angle a = v — <j> E + 8a between 
the sun and the star as measured by the astronomer is given by (see Figure 40.2) 


tan(7r — <j> E -j- §a) = — ta n<j> E + §a/cos 2 </> B 



'(1 + yM Q /r)r d<t>/d \" 

E 

~rd<t>~\ 

u - 

(1 + yMcJr) dr/dX J 

. dr \ 


(b/r)d<j> 1 
d(b/r) \ E 


(40.9) 


I 


where — dx^/dX are the components of a tangent to the ray at the Earth. By 
inserting into this equation expression (40.6) for the trajectory of the ray, one obtains 

tan <5 - 8<X - sin< ftg ± [Q ± y) M o/ b ]( l ~ 

an E cos 2 <i> E ~ cos <(> E + [(1 + y)M 0 /b] sin <j> E 

= tan <j> E - [(1 + y)M Q /b](\ - cos^)/cos 2 ^. 


(40.10) 
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Thus, the deflection angle measured at the Earth is 


8a = 


(1 + y )M q 


a (1 + cos a) = A1 ±yWo n ,+ cosay 
r F \ 1 — cos a / 


(40.11) 


It ranges from zero when the ray comes in opposite to the sun’s direction (a = it), 
through the value 


(1 + y)M Q /r E = -i(l + y)0"0041 (40.12) 

when the ray comes in perpendicular to the Earth-Sun line (a = m/\ 2), to the “classi¬ 
cal value” of^(l +y)x 1".75 when the ray comes in grazing the sun’s limb. 

All experiments to date (1972) have examined the case of grazing passage. The 
experimental results are stated and discussed in Box 40.1. They show that the PPN 
parameter y has its general relativistic value of 1 to within an uncertainty of about 
20 percent. 

By the middle or late 1970’s, measurements of the deflection of radio waves from 
quasars should determine y to much better than 1 percent. Also, by that time radio 
astronomers may be making progress toward setting up high-precision coordinates 
on the sky using very long baseline interferometry. If so, they will have to use 
equation (40.11) to compensate for the “warping” of the coordinates caused by the 
sun’s deflection of radio waves in all regions of the sky, not just near the solar limb. 


Exercise 40.2. TRAJECTORY OF LIGHT RAY IN SUN S GRAVITATIONAL FIELD 

Derive equation (40.6) for the path of a light ray in isotropic coordinates (40.3) in the sun’s 
“equatorial plane.” Use one or more of three alternative approaches: (1) direct integration 
of the geodesic equation (the hardest approach!); (2) computation based on the three integrals 
of the motion 


k * k = 0, k • (9/0/) = k 09 k • (0/0<#>) = = -bk 0 ; 

k = d/dX = tangent vector to geodesic 

(see §§25.2 and 25.3); (3) computation based on the Hamilton-Jacobi method (Box 25.4), 
which for photons (zero rest mass) reduces to the “eikonal method” of geometric optics 
(see §22.5). 


§40.4. TIME-DELAY IN RADAR PROPAGATION 

Another effect of spacetime curvature on electromagnetic waves is a relativistic delay 
in the round-trip travel time for radar signals. It was first pointed out by Shapiro 
(1964); see also Muhleman and Reichley (1964, 1965). 


(2) formula for deflection 
angle 


Experimental measurements 
of deflection 


EXERCISE 


Radar time delay: 


{continued on page 1106) 
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“Here (observed deflection)/(Einstein prediction) is the number |(1 + y) obtained by fitting the observational data to the PPN prediction (40.11). [For these experiments the ray passes near 
the solar limb; so (40.11) reduces to 8a = §(1 + y)(M e /b).) The “formal standard error” is obtained from the data by standard statistical techniques. However, it is not usually a good measure 
of the certainty of the result, because it fails to take account of systematic errors. The quoted “one-sigma error” is the experimenters’ best estimate of the combined statistical and systematic 
uncertainties. The experimenters estimate a probability of 68 percent that the true result is within “la” of their measured value; a probability of 95 percent that it is within “2a”; etc. 
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Figure 40.3. 

Diagram, in the PPN coordinate system, for 
the calculation of the relativistic time delay. 


(1) foundations for 

calculation; Fermat's 
principle 


(2) details of calculation 


Let a radar transmitter on Earth send a radio wave out to a reflector elsewhere 
in the solar system, and let the reflector return the wave to Earth. Calculate the 
round-trip travel time, as measured by a clock on Earth. For simplicity of calculation, 
idealize both Earth and reflector as nonrotating and as at rest in the static, spherical 
gravitational field of the Sun. At the end of the calculation, the effects of rotation 
and motion will be discussed separately. Also ignore time dilation of the transmitter’s 
clock due to the Earth’s gravitational field; it is easily corrected for, and it is so 
small that it will not come into play in these radar experiments before the middle 
or late 1970’s. The gravitational effects of the other planets on the radio waves are 
too small to be discernible in the foreseeable future, unless the beam grazes the 
limb of one of them. However, the effects of dispersion in the solar wind and corona 
are discernible and must be corrected for. These corrections will not be discussed 
here, since they are free of any general-relativistic influence. 

The calculation of the round-trip travel time can be simplified by using a general- 
relativistic version of Fermat’s principle: In any static field ( g oj = 0, g afi0 — 0) 
consider all null curves between two points in space , x* = a j and x’ = bK Each such 
null curve , requires a particular coordinate time interval At to get from a i to 

bK The curves of extremal At are the null geodesics of spacetime . The proof of this 
theorem is outlined in exercise 40.3. 

Because of Fermat’s principle, the lapse of coordinate time between transmission 
of the radar beam and reflection at the reflector, r TR , is the same for a straight path 
in the PPN coordinates as for the slightly curved path which the beam actually 
follows. (The two differ by a fractional amount At rR /t TR ~ (angle of deflec¬ 
tion) 2 < 10" 12 , which is far from discemable.) Hence, in the computation one can 
ignore the gravitational bending of the beam. 

Adopt Cartesian PPN coordinates with the sun at the origin; the transmitter, sun, 
and reflector in the z — 0 “plane”; and the transmitter-reflector line along the x 
direction (see Figure 40.3). The transmitter is at (x,y) = ( — a^ b) in the PPN coordi¬ 
nates, and the reflector is at (x,y) = (a R , b ). Recall that for a null ray ds 2 — 0 = 
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goo dt 2 — g xx dx 2 . It follows that the lapse of coordinate time between transmission 
and reflection is 


*TR 



(1 + y)Md 1 

Vx 2 + ft 2 J 


dx. 


Integration yields 

t T R = o R J i-a T +(\ -f y)A/q In [ 


(a R + Vg ± b 2 ) (a T + Vaj. + b 2 ) 
b 2 


]■ 


(40.13) 


The lapse of coordinate time in round-trip travel has twice this magnitude. The 
lapse of proper time measured by an Earth-based clock is 


— IfoolEitfth^TR’ 


At 


V<2r + b 2 


) 


— 2 (a R -j- a T ) ^ 1 

+ 2(1 + y)« s :p± y^TP) j 


(3) formula for delay 


(40.14) 


This is the lapse of time on an Earth-based clock , aside from corrections for the orbital 
and rotational motion of the Earth, for the orbital motion of the reflector, for 
dispersion of radiation traversing the solar wind and corona, and for time dilation 
in the Earth’s gravitational field. 

Any reader is reasonable who objects to the form (40.14) in which the time-delay (4) comparison with 
has been written. The quantities a R , a^ and b are coordinate positions in the PPN experiment 
coordinate system, rather than numbers the astronomer can measure directly. They 
differ from coordinate positions in other, equally good coordinate systems by 
amounts of the order of M e —1.5 km. The objection is not mathematical in its origin. 

The quantities a R ., a ^ and b are perfectly well-defined [with post-post-Newtonian 
uncertainties of order b(M Q /b) 2 < 10 -6 km], because the PPN coordinate system 
is perfectly well-defined. But they are not quantities which the experimenter can 
measure directly, with precision anywhere near that required to test the relativistic 
terms in the time-delay formula (40.14). 

In practice, fortunately, the experimenter does not need to measure a R , a^ or b 
with high precision. Instead, he checks the time-delay formula by measuring the 
changes in At as the Earth and reflector move in their orbits about the Sun; Le., 
he measures At as a function of Earth-based time t. Notice that when the beam 
is passing near the sun (b < a R , b 4 a T \ but db/dT > da R /dT and db/dr > da T /dT 
because the Earth’s and reflector’s orbits are nearly circular), the change of b in 
the In term of (40.14) dominates all other relativistic corrections to the Newtonian 
delay; consequently (using db/dT — 10 km/sec for typical experiments) 


dAT 

dT 


( Constant Newtonian 
part 


-4(1 + y) 


M 0 db 
b dT 


(40.15) 


4(1 +v) ( 


/1.5 km\ 

/10 km \ 

\10 6 km/ 

\ sec / 


30 jiAsec 
day 
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(5) experimental result for y 


EXERCISE 


Such differential shifts in round-trip travel time—which rise as the Earth-reflector 
line moves toward the Sun and falls as it moves away—are readily observable. 

In practice, in order to obtain precisions better than about 20 percent in the 
determination of the parameter y by time-delay measurements, one must carefully 
collect'and analyze data for a large fraction of a year—from a time when the beam 
is far from the sun (b ~~ a T ~~ 10 s km), to the time of superior conjunction ( b ~~ R Q 
~~ 10 6 km), and on around to a time of distant beam again. Such a long “arc” of 
data is needed to determine the reflector’s orbit with high precision, and to take 
full advantage of the slow, logarithmic falloff of At with b (40.14). When the beam 
is far from the sun (b > R e ), the simplifying assumptions behind equation (40.15) 
are not valid; and the relativistic time-delay gets intertwined with the orbital motions 
of the Earth and the reflector. The analysis then remains straightforward, but its 
details are so complex that one resorts to numerical integrations on a computer to 
carry it out. Because the orbital motions enter, the time-delay data then contain 
information about other metric parameters (/? is the dominant one) in addition to y. 

The experimental results as of 1971 are described in Box 40.2. They yield a value 
for the PPN parameter y that is more accurate than the value from light and 
radio-wave deflection experiments: 

y= 1.02 ± 0.08. (40.16) 

Future experiments using spacecraft may improve the precision of y to ±0.001 or 
better. 


Exercise 40.3. FERMAT'S PRINCIPLE 

Prove Fermat’s principle for a static gravitational field. [Hint: The proof might proceed as 
follows. Write down the geodesic equation in four-dimensional spacetime using an affine 
parameter X. Convert from the parameter X to coordinate time /, and use ds 2 — 0 to obtain 

d 2 x k dx k dx l g kl dx k dx l d 2 i/d\ 2 dx k _ 

g ’ k dt 2 + ikl ~dT~dT~ )00 g^ dt Jt + (dt/d\) 2 Sik ~dT ~ • 


Combine with the time part of the geodesic equation 


d 2 t/dX 2 _ , r dx*/dt 
(dt/d\) 2 oko g 00 


and use the expression for the Christoffel symbols in terms of the metric to obtain 


d 2 x k 1 dx k dx l 

■fyc ^2 + J ( yjk ’ 1 + T#i,* “ 


T;* : 


8jk 

goo 


Then notice that this is a geodesic equation with affine parameter t in a three-dimensional 
manifold with metric y jk . The familiar extremum principle for this geodesic is 


sf (y jk dxl dx k ) 1/2 = Sjf 6 dt = o. 


which is precisely Fermat’s principle!] 
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Box 40.2 RADAR TIME DELAY IN THE SOLAR SYSTEM: 

EXPERIMENTAL RESULTS 

Two types of experiments have been performed to 
measure the relativistic effects [proportional to 
f(l + r); equation (40.14)] in the round-trip radar 
travel time in the solar system. In one type (“ pas¬ 
sive” experiment) the reflector is the surface of the 
planet Venus or the planet Mercury. In the other 
type (“active” experiment) the “reflector” is elec- 
- trcrnic equipment on-boards spacecraft that re¬ 
ceives the signal and transmits it back to Earth 
(“transponder”). Passive experiments suffer from 
noise due to topography of the reflecting planet 
(earlier radar return from mountain tops than 
from valley floors), and they suffer from weakness 
of the returned signal. Active experiments suffer 
from buffeting of the spacecraft by solar wind, 
buff etin g by fluctuations 4nr solar radiation pres¬ 
sure, and buffeting by leakage from gas jets (“out- 
gassing”). Experiments of the future will solve 
these problems by placing a transponder on the 
surface of a planet or on a “drag-free” (buffeting- 
free) spacecraft. But experiments of the present 
and future must both contend with fluctuating 
time delays due to dispersion in the fluctuating 
solar wind and corona. Fortunately, these are 
smaller than the relativistic effects, except when 



The Mariner VI spacecraft (mock-up), which was the reflector 
in a 1970 measurement of £(1 + y) by radar time delay [photo 
courtesy the Caltech Jet Propulsion Laboratory]. 

the beam passes within 2 or 3 solar radii of the 
sun. 

The results of experiments performed before 
1972 are listed in the table. 


Experimental result a 


Dates of 
observation 

Radar 

telescopes 

Reflector 

Experimenters 

and 

reference 


+ y) = 

/ Ob served \ 
V delay ! 

Formal 

standard 

error 

One- 

sigma 

error 

Wave i 
length ' 

fEinstein \ 
\predictionf 

November 1966 

Haystack (MIT) 

Venus and 

Shapiro (1968) 

3.8 cm 

0.9 


±0.2 

to 


Mercury 






August 1967 








1967 

Haystack (MIT), 

Venus and 

Shapiro, Ash, el at. 

3.8 cm. 

1.015 

±0.02 

±0.05 

through 

and 

Mercury 

(1971). 

and 




1970 

Arecibo (Cornell) 



70 cm. 




October 1969 

Deep Space 

Mariner VI 

Anderson, el ai 

14 cm. 

1.00 

±0.014 

±0.04 

to 

Network 

and VII 

(1971) 





January 1971 

(NASA) 

spacecraft 







a Here (observed delay)/(Einstein prediction) is the value of j(l + y) obtained by fitting the observational data, At(t), to a more sophisticated 
version of the PPN prediction (40.14). This more sophisticated version includes the gravitational influences of all the planets on the orbits of 
reflector and Earth: also the effect of the moon on the Earth’s orbit and the effect of the Earth’s rotation on the travel time; also, to as good 
an extent as possible, the delay due to dispersion in the solar corona and wind. “Formal standard error*’ and “one-sigma error” are defined 
in the table in Box 40.1. . _ 
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40. SOLAR-SYSTEM EXPERIMENTS 


§40.5. PERIHELION SHIFT AND PERIODIC PERTURBATIONS 
IN GEODESIC ORBITS 


Perihelion shift for geodesic 
orbits around spherical sun, 
ignoring preferred-frame 
effects 


The light-deflection and time-delay experiments both measured y. To measure other 
PPN parameters, one must examine the effects of gravity on slowly moving bodies; 
this was the message of §40.2. 

Begin with the simplest of cases: the geodesic orbit of a test body in the sun’s 
spherical gravitational field, ignoring all gravitational effects of the planets, of solar 
oblateness, and of motion relative to any preferred frame. The PPN metric then 
has the form (40.3): 


\dt 2 


+ [l + 2y^] [dr 2 + r\d6 2 + sin 2 0 d<f> 2 )]. 


(40.3) 


Orient the coordinates so the test body moves in the equatorial “plane” 6 = 77 / 2 ; 
and calculate the shape r(<j>) of its nearly Keplerian, nearly elliptical geodesic orbit. 
The result, accurate to order M 0 /r beyond Newtonian theory, is 


(1 — e 2 )a 

1 + ecos[(l — S<£ 0 /2 t7)<£] ’ 


(40.17) 


where a and e are constants of integration, and 8<j> 0 is defined by 


8(j> 0 - 


(2 — J3 + 2y) 67tM q 
3 a ( 1 - e 2 ) 


6ttM q 
a {1 — e 2 ) 


in general relativity. 


(40.18) 


(For derivation, see exercise 40.4.) 

Notice that, if <$<£ 0 were zero—as it is in the Newtonian limit—then the orbit (40.18) 
would be an ellipse with semimajor axis a and eccentricity e (see Box 25.4). The 
constant 8<j> 0 merely makes the ellipse precess: for r to go through a complete circuit, 
from perihelion to aphelion to perihelion again, (1 — S<J> 0 /2 t7)<j> must change by 277; 
so <j> must change by 2?7 + S<£ 0 . Thus, the perihelion shifts forward by an angle 5 ^> 0 
with each circuit around the ellipse. 

Relative to what does the perihelion shift? (1) Relative to the PPN coofdinate 
system; ( 2 ) relative to inertial frames at the outskirts of the solar system (since the 
PPN coordinates are tied to those frames; see §39.12); (3) relative to a frame 
determined by the “fixed stars in the sky” (since the inertial frames at the outskirts 
of the solar system, inertial frames elsewhere in our galaxy, and inertial frames in 
our cluster of galaxies should not rotate significantly relative to each other); (4) 
relative to the perihelia of (other) planets, which themselves are shifting at calculable 
rates that decrease as one moves outward in the solar system from Mercury to Venus 
to Earth to.... 
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The perihelion shift is not the only relativistic effect contained in the orbital motion 
for a test body. There are other effects, but they are all periodic rather than cumula¬ 
tive with time; so, with the limited technology of the pre-space-age era, it was 
impossible to detect them. But the technology of the 1970’s is bringing them within 
reach. Moreover, many space-age experiments are necessarily of short duration 
(< one orbit)—particularly those involving spacecraft and transponders landed on 
planets. For these, the periodic perturbations in an orbit are of almost as much 
expenm e n t al v alue as t he cu mulative perihelion shift. The periodic effects are not 
obvious in the PPN orbital equation (40.17); it looks like the simplest of precessing 
ellipses. But the quantities the observer measures directly are not a, e, and S<f> 0 . 
Instead, he measures the time evolution of round-trip radar travel times, z1t(t), and 
of angular positions on the sky [0 O ( t), <£ 0 (t)]. To compute these quantities is perfectly 
straightforward in principle, but in practice is a very complex task. The calculations 
predict relativistic effects periodic with the frequency of the orbit and all its har¬ 
monics. The amplitudes of these effects, for the lower harmonics, must obviously 
be of the order of M e ~ 1 km ~ 10 jttsec ~ 0".01 arc on the sky. (The distance 

M e = 1.48 km is the characteristic length for all relativistic effects in the sun’s 
spherical field!) 

The most favorable orbits for experimental tests of the perihelion shift and of 
periodic effects are those that go nearest the sun and have the highest eccentricity 
[see equation (40.18)]—the orbits of Mercury, Venus, Earth, Mars, and the asteroid 
Icarus. But how does one know that these orbits are geodesics? After all, planets 
are not “test bodies”; they themselves produce nonnegligible curvature in spacetime. 
It turns out (see §40.9 for full discussion) that there should exist tiny deviations from 
geodesic motion, but they are too small to compete with the perihelion shift or with 
the periodic effects discussed above, at least for these five bodies. 

Extensive astronomical observations of planetary orbits, dating back to the mid¬ 
nineteenth century and aided by radar since 1966, have yielded accurate values for 
planetary perihelion shifts (accurate to ±0.4 seconds of arc per century for Mercury). 
From the data, which are summarized and discussed in Box 40.3, one obtains the 
value 

1(2 -/3 + 2y)= 1.00 (40.19a) 

for the ratio of observed relativistic shift to general relativistic prediction. Combining 
this result with the radar-delay value for y (40,16), one obtains a value 

& — 1-0 {^0.2 (40J9b) 

for the PPN parameter j3. (Recall: (3 measures the “amount of nonlinearity in the 
superposition law for g 00 .”) 

The periodic effects in the planetary orbits have not yet (1973) been studied 
experimentally. 

The above discussion and Box 40.3 have ignored the motion of the solar system 
relative to the preferred frame (if one exists); i.e., they have ignored the terms (40.3') 


Periodic perturbations in 
geodesic orbits 


Comparison of theory with 
planetary orbits 


Experimental result for ft 
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f Box 40.3 PERIHELION SHIFTS: EXPERIMENTAL RESULTS 


A 


Relativistic corrections to Newtonian theory are 
not the only cause of shift in the perihelion of a 
planetary orbit. Any departure of the Newtonian 
gravitational field from its idealized, spherical, 
inverse-square-law form also produces a shift. 
Such nonsphericities and resulting shifts are 
brought about by ( 1 ) the gravitational pulls of 
other planets, and ( 2 ) deformation of the sun 
(“solar oblateness”; “quadrupole moment”). In 
addition, when the primary data are optical posi¬ 
tions of planets on the sky (right ascension and 
declination as functions of time), there is an ap¬ 
parent perihelion shift caused by the precession 
of the Earth’s axis (“general precession”; observer 
not on a “stable platform”; see exercise 16.4). 

The perihelion shifts due to a general precession 
and to the gravitational pulls of other planets can 
be calculated with high precision. But in 1973 
there is no fully reliable way to determine the solar 
quadrupole moment. It is conventional to quantify 
the sun’s quadrupole moment by a dimensionless 
parameter / 2 , which appears in the following ex¬ 
pression for the Newtonian potential, 


M e 


(2 COS 2 # - 1 \] 

r 

1 f2 ! 

{ 2 /J 



The Haystack radar antenna, which Irwin Shapiro and his 
group have used to collect extensive data on the systematics 
of the inner part of the solar system. Those data are rapidly 
becoming the most important source of information about 
perihelion shifts. (Picture courtesy of Lincoln Laboratories, 
MIT) 


If the sun were rotating near breakup velocity, J 2 
would be near 1. Very careful measurements of 
the optical shape of the sun [Dicke and Golden- 
berg (1967)] show a flattening, which suggests J 2 
may be near 3 X 10~ 5 . 

The total perihelion shift produced by relativity 
plus solar quadrupole moment is (see exercise 
40.5) 


S(f> = 


a{\ — e 2 ) p ’ 


_2 — /3 + 2y R 0 2 /M 0 

3 + 2 2a(l-e 2 )' 


Note that relativistic and quadrupole shifts have 
different dependences on the semimajor axis a and 
ecentricity e of the orbit. This difference in depen¬ 
dence allows one to obtain values for both tfie 
quadrupole moment parameter J 2 , and the PPN 
parameter J (2 - /3 + 2 y) by combining measure¬ 
ments of 8<j> for more than one planet. 

The experimental results, as of 1972, are as 
follows. 
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I. Data for Mercury from optical studies [Clemence (1943, 1947)]* 

(general relativity with no solar oblateness predicts 43.03"/century) 

Quantity 

(a) Total observed shift per century 

(b) Contribution to shift caused by observer not being in an inertial frame far from 
the sun (“general precession” as evaluated in 1947) 

(c) Shift per century produced by Newtonian gravitation of other planets 

(d) Residual shift per century to be attributed to general relativity plus 
solar oblateness 

(e) Residual shift if one uses the 1973 value for the “general precession” 

(f) Corresponding value of A p (see above) 

Value 

5599".74±0".41 

5025".645 ± 0" 50 

531 ".54 ±0".68 

42".56 0",94 

41 ".4 dt 0".90 

A p = 0.96 dr 0.02 

1970 Results of Shapiro (1970, 1971a,b), Shapiro et aL (1972) 

(a) Values of A p obtained by reanalyzing all the world’s collection of optical data, 
and combining it with radar data 

(b) Value of J 2 obtained by comparing the observed shifts for Mercury and Mars 

f ^Mercury = 1.00 ± 0.01 

U^Mars =107 ±0.10 
J 2 < 3 x 10~ 3 

III. Theoretical implications of Shapiro’s results 

(a) Value of (2 — + 2y)/3 

(b) Value of (5 obtained by combining with y from time delay experiments 
[equation (40.16)] 

100 (-2?o 
>»(-0°2 4 


* Clemence (1947) notes, “The observations cannot be made in a Newtonian frame of reference. They are referred to the moving equinox, 
that is, they are affected by the precession of the equinoxes, and the determination of the precessional motion is one of the most difficult problems 
of observational astronomy, if not the most difficult. In the light of all these hazards, it is not surprising that a difference of opinion could exist 
regarding the closeness of agreement between the observed and theoretical motions.” 

V -—I___/ 


in the sun’s metric. When one takes account of these terms, one finds an additional 
contribution to the perihelion shift, given for small eccentricities e < 1 by 



Perihelion shift due to 
preferred-frame forces 

(40.20) 


[see Nordtvedt and Will (1972)]. Here A/ 0 , Q 0 , and w 0 are the sun’s mass, self-gravi¬ 
tational energy, and rotational angular velocity; w is the sun’s velocity relative to 
the preferred frame; a and e are the semimajor axis and eccentricity of the orbit; 
P is the unit vector pointing from the sun to the perihelion; and Q is a unit vector 
orthogonal to P and lying in the orbital plane. Comparison with observations for 
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Experimental result for a 3 


The future of orbital 
experiments 


Mercury—and combination with limits on a 1 and a 2 discussed below [equations 
(40.46b) and (40.48)]—yields the limit 


w • Q 

a3 200 km/sec 


< 2 X 10" 5 . 


(40.21a) 


Since the velocity of the sun around the Galaxy is —200 km/sec, and the peculiar 
motion of the Galaxy relative to other nearby galaxies is —200 km/sec, a value 
w — 200 km/sec is reasonable. Moreover, there is no reason to believe that w and 
Q are orthogonal, so one is fairly safe in concluding 

|o£ 3 I = 14/Jj — 2y — 2 — f|<2x 10" 5 (40.21b) 

This is a stringent limit on theories that possess universal rest frames. For example, 
with great certainty it rules out a theory devised by Coleman (1971), which has 
p — y = 1, but a 3 = —4; see Ni (1972). 

Looking toward the future, one cannot expect data on orbits of spacecraft to give 
decisive tests of general relativity, despite the high precision (—10 meters in 1972) 
with which spacecraft can be tracked. Spacecraft are buffeted by the solar wind. 
They respond to fluctuations in this wind and in the pressure of solar radiation, 
and respond also to “outgassing” from leaky jets. Unless one can develop excellent 
“drag-free” or “conscience-guided” spacecraft, one must therefore continue to rely 
on planets as the source of data on geodesics. However, planetary data themselves 
can be greatly improved in the future by placing radar transponders on the surfaces 
of planets or in orbit about them, by improvements in radar technology, and by 
the continued accumulation of more and more observations. 


EXERCISES 


Exercise 40.4. DERIVATION OF PERIHELION SHIFT IN PPN FORMALISM 

[See exercise 25.16 for a derivation in general relativity, accurate when gravity is strong (2 M/r 
as large as £) but the orbital eccentricity is small. The present exercise applies to any “metric 
theory” and to any eccentricity, but it assumes gravity is weak (2 M/r < 1) and ignores motion 
relative to any universal rest frame.] Derive equation (40.17) for the shape of any bound 
orbit of a test particle moving in the equatorial plane of the PPN gravitational field (40.3). 
Keep only “first-order” corrections beyond Newtonian theory (first order in powers of M 0 /r). 
[Sketch of solution using Hamilton-Jacobi theory (Box 25.4): (1) Hamilton-Jacobi equation, 
referred to a test body of unit mass, is 


"' - *+«- +[> -- **y 


(2) Solution to Hamilton-Jacobi equation is 


5= -Et+ u± J r {-(1 - £ 2 ) + ^-[l - (1 + Y)(l - £ 2 )] 


(40.22) 
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where post-post-Newtonian corrections have been discarded. In discarding post-post-New- 
tonian corrections, recall that E is the conserved energy per unit rest mass and L is the angular 
momentum per unit rest mass (see Box 25.4). Consequently one has the order-of-magnitude 
relations 


1 — E 2 ~~ (velocity of test body) 2 — M 0 /r 


(M q /I) 2 - (M 0 /rv) 2 ~~ M 0 /r. 

(3) The shape of the orbit is determined by the “condition of constructive interference,” 
dS/dL = 0: 

* = ± f (- - o + v)d - £ 2 )] 

~^[ l -^jf-V-P + 2y)]} V2 d{\/r). 

(4) This integral is readily evaluated in terms of trigonometric functions. For a bound orbit 
(E < 1), the integral is 


where 



+ 



(1 — e 2 )a 
er 


e 




M 

a = -(1 + y)(l -2*)], 

1 — E 2 

l ~ e2 = (^)" (1 ~ '20 + yXI - E 2 ) - 2 (^) 2 ( 2 -0 + 2 y )\ ( 40 . 23 ) 

S<)» 0 =1(2 - 0 + 2y)6ir(M Q /L) 2 . 

(5) Straightforward manipulations bring this result into the form of equations (40.17) and 
(40.18).] 


Exercise 40.5. PERIHELION SHIFT FOR OBLATE SUN 

(a) The Newtonian potential for an oblate sun has the form 


U = 



RJ 3 cos 2 # - 1 


)• 


(40.24) 


where J 2 is the “quadrupole-moment parameter.” One knows that J 2 < 3 X 10~ 5 . Show that 
if an oblate sun is at rest at the origin of the PPN coordinate system, the metric of the 
surrounding spacetime [equations (39.32)] can be put into the form 


*•=.[,. 2 ^ - i,. (M) + 2, (^L)*] * 

+ |^1 4- 2y ~~ j [dr 2 + r\d9 2 + sin 2 # d<$> 2 )\ 

4- corrections of post-post-Newtonian magnitude. 


(40.25) 
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(b) Let a test particle move in a bound orbit in the equatorial plane. Use Hamilton-Jacobi 
theory to show that its orbit is a precessing ellipse [equation (40.17)] with a precession per 
orbit given by 


5<f> 0 — 


2 - p + 2y 677 M q 

3 o(l — e 2 ) 


+ Jo 


3^p 2 
a 2 ( 1 - e 2 ) 2 


(40.26) 


For the significance of this result, see Box 40.3. 


The rest of this chapter is 
Track 2. No earlier Track-2 
material is needed as 
preparation for it, but the 
following will be helpful; 

(1) Chapter 6 (accelerated 
observers); 

(2) §17.6 (no prior geometry); 
and 

(3) Chapters 3B and 39 (tests 
of foundations; other 
theories; PPN formalism). 

It is not needed as prepara¬ 
tion for any later chapter. 
___ / 


3-body effects in lunar orbit: 


(1) theory 


(2) prospects for 
measurement 


§40.6. THREE-BODY EFFECTS IN THE LUNAR ORBIT 

The relativistic effects discussed thus far all involve the spherical part of the sun’s 
external gravitational field, and thus they can probe only the PPN parameters j3 
and y plus the “preferred-frame” parameters a v a 2 , and a 3 . Attempts to measure 
other PPN parameters can focus on three-body interactions (discussed here), the 
dragging of inertial frames by a rotating body (§40.7), anomalies in the locally 
measured gravitational constant (§40.8), and deviations of planetary and lunar orbits 
from geodesics (§40.9). 

There is no better place to study three-body interactions than the Earth-moon 
orbit. The pulls of the Earth, the moon, and the sun all contribute. Perturbations 
in the motion of Earth and moon about their common center of gravity can be 
measured with high precision using laser ranging (earth-moon separation measured 
to —10 cm in early 1970’s) and using a radio beacon on the moon’s surface (angular 
position on sky potentially measurable to better than 0".001 of arc). 

Over and above any Newtonian three-body interactions, the Earth and the sun, 
acting together in a nonlinear manner, should produce relativistic perturbations in 
the lunar orbit that are barely within the range of this technology. These effects 
depend on the familiar parameters y (measuring space curvature) and /? [measuring 
amount of nonlinear superposition, (f/ Earth -I- U sun ) 2 , in g 00 ]. In addition, they 
depend on /3 2 , which regulates the extent to which the sun’s potential, U sxin , acting 
inside the Earth, affects the strength of the Earth’s gravitational pull, causing it to 
vary as the Earth moves nearer and farther from the sun. These effects are expected 
to depend also on f, A v and A 2 , which regulate the extent to which the Earth’s orbital 
momentum and anisotropies in kinetic energy (caused by the sun) gravitate. 

Bromberg (1958), Baierlein (1967), and Krogh and Baierlein (1968) have calculated 
the three dominant three-body effects in the Earth-moon orbit using general relativity 
and the Dicke-Brans-Jordan theory. These effects are noncumulative and have 
amplitudes of —100 cm, —20 cm, and —10 cm. The 100-cm effect [which was 
originally discovered by de Sitter (1916)] is known to depend only on y. The precise 
dependence of the other effects on the PPN parameters is not known. 

The prospects for measuring these effects in the 1970’s are dim; they are masked 
by peculiarities in the orbit of the moon that have nothing to do with relativity. 
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§40.7. THE DRAGGING OF INERTIAL FRAMES 

The experiments discussed thus far study the motion of electromagnetic waves, 
spacecraft, planets, and asteroids through the solar system. An entirely different type 
of experiment measures changes in the orientation of a gyroscope moving in the 
gravitational field of the Earth. This experiment is particularly important because 
it can measure directly the “dragging of inertial frames” by the angular momentum 
of the Earth. 

It is useful, before specializing to a rotating Earth, to derive a general expression 
for the precession of a gyroscope in the post-Newtonian limit. (Track-1 readers, and 
others who have not studied Chapters 6 and 39 , may have difficulty following the 
derivation. No matter. It is the answer that counts!) 

Let S a be the spin of the gyroscope (i.e., its angular momentum vector), and let 
u a be its 4-velocity. The spin is always orthogonal to the 4-velocity, S a u a — 0 (see 
Box 5,6). Assume that any nongravitational forces acting on the gyroscope are applied 
at its center of mass , so that there is no torque in its proper reference frame. Then 
the gyroscope will “Fermi-Walker transport” its spin along its world line (see § 6 . 5 ): 

▼„S = u(a • S), a = Y a u — 4-acceleration. (40.27) 

The objective of the calculation is to write down and analyze this transport equation 
in the post-Newtonian limit. 

The gyroscope moves relative to the PPN coordinate grid with a velocity v j = 
dx^/dt — dXj/dt. Assume that Vj < e, where e is the post-Newtonian expansion 
parameter (e 2 s M Q /R Q ). As the gyroscope moves, it carries with itself an orthonor¬ 
mal frame e * , which is related to the PPN coordinate frame by a pure Lorentz boost, 
plus a renormalization of the lengths of the basis vectors [transformation (39.41)]. 
The spin is a purely spatial vector (5° = 0) in this comoving frame; its length 
(W /2 remains fixed (conservation of angular momentum); and its direction is 
regulated by the Fermi-Walker transport law. 

The basis vectors e - of the comoving frame are not Fermi-Walker transported, 
by contrast with the spin. Rather, they are tied by a pure boost (no rotation!) to 
the PPN coordinate grid, which in turn is tied to an inertial frame far from the 
solar system, which in turn one expects to be fixed relative to the “distant stars.” 
Thus, by calculating the precession of the spin relative to the comoving frame, 

dSj/dr = e m S2 h S h (40.28) 

one is in effect evaluating the spin’s angular velocity of precession, d?*, relative to 
a frame fixed on the sky by the distant stars. 

Calculate dSj/dr: 

ds* 

= ▼„(S • ej) = (V„S) • e, + S • (Y u e } ) = S • Y„ej. (40.29) 
Here use is made of the fact that is in the u direction [equation (40.27)] and 


Gyroscope precession 


(1) general analysis 
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is thus orthogonal to e- y The quantity S- V„e- is readily evaluated in the PPN 
coordinate frame. In the evaluation, one uses as metric coefficients [equations (39.32)] 
the expressions 


*nn = - 1 + 2 U+ 0(€% 

gu=-jW-jW 


8jk — ^*0 + 2y£/) + 0{t 4 ), 

/“preferred- \ + 0(e s ); 

Vframe terms”/ 


(40.30) 


one takes as the components of e- } and S [obtained via the transformation (39.41)] 
the expressions 


e ]° = + 0(e 3 ), ej k = (1 - yU) 8 jk + y v k Vj + 0(e 4 ), 

So = v,Sj + 0(£ 3 S } .), 
s k = (1 - T U)S k + I v k v } Sj + 0(e45 } .); 


(40.31) 


and one uses the relation 


dVj/dr = a § + Uj + 0(e 2 Uj) (40.32) 

where (assumed < UJ) are the components of the 4-acceleration. One finds (see 
exercise 40.6) for the precession of the spin the result 

dS)/dj = S V u e- = S t [v u a k] + g 0[kJl - (2y + l)v u U kl ]. 

Rewritten in three-dimensional vector form this result becomes 

(2) general PPN formula for dS/dr = J2 X S, (40.33a) 

precession 

S2=-jvx<i-jvxg + (y+j)rXV[/, (40.33b) 

8 = go, e i- (40.33c) 

In this final answer it does not matter whether the 3-vectors entering into O are 
evaluated in the coordinate frame or in the comoving orthonormal frame, since e* 
and 0 /dXj differ only by corrections of order e 2 . 

Equations (40,33) describe in complete generality at the post-Newtonian^level of 
approximation the precession of the gyroscope spin S relative to the comoving ortho- 
normal frame that is rotationally tied to the distant stars. 

For an electron with spin S in orbit around a proton, only the first term, — \v x a, 
is present (no gravity). This term leads to the Thomas precession, which plays an 
important role in the fine structure of atomic spectra [see, e.g., Ruark and Urey 
(1930)]. For other ways of deriving the Thomas precession, see exercise 6.9 and §41.4. 

The Thomas precession comes into play for a gyroscope on the surface of the 
Earth (a = Newtonian acceleration of gravity), but not for a gyroscope in a freely 
moving satellite. 


(3) specialization: Thomas 
precession 
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If one ignores the rotation of the Earth and preferred-frame effects, and puts the ( 4 ) 
PPN coordinate frame at rest relative to the center of the Earth, then g 0j vanishes 
and 0 is given by 


fi = v x[-|a+( r + i)vc/j 

= yv x VU for gyroscope on Earth’s surface 
= (t + y)*' X Vf7 for gyroscope in orbit. 


(40.34) 


The general-relativistic term (y + j)v X V[/ is caused by the motion of the gyroscope 
through the Earth s curved, static spacetime geometry. Notice that it depends solely 
on the same parameter y as is tested by electromagnetic-wave experiments. In order 
of magpitude, for a gyroscope in a near-Earth, polar orbit, 


J? 


, 3/A4 

'2\R b 


k\ 1/2 (MA 

'J \R 2 J 


■ 8 seconds of arc per year. 


(40.35) 


The general-relativistic precession fv x VU was derived by W. de Sitter (1916) for 
the “Earth-moon gyroscope” orbiting the sun. Eleven years later L. H. Thomas (1927) 
derived the special relativistic precession — x a for application to atomic physics. 

The Earth s rotation produces off-diagonal terms, g 0j , in the PPN metric (exer- (5) 
cise 40.7): 


8 £°> e ’ (4 + 4 A ‘ 2 )~r3 r ■ (40.36) 

Here J is the Earth’s angular momentum. These off-diagonal terms contribute an 
amount 


9 = 


-y v x« = 




3(/-r)r-| 

r 2 J 


(40.37) 


to the precession of the gyroscope. Notice that this contribution, unlike the others, 
is independent of the linear velocity of the gyroscope. One can think of it in the 
following way. 

The gyroscope is rotationally at rest relative to the inertial frames in its neighbor¬ 
hood. It and the local inertial frames rotate relative to the distant galaxies with the 
angular velocity Q because the Earth’s rotation “drags” the local inertial frames 
along with it. Notice that near the north and south poles the local inertial frames 
rotate in the same direction as the Earth does (9 parallel to/), but near the equator 
they rotate in the opposite direction (9 antiparallel to /; compare 9 with the 
magnetic field of the Earth!). Although this might seem paradoxical at first, an 
analogy devised by Schiff makes it seem more reasonable.* Consider a rotating, solid 
sphere immersed in a viscous fluid. As it rotates, the sphere will drag the fluid along 
with it. At variou s points in the fluid, set down little rods, and watch how the fluid 

•This analogy can be made mathematically rigorous; see footnote on p. 255 of Thome (1971); see 
also, §21.12 on Mach's principle. 


specialization: precessions 
due to acceleration and 
Earth's Newtonian 
potential 


specialization: precession 
due to Earth’s rotation 
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rotates them as it flows past. Near the poles the fluid will clearly rotate the rods 
in the same direction as the star rotates. But near the equator, because the fluid 
is dragged more rapidly at small radii than at large, the end of a rod closest to 
the sphere is dragged by the fluid more rapidly-than the far end of the rod. Conse¬ 
quently, the rod rotates in the direction opposite to the rotation of the sphere. 

In order of magnitude, the precessional angular velocity caused by the Earth’s 
rotation is 


Q ~~ J e /Re ~~ 0*1 seconds of arc per year. (40.37') 

(6) prospects for measuring Both this precession, and the larger one [equation (40.35)] due to motion through 
precession the £ ar th’s static field, may be detectable in the 1970’s. Equipment aimed at detecting 

them via a satellite experiment is now (1973) under construction at Stanford Univer¬ 
sity; see Everitt, Fairbank, and Hamilton (1970); also O’Connell (1972).* 

The gyroscope precession produced by motion of the Earth relative to the preferred 
frame (if any) is too small to be of much interest. 

*The dragging of inertial frames by a rotating body plays important roles elsewhere in gravitation 
physics, e.g., in the definition of angular momentum for a gravitating body (§19.2), and in black-hole 
physics (Chapter 33). The effect was first discussed and calculated by Thirring and Lense (1918). More 
recent calculations by Brill and Cohen (1966) of idealized situations where the effect may be large give 
insight into the mechanism of the effect. See also the discussion of Mach’s principle in §21.12. 


EXERCISES Exercise 40.6. PRECESSIONAL ANGULAR VELOCITY 

Derive equations (40.33) for the precession of a gyroscope in the post-Newtonian limit. Base 
the derivation on equations (40.29)-(40.32). 

Exercise 40.7. OFF-DIAGONAL TERMS IN METRIC ABOUT THE EARTH 

Idealize the Earth as an isolated, rigidly rotating sphere with angular momentum /. Use 
equations (39.34b,c) and (39.27) to show that (in three-dimensional vector notation) 

V = Vf- = W= Wje J -jJx r/r 3 (40.38) 

outside the Earth, in the Earth’s PPN rest frame. From this, infer equation (40.36). 

Exercise 40.B. SPIN-CURVATURE COUPLING 

Consider a spinning body (e.g., the Earth or a gyroscope or an electron) moving through 
curved spacetime. Tidal gravitational forces produced by the curvature of spacetime act on 
the elementary pieces of the spinning body. These forces should depend not only on the 
positions of the pieces relative to the center of the object, but also on their relative velocities. 
Moreover, the spin of the body, 

S == J (pr X v) d{ volume) in comoving orthonormal frame, 

is a measure of the relative positions and velocities of its pieces. Therefore one expects the 
spin to couple to the tidal gravitational forces—i.e., to the curvature of spacetime—producing 
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deviations from geodesic motion. Careful solution of the PPN equations of Chapter 39 for 
general relativity reveals [Papapetrou (1951), Pirani (1956)] that such coupling occurs and 
causes a deviation of the worldline from the course that it would otherwise take; thus, 

Du a _ D 2 u e „ 1 , 

m ~dT = + y <(40.39) 

Evaluate, in order of magnitude, the effects of the supplementary term on planetary orbits 
in the solar system. 

[Answer: They are much too small to be detected. However, there are two other material 
places to look for the effect. (1) when a rapidly spinning neutron star, or a black hole endowed 
with substantial angular momentum enters the powerful tidal field of another neutron star 
or black hole; and (2) when an individual electron, or the totality of electrons in the “Dirac 
sea of negative energy states,” enter a still more powerful tidal field (late phase of gravita¬ 
tional collapse). Such a tidal field, or curvature, pulls oppositely on electrons with the two 
opposite directions of spin [Pirani (1956); DeWitt (1962), p. 338; Schwinger (1963a,b)] just 
as an electric field pulls oppositely on electrons with the two opposite signs of charge 
[“vacuum polarization”; see especially Heisenberg and Euler (1936)]. In principle, the tidal 
field pulling on the spin of an electron need not be due to “background” spacetime curvature; 
it might be due to a nearby massive spinning object, such as a “live” black hole (chapter 
33) [“gravitational spin-spin coupling”; O’Connell (1972)]. 


§40.8. IS THE GRAVITATIONAL CONSTANT CONSTANT? 

The title and subject of this section are likely to arouse confusion. Throughout this 
book one has used geometrized units, in which G = c = 1. Therefore, one has locked 
oneself into a viewpoint that forbids asking whether the gravitational constant 
changes from event to event. 

False! One can perfectly well ask the question in the context of G = c = 1, if 
one makes clear what is meant by the question. 

In §§1.5 and 1.6, c was defined to be a certain conversion factor between centi¬ 
meters and seconds; and G/c 2 was defined to be a certain conversion factor between 
grams and centimeters. These definitions by fiat do not guarantee, however, that 
a Cavendish experiment* to measure the attraction between two bodies will yield 

Force = — Gm x m 2 /r 2 = ~~m 1 m 2 /r 2 . 

If general relativity correctly describes classical gravity, and if the values of the 
conversion factors G and c have been chosen precisely right, then any Cavendish 
experiment, anywhere in the universe, will yield “Force = -m x m 2 fr 2 '\ But if the 


*See any standard textbook for a description of Cavendish experiments. By his original version of 
the experiment, with two separated spheres suspended by fine wires, Henry Cavendish (1798) inferred 
the mass and hence the density of the Earth. He reported: “By a mean of the experiments made with 
the wire first used, the density of the Earth comes out 5.48 times greater than that of water; and by 
a mean of those made with the latter wire it comes out the same; and... the extreme results do not 
differ from the mean more than 0.38, or 1/14 of the whole.” The most precise method of measuring 
G today [Rose et al (1969)] gives G c = (6.674 ± .004) x 10~ 8 cm 3 /g sec 2 (one standard deviation). 
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"Cavendish gravitational 
constant," G c , defined 


Changes of G c with time 


Spatial variations in G c 


Dicke-Brans-Jordan theory, or almost any other metric theory gives the correct 
description of gravity, the force in the Cavendish experiment will depend on where 
and when the experiment is performed, as well as on m v m 2 , and r. To discuss 
Cavendish experiments as tests of gravitation theory, then, one must introduce a 
new proportionality factor 

G c = G Cavendish = (“Cavendish gravitational constant”), (40.40) 

which enters into the Newtonian force law 

Force — ~G c m 1 m 2 /r 2 . (40.41) 

This Cavendish constant will be unity in general relativity, but in most other metric 
theories it will vary from event to event in spacetime. 

In some theories, such as Dicke-Brans-Jordan, the Cavendish constant is deter¬ 
mined by the distribution of matter in the universe. As a result, the expansion of 
the universe changes its value: 

1 dG c / 0.1 to 1 \ -1 

G c dt \age of universe/ 10 10 or 10 11 years 

[see, e.g. Brans and Dicke (1961)]. A variety of observations place limits on such 
time variations. Big time changes in G c during the last 4.6 billion years would have 
produced marked effects on the Earth, the sun, and the entire solar system. The 
expected geophysical effects have been summarized and compared with observations 
by Dicke and Peebles (1965). It is hard to draw firm conclusions because of the 
complexity of the geophysics involved, but a fairly certain limit is 

(\/G c )(dG c /dt) < 1/10 10 years (geophysical). (40.42a) 

Eventually, high-precision measurements of the orbital motions of planets will yield 
a better limit. For the present, planetary observations show 

(1 /G c )(dG c /dt) < 4/10 10 years (planetary orbits) (40.42b) 

[Shapiro, Smith, et al (1971)]. These limits are tight enough to begin to be interesting, 
but not yet tight enough to disprove any otherwise viable theories of gravity. 

If G c is determined by the distribution of matter in the universe, then it should 
depend on where in the universe one is, as well as when. In particular, as one moves 
from point to point in the solar system, closer to the Sun and then farther away, 
one should see G c change. Indeed this is the case in most metric theories of gravity, 
though not in general relativity. Analyses of Cavendish experiments using the PPN 
formalism reveal spatial variation in G c given by 

AG C = -2 G c (p + y-p 2 -\)U (40.43) 

[Nordtvedt (1970, 1971a); Will (1971b)]. 

The amplitude of these variations along the Earth’s elliptical orbit is AG C /G C ~ 
10“ 10 , if ft + y — f} 2 — 1 ~ 1. This is far too small to measure directly in the 
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1970’s. Despite great ingenuity and effort, the most accurate experiments measuring 
the value of G c have precisions in 1972 no better than 1 part in 10 4 [see Beams 
(1971)]. Experiments to search for yearly variations in G c on Earth without measuring 
the actual value (“null-type experiments”) can surely be performed with better 
precision than 1 in 10 4 —but not with precisions approaching 1 in 10 10 . On the other 
J)and, /ndirect consequences of a spatial variation of Gq in the solar system are almost 
certainly measurable (see §40.9 below). 

In Ni’s theory of gravity (Box 39.1), and other two-tensor or vector-tensor theories 
like it, where the prior geometry picks out a preferred “universal rest frame,” the 
Cavendish constant G c can depend on velocity relative to the preferred frame. For 
Cavendish experiments with two equal masses separated by distances large compared 
to their sizes, G c varies as 



[Will (1971b)]. Here v is the velocity of the Cavendish apparatus relative to the 
preferred frame, and n is the unit vector between the two masses. For experiments 
where one body is a massive sphere (e.g., the Earth), and the other is a small object 
on the sphere’s surface, G c varies as 


AG c /G c = 1 [(« 3 - a,) + a 2 ( 1 - I/MR 2 ))v 2 
- ya 2 (l - 3 I/MR 2 ){vrif 


(40.44') 


[Nordtvedt and Will (1972)]. Here M and R are the mass and radius of the sphere, 
and 


/ = J (pr 2 )47rr 2 dr 


is the trace of the second moment of its mass distribution. Consequences of these 
effects for planetary orbits have not yet been spelled out, but consequences for 
Earthbound experiments have. 

Think of a Cavendish experiment in which one mass is the Earth, and the other 
is a gravimeter on the Earth’s surface. The gravimeter gives a reading for the “local 
acceleration of gravity,” 

S = ^Earth/^Earth 2 * (40.45) 

As the Earth turns, so the unit vector n between its center and the gravimeter rotates, 
G c and hence g will fluctuate with a period of 12 sidereal hours and an amplitude 

(A?/<?) amplitude "^*-*2*’ COS 2 $ m . 

Here * s the minimum, as the Earth rotates, of the angle between v (constant 
vector) and n (rotating vector). (Note: we have used the value I/MR 2 ~ 0.5 for the 
Earth.) These fluctuations will produce tides in the Earth of the same type as are 


Dependence of G c on 
velocity 


Anomalies in Earth tides 
to anisotropies in G c : 
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(1) experimental value of a 2 


(2) experimental disproof of 
Whitehead theory 


Anomalies in Earth rotation 
rate due to dependence of G c 
on velocity 


produced by the moon and sun. As of 1972, gravimeter measurements near the 
Earth’s equator show no sign of any anomalous 12-sidereal-hour effects down to 
an amplitude of ^10~ 9 [Will (1971b)]. Consequently, 

. |a 2 l 1/2 u cos 8 m = \A 2 + l - 1 \ V2 v cos 8 m < 6 X 10“ 5 ~ 20 km/sec. (40.46a) 

Using a rough estimate of v ~~ 200 km/sec for the—Earth’s velocity-relative ^to-the 
universal rest frame, and 8 m < 60° for the angle between v and the Earth’s equatorial 
plane, one obtains the rough limit 


|a 2 | = |4 + 1| < 0.03. (40.46b) 

[This limit does not affect the three theories in Box 39.1; of them, only Ni’s theory 
has prior geometry and a universal rest frame; and it predicts isotropic effects in 
AG C /G c [equation (40.44)], but no anisotropic effects. However, other theories with 
universal rest frames—e.g. Papapetrou’s (1954a,b,c) theory—are ruled out by this 
limit; see Ni (1972), Nordtvedt and Will (1972).] 

Whitehead’s theory of gravity (which is a two-tensor theory with a rather different 
type of prior geometry from Ni’s) predicts that the galaxy should produce velocity- 
independent anisotropies in G c . These, in turn, would produce Earth tides with 
periods of 12 sidereal hours and amplitudes of 


Ag/g* 2 X lO" 7 - 


100 X 


( experimental limit on' 
such amplitudes 


[Will (1971b)]. The absence of such tides proves Whitehead’s theory to be incor¬ 
rect—a feat of disproof beyond the power of all redshift, light-deflection, time-delay, 
and perihelion-shift measurements. (For all these “standard experiments,” the 
predictions of Whitehead and Einstein are identical!) 

Equation (40.44') predicts a periodic annual variation of the Cavendish constant 
on Earth, as the Earth moves around the sun: 


( velocity of Earth\ /velocity of sun relative \ 

relative to sun / \to preferred frame / V]S W ’ 


(^Gq/Gq) aver aged over ? \ T ^ ®l)( 

Surface of Earth ^ \ ^ / \ 


_ _ H ’ 2 + V p 2 + 2w' 

Surface of Earth 

[varies sinusoidally with period of one year 


4 


v e)- 


(40.47) 


This annual variation, assuming all PPN parameters are of order unity, is 1,(J00 times 
larger than the one produced by the Earth’s motion in and out through the sun’s 
gravitational potential [equation (40.43)]. In response to this changing Cavendish 
constant, the Earth’s self-gravitational pull should change, and the Earth should 
“breathe” inward (greater pull) and outward (relaxed pull). The resulting annual 
variations in the Earth’s moment of inertia should produce annual changes in its 
rotation rate co (changes in “length of day” as measured by atomic clocks): 


&o/co — 0.1 + « 3 — 


H’ • V 


E 
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[Nordtvedt and Will (1972)]. Comparison with the measured annual variations of 
rotation rate (all of which geophysicists attribute to seasonal changes in the Earth’s 
atmosphere) yields the following limit 


2 

J«2 


+ a, 


= <0.2. 


*1 — N 


(40.48) 


[See Nordtvedt and Will (1972)]. This limit rules out several preferred-frame theories 
of gravity, including that of Ni (Boxes 39.1 and 392). 

The experimental results (40.21), (40.46), and (40.48), when combined, place the 
following very rough limits on any theory that possesses a Universal rest frame: 


laj = |7 4 + A 2 - 4y - 4| < 0.2, 

5 |a 2 | = M 2 + f — 1 | < 0.03, (40.49) 

N = 14/?! — 2y — 2 — f|<2x lO" 5 . 

These hmits completely disprove all theories with preferred frames that have been 
examined to date except one devised by Will and Nordtvedt [see Ni (1972V 
Nordtvedt and Will (1972)]. 

In some theories of gravity, the result of a Cavendish experiment depends on the 
chemical composition and internal structure of the test bodies (exercises 40.9 and 
40.10). Kruezer (1968) has performed the most accurate search for such effects to 
date. He finds that G c is the same for fluorine and bromine to a precision of 


(j c (bromine) - G c (fluorine) 
G c 


< 5 x 10~ 5 . 


(40.50) 


Exercise 40.9. CAVENDISH CONSTANT FOR IDEALIZED SUN 

Idealize the sun as a static sphere of perfect fluid at rest at the origin of the PPN coordinates. 
Then its external gravitational field has the form (40.3), with M Q given by (40.4). Conse¬ 
quently, a test body of mass m, located far away at radius r, is accelerated by a gravitational 


Force = —mM 0 /r 2 . 


(40.51a) 


(a) Calculate the mass of the sun, M, in the sense of the amount of 
construct it by adding one spherical shell of matter on top of another, 
inside outward. [Answer: 


energy required to 
working from the 


M — f Po( l + 77 + 3y(/)47rr 2 dr - i- f ° p 0 (/4ct- 2 dr 

___ _ _, v ^ , 

rest mass + internal energy gravitational potential energy 

= jf P 0 [l + /7 + (3y_i)c] 4xr 2 dr.] (40 . 51 b) 


(b) Use the virial theorem [equation (39.21b)] to rewrite equation (40.4) in the form 

Po [ 1 + P 3 n + {lp 2 + j u] 4irr 2 dr. (40.51 c) 


Experimental value of 

K + «3 - «1 


Dependence of G c on 
chemical composition 


EXERCISES 
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(c) Combine the above equations with the definition 

Force = — G c mM/r 2 (40.5 Id) 

of the Cavendish constant for r far outside the sun, to obtain 

( mass of sun as defined by its effect in \ 
bending world line of a faraway test particle/ 

G c = /mass-energy as defined by applying law of\ 

I conservation of energy to the steps in the ) (40.52) 

\construction of the sun / 

= 1 + / (Po/M>)[(& - >)/7 + y (4/? 2 + p 4 - 6y + \)U}4irr*dr. 

Unless /? 3 = 1, and 4fi 2 + /? 4 - 6y -F 1 = 0 (as they are, of course, in Einstein’s theory), 
G c will depend on the sun’s internal structure! Specialize equation (40.52) to “conservative 
theories of gravity (Box 39.5), and explain why the result is what one would expect from 
equation (40.43). 

Exercise 40.10. CAVENDISH CONSTANT FOR ANY BODY 

Extend the analysis of exercise 40.9 to a source that is arbitrarily stressed and has arbitrarv 
shape and internal velocities (subject to the constraints v 2 < 1, |q$.|/p 0 < 1, U < 1, IT < 1, 
of the post-Newtonian approximation). Assume that the body is at rest relative to the 
universal rest frame. Show that G c depends on the internal structure of the source unless 

2 4/? 2 + /? 4 -6y= -1, /? 3 =1, f = 0, tj = 0. (40.53) 

Of course, these PPN constraints are all satisfied by Einstein’s theory. 


§40.9. DO PLANETS AND THE SUN MOVE 
ON GEODESICS? 


The sense in which general 
relativity predicts geodesic 
motion for planets and sun 


Crucial to solar-system experiments is the question of whether the sun and the planets 
move on geodesics of spacetime. This question is complicated by the contributions 
to the spacetime curvature made by the moving body itself. 

To elucidate the question—and to obtain an answer in the framework of general 
relativity—consider an “Einstein elevator” type of argument. The astronomical object 
under consideration has an outer boundary, and each point on this boundary 
describes a world line. These world lines define a world tube. Some distance outside 
of this world tube construct a “buffer zone” as in §20.6. Tailor its inner and outer 
dimensions, according to the mass and moments of the object and the curvature 
of the enveloping space (“strength of the tide-producing force of the external gravi¬ 
tational field”), in such a way that the departure e (cf. §20.6) of the metric from 
flatness in this buffer zone takes on values equal at most to twice the extremal 
achievable value (a minimum with respect to variations in r, a maximum 
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with respect to variations in direction; in other words, a minimax). Then, apart from 
errors of order « extrem , the object can be regarded as moving in an asymptotically 
flat space. The law of conservation of total 4-momentum applies. It assures one that 
the object moves in a (locally) straight line with uniform velocity. Consider, next, 
a “background geometry” that agrees just outside the buffer zone with the actual 
geometry to accuracy f^tTem or better, but that inside is a source-free solution of 
Einstein s field equation. Then, to an accuracy governed by the magnitude of £ extrem , 
the locally straight line along which the astronomical object moves will be a geodesic 
of this background geometry. 

Insofar as one can give any well-defined meaning to the departure of the actual 
motion from this geodesic (a task complicated by the fact that the background 
geometry does not actually exist!), one can calculate this departure by making use 
ofthe^PN formalism or some other approximation scheme [see, e.g., Taub (1965)]. 
This deviation springs ordinarily in substantial measure, and sometimes almost 
wholly, from a coupling between the Riemann curvature tensor of the external field 
and the multipole moments of the astronomical object (angular momentum associ¬ 
ated with rotation; quadrupole and higher moments associated with deformation; 
see, e.g., exercises 40.8 and 16.4). This coupling is important for the Earth-moon 
system, but one need not use relativity to calculate it; Newtonian theory does the 
job to far greater accuracy than needed—or would, if one understood the interiors 
of the Earth and the moon well enough! For the planets and sun, the effect is 
negligible. (Exercise: use Newtonian theory to prove so!). 

Thus, in general relativity as applied to the solar system, one can approximate 
the orbit of the sun, the Earth-moon mass center, and each other planet, as a geodesic 
of that “background spacetime geometry” which would exist if its own curvature 
effects were absent. This is the approach used to analyze the perihelion shift for 
planets in §40.5 in the context of general relativity, and to derive in exercise 39.15 
the post-Newtonian “many-body equations of motion.” 

In most other metric theories of gravity, including the Dicke-Brans-Jordan theory, 
there are substantial departures from geodesic motion. The “Einstein elevator” 
argument fails in these theories because spacetime is endowed not only with a metric, 
but also with a long-range field that couples indirectly (cf. §§38.7 and 39.2) to 
massive, gravitating bodies. 

This phenomenon is best understood in terms of Dicke’s argument about the 
influence of spatial variations of the fundamental constants on experiments of the 
Eotvos-Dicke type (see §38.6). In a theory where the Cavendish gravitational constant 
G c depends on position (as it does not and cannot in general relativity), a body 
with significant self-gravitational energy must fall, in a perfectly uniform 
external Newtonian gravitational field, with an anomalous acceleration: 


Deviations from geodesic 
motion: 


(1) due to curvature coupling 


(2) due to spatial 
dependence of 
gravitational constant 
(Nordtvedt effect) 


( acceleration of 
massive body 


H 


acceleration of 
test body 




_ . figrav vr 

MG C c 


(40.54) 
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Nordtvedt effect in 
Earth-moon orbit 


[see equation (38.15)]. In Dicke-Brans-Jordan theory, G c is essentially the reciprocal 
of the scalar field; and it contains a small part that is proportional to the Newtonian 
potential, U [equation (40.43) with the appropriate values of the parameters from 
Box 39.2]. As a result, the sun falls with an acceleration smaller by one part in 10 6 
than the acceleration of a test body; Jupiter falls with an acceleration one part in 
10 9 smaller; and the Earth, one part in 10 10 smaller. Translated into relativistic 
language: the scalar field, by influencing the gravitational self-energy of a massive 
body, produces deviations from geodesic motion. 

One can use the full PPN formalism of Chapter 29 to calculate the motion of 
massive bodies in any metric theory of gravity. Nordtvedt (1968b) and Will (1971a) 
have done this. They find that a massive body at rest in a uniform external field 
experiences a (Newtonian-type) PPN coordinate acceleration given by 


d 2 X; _ dU 

dt 2 ~ jk dx k ’ 

where Eft is a quantity depending on the body’s structure: 


Eft = S ik {l - (7 A, - 3y - 4/})^v] - (2/? + 2/? 2 - 3y + A, - 2)^, 


o Mi 


(40.55) 


» - 1 [ PoPo^i - w* - *k) d 3 x d 3 r , F ^ r 

2 j \x-xV £ g rav= 


Here m is the body’s total mass-energy, Q- k is the “Chandrasekhar potential-energy 
tensor,” and is the body’s self-gravitational energy. [Note: Dicke’s method of 
calculating the anomalous acceleration (40.54) breaks down in theories that are not 
“conservative” (Box 39.5).] 

In general relativity, the combinations of PPN coefficients appearing in E- k vanish; 
so Ejk — fyfc’ and the body falls with the usual acceleration—i.e., it moves along 
a geodesic. But in most other theories of gravity e» * s ik ; the body does not move 
on a geodesic; and its acceleration may even be in a different direction than the 
gradient of the Newtonian potential! 

This predicted departure from geodesic motion is called the "Nordtvedt effect” 
The possibility of such an effect was first noticed in passing by Dicke (1964c), but 
was discovered independently and explored in great detail by Nordtvedt (1968a,b). 
The Nordtvedt effect in a theory other than general relativity produces a number 
of phenomena in the solar system that are potentially observable. [See Nordtvedt 
(1971b) for an enumeration and references.] The effect most suitable for a test is 
a “polarization” of the Earth-moon orbit due to the fact that the moon should fall 
toward the sun with a greater acceleration than does the Earth. This “polarization” 
results in an eccentricity in the orbit that points always toward the Sun and has 
the amplitude 
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dr = 840 [3y + 40 — 7 A, — 1 (20 + 20 2 - 3y + A 2 - 2)] cm (40.56) 

= 67 meters in Ni’s theory (Boxes 39.1 and 392) 

8 4 

= 2 + ~ me lers m Dicke-Brans-Jordan theory (Boxes 39.1 and 39.2) 

= 0 in Einstein’s theory. 


^ Box 40.4 CATALOG OF EXPERIMENTS 




Type of experiment 


Description of experiment 


Where discussed 


I. Tests of foundations of 
general relativity 


II. Post-Newtonian 

(“solar-system”). — - 
experiments 


III. Cosmological 
observations 


IV. Gravitational-Wave 
experiments 


1. Tests of uniqueness of free fall (EStvds-Dick e-Braginsky 
experiments 

2. Tests for existence of metric (time dilation of panicle de¬ 
cays; role of Lorentz group in panicle kinematics; etc.) 

3. Searches for new, direct-coupling, long-range fields. (Hughes- 
Drever experiment; ether-drift experiments) 

4. Gravitational redshift experiments 


5. Constancy, in space and time, of the nongravitational phys¬ 
ical constants 


§38.3; Figure 1.6; 
Box 1.1 

§38.4 

§38.7; Figure 38.3 
§38.5; Figures 38.1 
and 38.2; §§7.2, 
7.3, and 7.4 

§38.6 


1. Deflection of light and radio waves by Sun 

2. Relativistic delay in round-trip travel time for radar beams 
passing near Sun 

3. Perihelion shifts and periodic perturbations in planetary 
orbits 

4. Three-body effects in the Lunar orbit 

5. Precession of gyroscopes (“geodetic precession” and preces¬ 
sion due to dragging of inertial frames by Earth’s rotation) 

6. Spatial variation of the Cavendish gravitational constant in 
the solar system 

7. Dependence of the Cavendish gravitational constant on the 
chemical composition of the gravitating body 

8. Earth tides with sidereal periods 

9. Annual variations in Earth rotation rate 

10. Periodicities in Earth-Moon separation due to breakdown of 
geodesic motion 


§40.3; Box 40.1 ' 

§40.4; Box 40.2 

§40.5; Box 40.3 
§40.6 

§40.7 

§§40.8 and 40.9 

§40.8 

§40.8 

§40.8 

§40.9 


1. Change of Cavendish gravitational constant with time in 
solar system 

2. Large-scale features of universe (expansion, isotropy, homo¬ 
geneity; existence and properties of cosmic microwave 
radiation; ...) 

3. Agreement of various measures of age of universe (age from 
expansion; ages of oldest stars; age of solar system) 


§40.8 

Chapters 27-30; 
especially Chapter 29 

§29.7 


Existence of waves; propagation speed; polarization 
properties; ... 


Chapters 35-37; 
especially Chapter 37 
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Figure 40.4. (facing page) 

Measuring the separation between earth and moon by determining the time-delay (about 2.5 sec) between 
the emission of light from a laser on the earth and the return of this light to the earth. A key element 
in the program is a comer reflector, the first of which was landed on the moon July 20, 1969, by the 
Apollo 11 flight crew. In November 1971, there were three such reflectors on the moon: two American, 
and one French-built and Soviet-landed. A pulsed ruby laser projects a beam out of the 107-inch reflecting 
telescope of the McDonald Observatory of the University of Texas, on Mount Locke, 119 miles east 
of El Paso. This beam makes a spot of light on the moon’s surface about 3.2 km in diameter. Laser 
light is bounced straight back to the earth by the “laser ranging retroreflectors” (LR 3 ). Each consists 
of an aluminum panel of 46 cm by 46 cm with 100 fused silica comer cubes each 3.8 cm in diameter. 
The first reflector ever set up appears in the first inset, near the lunar landing module. It is tilted with 
respect to the landscape of the moon. The photograph was made shortly before astronauts Neil A. 
Armstrong and Edwin E. Aldrin, Jr., took off for the earth. The second inset is a photograph made 
by D. G. Currie of the field of view in the guiding eyepiece of the McDonald 107-inch telescope in 
an interval when the laser was not firing at the Apollo 11 site. One guides the telescope to Tranquility 
Base (small circle) by aligning fiducial marks on more visible moonscape features. In November 1971, 
the LR? experiment and continuing time-of-fiight measurements were the responsibility of the National 
Aeronautics and Space Administration and a Lunar Retroreflecting Ranging Team of representatives 
from several centers of research. One of the members of this team, Carroll Alley, of the University of 
Maryland, is hereby thanked for his kindness in providing the photographs used in this montage. Thanks 
to this NASA work, the distance between the laser source on the earth and the reflectors on the moon 
is known with an accuracy now better than half a meter. The astronauts left behind on the moon not 
only LR 3 and a seismometer and other equipment, but also a plaque: “We came in peace for all 
mankind.” 



By the mid 1970’s, lunar laser-ranging data will probably be able to determine the 
amplitudes of this polarization to a precision of one meter or better [see Bender 
et ah (1971); also Figure 40.4]. 


§40.10. SUMMARY OF EXPERIMENTAL TESTS 
OF GENERAL RELATIVITY 

No longer is general relativity “a theorist’s Paradise, but an experimentalist’s Hell.” 
It is now a Paradise for all—as one can see quickly by perusing the catalog of 
experiments given in Box 40.4 on page 1129. Moreover, general relativity has 
emerged from each of its tests unscathed—a remarkable 1973 tribute to the 1915 
genius of Albert Einstein. 



PART 


FRONTIERS 


Wherein the reader — who, during a life of continued variety for 
forty chapters (besides the Preface ), was eight chapters a 
mathematician, four times enticed (once by an old friend), four 
chapters a cosmologist , and four chapters a transported 
astrophysicist in the land of black holes, and who at last 
inherited a wealth of experiments, lived honest, and became a 
True Believer—now ventures forth in search of new 

frontiers to conquer . 




§41.1. REFLECTIONS, ROTATIONS, AND 

THE COMBINATION OF ROTATIONS 


Spinors and their applications in relativity grew out of the analysis of “rotations,” 
first in space, then in spacetime. Take a cube (Figure 41.1). Rotate it about one 
axis through 90°. Then pick another axis at right angles to the first. About it rotate 
the cube again through 90°. In this way the cube is carried from the orientation 
marked Initial” to that marked “Final.” How can one make this net transformation 

in a single step, with a single rotation?_Jn^ other words, what is the law for the 
—combination OfTotations? 

Were rotations described by vectors, then one could apply the law of combination 
of vectors. The resultant of two vectors of the same magnitude (90°) separated by 
a right angle, is a single vector that (1) lies in the same plane and (2) has the 
magnitude 2 1/2 x 90° = 127.28°. Both predictions are wrong. To turn the cube from 
initial to final orientation in a single turn, (1) take an axis running from the center 
through the vertex A and (2) rotate through 120°. 

--What-x^mputarionaF^IgdritEm can ever reproduce a law of combination of 

rotations apparently so strange? On the evening of October 16,1843, William Rowan 
Hamilton was walking with his wife along the Royal Canal in Dublin when the 
answer leaped to his mind, the fruit of years of reflection. With his knife he then 
and there carved on a stone on Brougham Bridge the formulas* 

i 2 = j 2 = k 2 = ijk = — 1, 


r~ - 

This chapter is entirely 
Track 2. IMo earlier Track-2 
material is needed as 
preparation for it, nor is it 
needed as preparation for any 
later chapter. 

^--- j 


The problem of combining 
rotations 


*In the same city on June 21, 1972 President Eamon de Valera told one of the authors that, while 
in jail one evening in 1916, scheduled to be shot the next morning, he wrote down the formula of which 
he was so fond, i 2 = p = k 2 = ijk = -1. 







Figure 41.1. 


Rotation about the vertical axis through 90°, followed by rotation about the horizontal axis through 
90°, gives a net change in orientation that can be achieved by a single rotation through 120° about 
an axis emergent from the center through the comer A. 


Rotation operators: 
(1) defined 


(2) as tools in combining 
rotations 


which in today’s notation, 


0 

1 




1 

0 



(41.1) 


take the form 

°x 2 = ° v 2 = = 1 . 

° x ° v = —OyG x = io x (and cyclic permutations). 


(41.2) 


To any rotation is associated a quantity (Hamilton’s “quaternion;” today’s “spin 
matrix” or “spinor transformation” or “rotation operator”) 


R = cos (9/2) - i sin (9/2)(a x cos a + a y cos [i + o x cos y), (41.3) 


where 6 is the angle of rotation and a, fi, y are the angles between the axis of rotation 
and the coordinate axes. A rotation described by R t followed by a rotation described 
by R 2 gives a net change in orientation described by the single rotation 


R 3 = R 2 R V (41.4) 

This is Hamilton’s formula for the combination of two rotations (steps Towantit— 
by Euler in 1776; obtained by Gauss in 1819 but never published by him). 

In the example in Figure 41.1, * 

/^(rotation by 6 = 90° about z-axis) = (1 — io z )/2 1/2 , 

/? 2 (rotation by 6 — 90° about x-axis) = (1 — ia x )/2 1/2 ,~ — - 

i 

and the product of the two is 


R 2 R x = (1 - ia x + ia v - ia z )/2 

= cos 60° - i sin 60 °(ct i / 3 1/2 - a/3 1/2 + oj 3 1/2 ). 
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Figure 41.2. 

Reflection in the plane MPQ carries A to B. 
Reflection in the plane NPQ carries B to C. The 
combination of the two reflections in the two 
planes separated by the angle 0/2 produces the 
same end result (transformation from A to Q as 
rotation through the angle 6 about the line PQ . 


According to Hamilton’s rule (41.3), this result implies a net rotation through 120° 
aboi|t a line that makes equal angles with the x-axis, the j-axis, and the z-axis, in 
conformity with what one already saw in Figure 41.1 (axis of rotation running from 
center of cube through the corner A), 

What one has just done in the special example one can do in the general case: 
obtain the parameters 6 3 , a 3 , /3 3 , y 3 of the net rotation (four unknowns!) by identify¬ 
ing the four coefficients of the four Hamilton units 1, — io x , —io y , — io 9 on both 
sides of the equation R 3 = R 2 R V In this way one arrives at the four prequaternion 
formulas of Olinde Rodrigues (1840) for the combination of the two rotations. 

Why do half-angles put in an appearance? And what is behind the law of combi¬ 
nation of rotations? The answer to both questions is the same: a rotation through 
the angle 6 about a given axis may be visualized as the consequence of successive 
reflections in two planes that meet along that axis at the angle 6/2 (Figure 41.2). 
Two rotations therefore imply four reflections. However, it can be arranged that 
reflections no. 2 and no. 3 take place in the same plane, the plane that includes 
the two axes of rotation. Then reflection no. 3 exactly undoes reflection no. 2. By 
now there remain only reflections no. 1 and no. 4, which together constitute one 
rotation: the net rotation that was desired (Figures 41.3 and 41.4). 

The rotation 

R = cos (6/2) - i sin (6/2)(o x cos a + o y cos £ + o z cos y) (41.3) 
is undone by the inverse rotation 

R- 1 = cos (6/2) -f- i sin (6/2)(o x cos a + o y cos £ + o z cos y). (41.3') 

Thus the product of the two rotation operators 

RR~ l = R~ X R = 1 (41.5) 

is an operator, the unit operator, that leaves unchanged everything that it acts on. 
The reciprocal R~ l of the combination R = R 2 R X of two rotations is 

R~ x = R X 'R^ 

(reverse order of factors!), as one verifies by substitution into (41.5). 


Geometric reason that half 
angles appear in rotation 
operators 


Algebraic properties of 
rotation operators 


(41.5') 
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Figure 41.3. 

Composition of two rotations seen in terms of reflections. The first rotation (for instance, 90° about 
OZ in the example of Figure 41.1.) is represented in terms of reflection 1 followed by reflection 2 (the 
planes of the two reflections being separated by 90°/2 = 45° in the example). The second reflection 
appears as the resultant of reflections 3 and 4. But the reflections 2 and 3 take place in the common 
plane ZOX. Therefore one reflection undoes the other. Thus the sequence of four operations 1234 
collapses to the two reflections 1 and 4. Their place in turn is taken by a single rotation about the axis 
OA. 


I 

The conjugate transpose, M*, of a matrix Mis obtained by taking the conjugate 
complex of every element in the matrix and then interchanging rows and columns. 
By direct inspection of matrix expressions (41.1) one sees that o* = a x , o* = a y9 
o* — o z . Such matrices are said to be Hermitian. The conjugate transpose of the 
product M = PQ of two matrices is the product M* = Q*p* of the individual 
conjugate transposed matrices taken in the reverse order. For the rotation matrix 
written down above, note that R* = R- 1 . Such a matrix is said to be unitary. The 
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Figure 41.4. 

Law of composition of rotations epitomized by 
a spherical triangle in which each of the three 
important angles represents half an angle of ro¬ 
tation. 


determinant of a unitary matrix may be seen to have absolute value unity from 
the following line of argument: 

1 = det (unit matrix) = det ( RR -1 ) 

= det {RR*) = det R det R* 

= |det/?| 2 . (41.6) 

In actuality the determinant of the rotation spin matrix is necessarily unity (“uni- 
modular matrix”) as shown in the following exercises 


Exercise 41.1. ELEMENTARY FEATURES OF THE ROTATION MATRIX 

Write equation (41.3) in the form 

R(6) = cos (0/2) — / sin (0/2)(<r ■ n ), 


and establish the following properties: 


(a) 

(b) 

(c) 

(d) 


(or ■ n) 2 = 1 = unit matrix; 

tr (<r • n) = 0 (tr means “trace,” i.e., sum of diagonal elements); 
[A, (or • ft)] ~ R(a • n ) — (or • n)R = 0; 

^ -[commutator] 

dR i 
w --j(vn)R. 


(41.7) 


[Note that if one thinks of 9 as increasing with angular velocity w, so dO/dt 
then this last equation reads 


dR 

dt 


-j(cr-u)R 


to = constant. 


EXERCISES 


where to = con.] 


( 41 . 7 ') 
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Infinitesimal rotations 


Representation of a 3-vector 
as a spin matrix 


Exercise 41.2. ROTATION MATRIX HAS UNIT DETERMINANT 

Recall from exercise 5.5 that for any matrix M one has 

r 

</[ln (det A/)] = tr(A/“ l dM) 

and use this to show that det R in (41.7) is constant, and therefore equal to (det R) e - 0 = 1. 


§41.2. INFINITESIMAL ROTATIONS 

A given rotation can be obtained by performing in turn two rotations of half the 
magnitude, or four rotations of a fourth the magnitude, or eight of an eighth the 
magnitude, and so on. Thus one arrives in the limit at the concept of an infinitesimal 
rotation described by the spin matrix 

R = 1 - m(o x dd vz + a y dO gx + a z dd xv ) 


or 

R = 1 - (i d0/2)(a • #i). (41.8) 

Here the quantities 

d6 yz — —dd zy = n x d6 — cos a dO , 

dO zx = - d6 xz = n v dd — cos ft dd, (41.9) 

dd xy = —d6 yx — n z dO — cos y dd , 

are the components of the infinitesimal rotation in the three indicated planes. An 
infinitesimal rotation in the (x, j)-plane through the angle d6 xy transforms the vector 
x = (x,y, z) into a new vector with changed components x' and y' but with un¬ 
changed component z' = z. More generally, the infinitesimal rotation (41.8) con¬ 
sidered in this same “active” sense* produces the transformation 

x — x\ 

with 

{de xv )y - {dejz, 

/= -(dO vx )x + y-{d6 yt )z, (41.10) 

z' = ~(d6 tx )x - ( dO zv )y + z. s 

Spinor calculus provides an alternative (and shorthand!) means to calculate the 
foregoing effect of a rotation on a vector. Associate with the vector x the spin matrix 

X = xo x + ya y + za x = (x • or), (41.11) 


* An “ active ” transformation changes one vector into another, while leaving unchanged the underlying 
reference frame (if there is one). By contrast, a “ passive ” transformation leaves all vectors unchanged, 
but alters the reference frame. All transformations in previous chapters of this book were passive. 
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and with the vector x' a corresponding spin matrix or quaternion X\ Then the effect 
of the rotation is summarized in the formula 

X-+ X' = RXR\ (41.12) 

Test this formula for the general infinitesimal rotation (41.10). It reads 

(*' * a) = [1 - (/ dOf 2)(<x • /i)](x • <t)[\ + (/ d0/2)(<r • n )] 

oi^tct the first order in the quantity dd , 

(x' * <t) = (x • a) + (/ d6/2)[(x * a)(a • n) — (a • /i)(x • a)]. (41.13) 

The product of spin matrices A — (a • <x) and B = (A • a) built from two distinct 
vectors a and is 
l 

AB = (a • <r)(Z> • <t) = a x b x o x 2 -h a x b v o x o y -f . . . , 
or, according to (41.2), 

= (a-b) -h /(a X b) m a. (41.14) 

Employ this formula to evaluate the right-hand side of (41.13). In the square brackets, 
the terms in (jc • n) have opposite signs and cancel. In contrast, the terms in (n X x ) 
have the same sign. They combine to cancel the factor 2 in (d9/2). End up with 

(*' • <t) = (x • &) + d9(n X x) * a 


or 


x' = [1 + (d0)n X] x (41.15) 

in agreement with (41.10), as was to be shown. 

A finite rotation about a given axis can be considered as the composition of 
infinitesimal rotations about that axis. To see this composition in simplest form, 
rewrite the spin matrix (41.8) associated with the general infinitesimal rotation as 

R(d0) = e- {iddm °- n) (41.16) 

(exponential function defined by its power-series expansion). Note that (a • n) com¬ 
mutes (a) with unity and (b) with itself, and in addition (c) has a unit square. 
Therefore the calculation of the exponential function proceeds no differently here, 
for spin matrices, than for everyday algebra. The composition of the spin matrices 
for infinitesimal rotations about an unchanging axis proceeds by adding exponents, 
to give 

R(0) = e - i{d/2 *°-”\ (41.17) 


Rotation of a 3-vector 
described in spin-matrix 
language 


Composition of finite rotation 
from infinitesimal rotations 


which can also be obtained immediately from equation (41.7). This expression can 
be put in another form by developing the power series; thus. 
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EXERCISE 


4-vectors and Lorentz 
transformations in spin- 
language 


R (fi)= 2 ( 1 //>!)(-$<7-«/2) p 

p=0 

_ (41.18) 

= E {l/p'-X-tt/iy + (a-n) 2 (l/p\)(-W/2y 

even p odd p 

= cos (9/2) — i sin (0/ 2)(a • n) 

in agreement with the expression (41.3) originally given for a spinor transformation. 
The effect of one infinitesimal rotation after another after another ... on a vector 
is given by 

X f = R(d6) . . . R(dO)XR\d0) . . . R*(d9), 

with the consequence that even for a finite rotation R — R(8) one is correct in 
employing the formula 

X' = RXR*. (41.19) 


Exercise 41.3. MORE PROPERTIES OF THE ROTATION MATRIX 

Show that for X = x * a one has the commutation relation 

[(cr*/i), X) = 2i(n X x)*a. 

Use this to obtain, from equation (41.19) in the form X — RX Q R * [where X 0 is constant, 
while R(8) is given by equation (41.17)], the formula 

^ (x * a) = (n X x) • a. 

Why is this equivalent to the standard definition 


for the angular velocity? Reverse the argument to show that equation (41.7') correctly defines 
the rotation R(t ) resulting from a time-dependent angular velocity to(/)> even though the 
simple solution R = exp [ — \it(o • to)] of this equation can no longer be written when to is 
not constant. 


* 


§41.3. LORENTZ TRANSFORMATION VIA SPINOR ALGEBRA 

Generate a rotation by two reflections in space? Then why not generate a Lorentz 
matrix transformation by two reflections in spacetime? If for this purpose one has to turn 
from a real half-angle between the two planes of reflection to a complex half-angle, 
that development will come as no surprise; nor will it be a surprise that one can 
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still represent the effect of the Lorentz transformation by a matrix multiplication 
of the form 

X — X' — LXL*. (41.20) 

Here the “Lorentz spin transformation matrix” L is a generalization of the rotation 
matrix, R . Also the “coordinate-generating spin matrix” X is now generalized from 
(41.11) to 

X= t + (jc-a) (41.21) 


or 


t + z x — iy 

x + iy t — z 


It is demanded that this matrix be Hermitian 


(41.22) 


X = X*. (41.23) 

Then and only then are the coordinates (r, x,y, z) real. The conjugate transpose of 
the transformed spin matrix must also be Hermitian—and is: 


(X')*= (LXL*)* 

= (L*)*(X)*(L)* = LXL* = X'. (41.24) 

Therefore the new coordinates (t\ x\y \ z') are guaranteed to be real, as desired. 
This reality requirement is a rationale for the form of the spin-matrix transformation 
(41.20), with L appearing on one side of X and L* on the other. 

A Lorentz transformation is defined by the circumstance that it leaves the interval 
invariant: 

t' 2 - x' 2 - y' 2 - z' 2 = t 2 - x 2 -y 2 - z 2 . (41.25) 

Note that the determinant of the matrix X as written out above has the value 


det X — t 2 — x 2 — y 2 — z 2 . (41.26) 

Consequently the requirement for the preservation of the interval may be put in 
the form 

det X' = det X (41.27) 


or 


(det L) (det X) (det L*) = det X. (41.28) 

This requirement is fulfilled by demanding 

det L — \ (41.29) 

[itis_JiQl_a-useful generalization to multiply every element of L here by a common 
phase factor e iS , and therefore multiply detL by e 2iS , because the net effect of this 
phase factor is nil in the formula X ' = LXL*]. 
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Infinitesimal Lorentz 
transformations 


Composition of finite Lorentz 
transformations from 
infinitesimal transformations 


The spin matrix associated with a rotation, whether finite or infinitesimal, already 
satisfied the condition det L — 1 [proved in exercise (41.2)]. This condition, being 
algebraic, will continue to hold when the real angles dO yz , d6 zx , dO xy , are replaced 
by complex angles, d6 yz 4- i da x , dO zx 4- / da y , dO xy 4- i da z . The spin-transformation 
matrix acquires in this way a total of six parameters, as needed to describe the general 
infinitesimal Lorentz transformation. Thus the spin matrix for the general infinitesi¬ 
mal Lorentz transformation can be put in the form 

L — \ — (i/2)(a x dQ yz 4 - o y d8 zx + o z dO xy ) 

+ ( l /2)( a x da x + a v da v + °z da z ) (41.30) 

= 1 — (/ d8/ 2)(a • ji) + (a • da/2). 

The effect of this transformation upon the coordinates is to be read out from the 
formula 

X — X' = LXL* 


or 

t' 4- (a • x') = [1 - (i dQ/2)(o • ») + (a • da/2)] 

X [t + (a • x)][l + (i dBf 2)(a • n) + (a • da/ 2)] (4131} 

Employ equation (41.14) for (a • A)(a • B) to reduce the right side to the form 

t 4- (a • jc) + (a • da)t 4- d0(n X x) • a 4- (x • ^a). 


Now compare coefficients of 1, and a z , respectively, on both sides of the 

equation, and find 


V — t 4- (x • da) 
x' = x 4- t da 4- d8(n X x), 


(41.32) 


in agreement with the conventional expression for an infinitesimal Lorentz transfor¬ 
mation or “boost” of velocity da , in active form, as was to be shown. 

The composition of such infinitesimal Lorentz transformations gives a finite 
Lorentz transformation. The result, however, can be calculated easily only when all 
infinitesimal transformations commute. Thus assume that dO and da are in a fixed 
ratio, so 

_ d6 A _ da 

co = n — and a = —— 
dr dr 

' S 

are constants, with r a parameter. Then integration with respect to r (composition 
of infinitesimal transformations) gives a finite transformation L = exp[ — \irv • 
C« 4- ia )]. For r = 1 , so On = cor, a = ar , this reads 


L — exp [(a — iOri) • a/2]. (41.33) 

In the special case of a pure boost (no rotation; 6 = 0), the exponential function 
is evaluated along the lines indicated in (41.18), with the result 


L = cosh (a/2) 4- (ft a • a) sinh (a/2). 


(41.34) 
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Here n a _ a/a is a unit vector in the direction of the boost. The corresponding 
Lorentz transformation itself is evaluated from the formula 


X' = LXL* 
or 


t' + (*' • a) = [cosh a/2 + ( n a • a) sinh a/2][t + (x • a)] 
X [cosh a/2 + ( n a • a) sinh a/2]. 


Simplify with the help of the relations 


(41.35) 


cosh 2 (a/2) + sinh 2 (a/2) = cosh a, 

2 sinh (a/2) cosh (a/2) = sinh a, 

and { 

("« • <*)(* • °)( n a ' 0) = 2(n a ■ x)(n a • a) - (n a • n a )(x • a), 
and on both sides of the equation compare coefficients of 1 and a, to find 


t' = (cosh a)t + (sinh a)(« a • jc), 


x' = [(sinh a)n a t + (cosh a)(« a • *)/i a ] (“in-line part of transformation”) 
+ [jc — (jc • n a )n a ] (“perpendicular part of x unchanged”). 


(41.36) 


In this way one verifies that the quantity a is the usual “velocity parameter,” 
connected with the velocity itself by the relations 


(1 — P 2 ) 1/2 = cosh a, 

/?(1 - P 2 )~ 1/2 = sinh a, (41.37) 

ft — tanh a. 

That velocity parameters add for successive boosts in the same direction shows 
nowhere more clearly than in the representation (41.33) of the spin-transformation 
matrix: 


L(a 2 )L(a,) = exp [a 2 (n a • a)/2] exp [oqfo • <x)/2] = exp [(a 2 + ai )(n a ■ a)/2] 

= L(a 2 + a,). (41.38) 

Turn from this special case, and ask how to get the resultant of two arbitrary Lorentz 
transformations, each of which is a mixture of a rotation and a boost. No simpler 
method offers itself to answer this question than to use formula (41.33) together 
with the equation 

L(resultant) = L 2 L V (41.39) 


§41.4. THOMAS PRECESSION VIA SPINOR ALGEBRA 

A spinning object, free of all torque, but undergoing acceleration, changes its 
direction as this direction is recorded in an inertial frame of reference. This is the 
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Origin of Thomas precession: 
composition of two boosts is 
not a pure boost 


Derivation of Thomas 
precession using spin 
matrices 


Thomas precession [see exercise 6.9 and first term in equation (40.33b)]. This preces¬ 
sion accounts for a factor two in the effective energy of coupling of spin and orbital 
angular momentum of an atomic electron. In a nucleus it contributes a little to the 
coupling of the spin and orbit of a nucleon. The evaluation of the Thomas precession 
affords an illustration of spin-matrix methods in action. 

The precession in question can be discussed quite without reference either to 
angular momentum or to mass in motion. It is enough to consider a sequence of 
inertial frames of reference S(t) with these two features. (1) To whatever point the 
motion has taken the mass at time t, at that point is located the origin of the frame 
S(t). (2) The inertial frame S(t -f dt) at the next succeeding moment has undergone 
no rotation with respect to the inertial frame S(t ), as rotation is conceived by an 
observer in that inertial frame. However, it has undergone a rotation (“Thomas 
precession”) as rotation is conceived and defined in the laboratory frame of reference. 

How is it possible for “no rotation” to appear as “rotation”? The answer is this: 
one pure boost, followed by another pure boost in another direction, does not have 
as net result a third pure boost; instead, the net result is a boost plus a rotation. 
This idea is not new in kind. Figure 41.1 illustrated how a rotation about the z-axis 
followed by a rotation about the x-axis had as resultant a rotation about an axis 
with not only an x-component and a z-component but also a ^-component. What 
is true of rotations is true of boosts: they defy the law for the addition of vectors. 

Let the frame S 0 coincide with the laboratory frame, and let the origin of this 
laboratory frame be where the moving frame is at time t . Let S(t) be a Lorentz 
frame moving with this point at time t. Let one pure boost raise its velocity relative 
to the laboratory from /3 to J3 + dp. The resulting final configuration cannot be 
reached from S 0 by a pure boost. Instead, first turn S 0 relative to the laboratory 
frame (“rotation R associated with the Thomas precession”) and then send it by 
a simple boost to the final configuration. Only one choice of this rotation will be 
right to produce match-up. Thus, distinguishing the spin matrices for pure boosts 
and pure rotations by the letters B and R y one has the relation 


B(P + dp)R(o>dt) = “B(dp)”B(p) (41.40) 


out of which to find the angular velocity to of the Thomas precession. The quotation 
marks in carry a double warning: (1) the velocity of transformation that 

boosts S(t) to S(t 4- dt) is not (j8 + <//3) — fi = df} (law of vector addition—or 
subtraction—not applicable to velocity), and (2) “i?(^]3)” does not appear as a pure 
boost in the laboratory frame. It appears as a pure boost only in the comovin^ frame. 

Take care of the second difficulty first. It is only a difficulty because the formalism 
for combination of transformations, R 3 = R 2 R V as developed in §41.1 presupposes 
all operations R v R 2 ,. . . , to be defined and carried out in the laboratory reference 
frame. In contrast, the quantity “i?(*/j3)” is understood to imply a pure boost as 
defined and carried out in the comoving frame. Such an operation can be fitted 
into the formalism as follows. (1) Undo any velocity that the object already has. 
In other words apply the operator B~\\ 3). Then the object is at rest in the laboratory 
frame. Then apply the necessary small pure boost, 5(a comoving dr), where a comoving 
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C? 

<k 


is the acceleration as it will be sensed by the object and dr is the lapse of proper 
time as it will be sensed by the object. At the commencement of this brief acceleration 
the object is at rest relative to the laboratory. What is a pure boost to it is a pure 
boost relative to the laboratory. It is also a pure boost in the spin-matrix formalism. 
Then transform back from laboratory to moving frame. Thus have the relation 

“*(#)” = B(fi)B(a e omovins dr)B~\P). (41.41) 

The equation for the determination of the Thomas precession now reads 

B(p + dfi)R(udt ) = B(P)B(a comoviDg dT) (41.42) 

or, with all unknowns put on the left, 


R(iodt)B \a comoving dT) = B~\p + dP)B(p). 


(41.43) 


The first task, to replace the erroneous value of the velocity change (dfi) by a correct 
value (# comov i n g dT), is now made part of the problem along with the evaluation of 
the Thomas precession itself. 

Principles settled, the calculation proceeds by inserting the appropriate expressions 
for all four factors in (41.43), and evaluating both sides of the equation to the first 
order of small quantities, as follows: 


1 - (/' due + dra) • a/2 = [cosh (a 72) - (n a , • a) sinh (a 72 )] 

X [cosh (a/2) + (n a • a) sinh (a/2)]. (41.44) 

Here a and n a are the velocity parameter and unit vector that go with the velocity 
P, a' = a + da, and = n a + dn a , go with P + dp. Develop the righthand side 
of (41.44) by the methods of calculus, writing a’ = a + da and n a , = n a + dn a , and 
applying the rule for the differentiation of a product. Equate coefficients of -a/2 
and —/a/2 on both sides of the equation. Thus find 


and 


a comoving dr = ( da ) n a + (sinh a) dn a 


(41.45) 


°dt = [2 sinh 2 (a/2)] dn a x n a . (41.46) 

The one expression gives the change of velocity as seen in a comoving inertial frame. 
The other gives the precession as seen in the laboratory frame. For low velocities 
the expression for the Thomas precession reduces to 

« = a X P/2. (41.47) 

Here a is the acceleration. Only the component perpendicular to the velocity P is 
relevan^ for the precession. 

For an elementary account of the importance of the Thomas precession in atomic 
physics, see, for example, Ruark and Urey (1930). 


Angular velocity of Thomas 
precession 
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Orientation-entanglement 

relation 



Figure 41.5. 

“Orientation-entanglement relation” between a cube and the 
walls of a room. A 360° rotation of the cube entangles the 
threads. A 720° rotation might be thought to entangle them 
still more—but instead makes it possible completely to disen¬ 
tangle them. 


§41.5. SPINORS 

Paint each face of a cube a different color. Then connect each comer of the cube 
to the corresponding comer of the room with an elastic thread (Figure 41.5). Now 
rotate the cube through 277 = 360°. The threads become tangled. Nothing one can 
do will untangle them. It is impossible for every thread to proceed on its way in 
a straight line. Now rotate the cube about the same axis by a further 2 77. The threads 
become still more tangled. However, a little work now completely straightens out 
the tangle (Figure 41.6). Every thread runs as it did in the beginning in a straight 
line from its corner of the cube to the corresponding corner of the room. More 
generally, rotations by 0, ±477, ±877, . . . , leave the cube in its standard “orienta¬ 
tion-entanglement relation” with its surroundings, whereas rotations by ± 2 t 7 , ±677, 
± IO77, . . . , restore to the cube only its orientation, not its orientation-entanglement 
relation with its surroundings. Evidently there is something about the geometry of 
orientation that is not fully taken into account in the usual concept of orientation; 
hence the concept of “orientation-entanglement relation” or (briefer term!) “version” 
(Latin versor, turn). Whether there is also a detectable difference in the physics 
(contact potential between a metallic object and its metallic surroundings, for exam¬ 
ple) for two inequivalent versions of an object is not known [Aharonov and Susskind 
( 1967 )]. 

In keeping with the distinction between the two inequivalent versions of an object, 
the spin matrix associated with a rotation, 

R — cos (0/2) — i(n * <r) sin (0/2), (41.48) 

reverses sign on a rotation through an odd multiple of 277. This sign change never 
shows up in the law of transformation of a vector, as summarized in the formula 

X—^ X' = RXR* (41.49) 

(two factors R; sign change in each!). The sign change does show up when one turns 
from a vector to a 2-component quantity that transforms according to the law 

(4L50) 


Spinor defined 
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720° 



Figure 41.6. 

An object is connected to its surroundings by elastic threads as in Figure 41.5. (Eight are shown here; 
any number could be used.) Rotating the object through 720° and then following the procedure outlined 
(Edward McDonald) in frames 2-8 (with the object remaining fixed), one finds that the connecting threads 
are left disentangled, as in frame 9 (lower right). 


Such a quantity is known as a spinor. A spinor reverses sign on a 360° rotation. 
It therefore provides a reasonable means to keep track of the difference between 
the two inequivalent versions of the cube. More generally, with each orientation- 
entanglement relation between the cube and its surroundings one can associate a 
different value of the spinor £. Moreover, there is nothing that limits the usefulness 
of the spinor concept to rotations. Also, for the general combination of boost and 
rotation, one can write 

= Lt (41.51) 


Lorentz transformation of 
spinor 
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Second type of spinor 


Vector regarded as a 
Hermitian second-rank spinor 


When the boost and rotation are both infinitesimal, the explicit form of this trans¬ 
formation is simple: 


€' = [1 — 0 d0/2)(n • <x) 4 - (dp/2) • <x]£. 


or, according to (41.1), 


/s'\ 

r) 


1 + — 4 . /^) 


~ K: + P:~ iP,) 


y (- + P* + iP v ) 1 + ~ P.) 


U* 


(41.52) 


For any combination of a boost in the z-direction of any magnitude and a finite 
rotation about the z-axis, one has 



g—l/2i$ xv +l/2f} e 0 

0 e l/2id xv -\/2P z 



(41.53) 


To keep track of the two components of the spinor, it is convenient and customary 
to introduce a label (capital Roman letter near beginning of alphabet) that takes 
on the values 1 and 2; thus (41.51) becomes 


r * 4 = 


(41.54) 


The spinor has acquired a significance of its own through one’s having pulled 
out half of the transformation formula 


X' = LXL*. (41.55) 

To be able to recover this formula, one requires the other half as well. It contains 
the conjugate complex of the Lorentz transformation. Therefore introduce another 
spinor rj that transforms according to the law 

V ,tr = L*yV (41.56) 

[U = 1,2; V = 1,2; dots and capital letters near the end of the alphabet are used 
to distinguish components that transform according to the conjugate complex (no 
transpose!) of the Lorentz spin matrix]. 

i 


§41.6. CORRESPONDENCE BETWEEN VECTORS 
AND SPINORS 


To go back from spinors to vectors, note that the spin matrix X in (41.55) has the 
form 


a) = 

(t + z) 

(x - iy) 


X 

X 12 

(X + iy) 

(t - z) 


X 2i 

X Z2 


(41.57) 
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where the labels receive dots or no dots according as they are coupled in (41.55) 
to L* or to L. That equation of transformation becomes 

= lA B L*yX B * (41.58) 

(transpose obtained automatically by ordering of indices; thus L°y, not L*^y). The 
coefficients in this transformation are identical with the coefficients in the law for 
the transformation of a second-rank spinor with one index undotted and the other 
dotted:” 

= L A B L a i4 e V* (41.59) 

In this sense one can say that “a 4-vector transforms like a second-rank spinor.” 
To be completely explicit about this connection between a 4-vector and a second-rank 
spinor^ note from (41.57) the relations 


X li = *° + a : 3 , 
* 12 = x 1 - ix\ 
X 2i = x 1 + ix\ 
X 22 = x° - x 3 . 


(41.60) 


In a more compact form, one has 

X AU = [/ + (*• a )Y° = x^o* 0 (41.61) 

where a 0 is the unit matrix. This equation tells immediately how to go from the 
components of a 4-vector, or “1-index tensor,” to the components of the correspond¬ 
ing “1,1 -spinor” (one undotted and one dotted index). 

With each real 4-vector x a is associated a 1,1-spinor that is Hermitian in the sense 
that 

X**=X&. (41.62) 

An example of a Hermitian 1,1-spinor is provided by (41.61). The concept of 
Hermiticity can be stated in other words, and more generally. Associated with any 
A r , A-spinor with components <fi A -r-~ A \ f ;■■ A -\ is the conjugate complex spinor <P with 


(<P) a i- a .v&1"U n — (4>U l ...U„A I ...A !l \ 


(41.63) 


An AjAAspinor is said to be Hermitian when it is equal to its conjugate complex. 


§41.7. SPINOR ALGEBRA 

Equation (41.53) showed the component of a spinor rising exponentially with 
a boost in proportion to the factor e 1/2 ^, and the other component, £' 2 faffing 
exponentially. If from two spinors £ and f, there is to be any quantity constructed 
which is unaffected in value by the boost, it must be formed out of such products 



A/,A/-spinors and Hermiticity 


Spinor algebra: 
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as g 1 ^ 2 and g 2 £K One can restate this product prescription in other language. 


(1) e AB , e AB defined 

Introduce the alternating symbols e ab and e ab such that e 12 = e 10 = 

1 and 


g'iB _ _ BA _ 

* ’ e AB ~ E BA’ 

(41.64) 


the only other nonvanishing components being e 21 = e 21 = — 1 . Define the lower- 
label spinor ^ in terms of the upper-label spinor by the equation 

(2) raising and lowering 
spinor indices 

£a = 

with the inverse 

(41.65) 


= eB %. 

(41.66) 


Then the scalar product of one spinor by another is defined to be 


(3) scalar products of spinors 


(41.67) 


The value of this scalar product is unaffected by any boost or rotation or combination 
thereof: 

vj /A = r\ A r A 

= (L%g D )e BA (L\n 
= (det L)i D e DC $ c 

= t c $ C - ( 41 . 68 ) 

The proof uses the fact that the expression L b d e ba L a c (1) vanishes when D = C, 
and (2) reduces to the determinant of L (unity!) or its negative when D — 1, C = 2, 
or D = 2, C = 1. Note that the scalar product g A $ A is the negative of the scalar 
product g A { A . The value of the scalar product of a spinor with itself is automatically 
zero (“built-in null character of a spinor”). 

(4) the mapping between The components of a vector with upper index have been expressed in terms of 

vectors and 1,1-spinors the components of a 1,1-spinor with upper indices 

X A ° = x»o a0 , ( 41 . 69 ) 

and a similar correlation holds between vector and 1,1-spinor with lower indices; 
thus, * 

%AU = X !i a 11 a u- ( 41 . 70 ) 


(5) o" defined and related Here the “associated basic spin matrices” have the components 
to a „ 

a>i AU = ii B ^ E BA e iV’ 


( 41 . 71 ) 
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or, explicitly, 





10 

ill 




0 

1 

a"ii 



1 

0 

i aM 22 


. 1 

|0 

_ l 


+ 


1 0 


0 

0 

-1 


for /i = 0, 
for /x = 1, 
for ix = 2, 
for fi = 3. 


(41.72) 


The same type of multiplication law holds for these matrices, (a 1 ) 2 = (a v ) 2 = 
(a*) 2 = 1, a x a v = - a v a * = ia\ etc., as for the matrices a x , a y , a z of (41.2). Be¬ 
tween the “basic spin matrices,” a M , and the “associated basic spin matrices,” a", 
the following orthogonahty and normalization relations obtain: 

=-2 Si 8$ (41.73) 

and 

<’S* a 'AU= ~ 2 K (41.74) 

One can use these relations to “go back from a quantity expressed as a 1,1 -spinor 
(‘spinor equivalent of a vector’) to the same quantity expressed directly as a vector 
(first-rank tensor).” Thus, multiply through (41.61) on both sides by -\o v Atj , sum 
over the spinor indices, and employ (41.74) to find the contravariant components 
of the vector, 

= -jo’aO***- (41-75) 


Similarly from (41.70) and (41.73) one finds the covariant components, 


x„ = 


\°, aV Xav- 


(41.76) 


An A-index tensor 7Tets itself be expressed in spinor language (“spinor equivalent (6) 
of the tensor”) by a generalization of (41.61) or (41.70); thus, for a mixed tensor 
of third order, one has 


T A0 BtCW =a a A6Op BV Oy CW T jy 


and the converse relation 


t Py — 

1 a 



3 . n y T .BVCW 
BV° CW X AU 


(41.77) 


(41.78) 


the mapping between 
rank-A/ tensors and 
A/,/V-spinors 


Box 41.1 gives the spinor representation of several simple tensors. 
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Box 41.1 SPINOR REPRESENTATION OF CERTAIN SIMPLE TENSORS IN 
THE CONTEXT OF A LOCAL LORENTZ FRAME 


Quantity 

Tensor language 

Spinor language 

General 4-vector 

x a (four complex numbers) 

X A v (4 complex numbers) 

Real 4-vector (example: 

4-momentum) 

x a = x a (four real numbers) 

X A v — ( X LA ) (2 real components. 

1 distinct complex component) 

Null 4-vector 

= 0 

dtXX A ° = 0 [see (41.57)]; hence 
there exist two spinors and t\ v 

such that X AU = £W' 

Future-pointing real null 4-vector 
(such as 4-momentum of a 
photon) 

X a = X Q 

VtfiX’x* = 0 
x° > 0 

There exists a spinor £ A (two com¬ 
plex numbers, unique up to a 
common multiplicative phase_fac¬ 
tor e i5 ) such that X A = 1^(0^ 

Past-pointing real null 4-vector 

x° < 0 

x a0 - 

Real bivector or 2-form (such as 
Maxwell field) 

(subscript implying F afi - 
— Fp a : six distinct real 
components) 

There exists a symmetric spinor 
(three distinct complex compo¬ 
nents 4> n , 22 ) such_that 

F AVBV = $AB*UV + e AB$)vV 

Real 2-form dual to foregoing real 
2-form 

•f./j = 

*Favbv = “ i^AB^uv + ^abWuv 
(duality for 2-form corresponds to 
multiplication of spinor 4> AB 
by -0 

Real fourth-order tensor with sym¬ 
metries of Weyl conformal curva¬ 
ture tensor; that is, with symme¬ 
tries of Riemann curvature tensor 
and with additional requirement 
of vanishing Ricci tensor (“empty 
space;” “vacuum Riemann 
tensor”) 

C a pys = CWhysi) (antisymmetric in 
first two indices; antisymmetric in 
last two indices; symmetric against 
interchange of first pair with sec¬ 
ond pair) C a [fiys] = 0 (20 algebrai¬ 
cally distinct components, as for 
the Riemann tensor, reduced to 10 
by the further vacuum condition:) 

C a — 0 

^ 0 a« — u 

There exists a completely symmetric 
spinor ^ BCD ^ ve distinct 

complex components, 

^1111 

^1112 

^1122 

^1222 

^2222 

such that C A iT B j V wD£ = _ 

4'ABCD E UV e UX + e AB £ CD'l / VVWX 


V 


In some treatises on spinor analysis, the factor (—in equations like (41.78) 
is eliminated by the following double prescription: (1) insert into the matrices a^ 
and a" a factor l/\/2 not included above; and (2) use for the standard metric not 
diag ij Mr = (-1,1,1, 1) as above, but (1,-1, -1, -1). This prescription was not 
adopted here (1) because the introduction of l/\/2 in the matrices a x , o„, a z would 
put them out of line with the Pauli matrices as used for many years throughout 
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Quantity 


Fully developed Riemann curvature 
tensor (space where matter is 
present) 


i 


Each physical quantity is described 
by a geometric object. Every local 
physical quantity is described by a 
mathematical quantity that trans¬ 
forms under a proper local Lor- 
entz transformation as an “irre¬ 
ducible representation of the 
group of proper Lorentz 
transformations.” 


Tensor language 


R apy& — R [ap]y& — R ap[y&) ~ 
R ([ap][ys)) 

R aipy&] = 0 (20 algebraically dis¬ 
tinct components) 


Each local physical quantity is de¬ 
scribed by a tensor with its own 
rank and specific symmetry 
properties. 


Spinor language 


There exists a completely symmetric 
spinor ^abcd (“Weyl” or “confor¬ 
mal” part of curvature, or part of 
nonlocal origin) and a scalar A 
(measure of trace of part of curva¬ 
ture of local origin) and a spinor 

&ABUV ~ = (^)aBUV 

(measure of trace-free part of cur¬ 
vature of local origin; last of the 
three irreducible parts of the cur¬ 
vature tensor) such that 

R AUBVCWDX = ] pABCD £ Ui*WX 

T e AB e CD('t / )uVWX 
4 * 2A(e A c e BD e ti ,€ wx 
+ e AB e CD*l\*Vw) 
T e AB < f > CDUV e W.\ 

■E e CD^AJBWX e UV‘ 

In order to provide the required fi¬ 
nite irreducible representation of 
to represent a local physical 
quantity, the associated spinor 
must be completely symmetric in 
all of its undotted indices, and 
also completely symmetric in all 
its dotted indices [Gel’fand (1963)]. 


y 


atomic and nuclear physics, and (2) because a positive definite metric within a 
spacelike hypersurface has the advantage of naturalness for the analysis of the 
initial-value problem of geometrodynamics and for the definition of what one means 
by a 3-geometry. The price of the factor (—is paid here for these advantages. 
Conventions that avoid this price are preferable for extensive spinor computations; 
see, e.g., Pirani (1965) or Penrose (1968a). 
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Linear independence of 
spinors 


Basis spinors and spinor 
mates 


§41.8. SPIN SPACE AND ITS BASIS SPINORS 

The “space” of elementary spinors is two-dimensional. Therefore it is spanned by 
any two linearly independent spinors A x and j u. A . Moreover, it is easy to diagnose 
a pair of spinors for possible Unear dependence, that is, for existence of a relation 
of the form fi A = const A^. In this event, the scalar product of \\i A with \ A , Uke the 
scalar product of \ A with \ A (41.67) automatically vanishes. Therefore a nonvan¬ 
ishing scalar product 


(41.79) 

is a necessary and sufficient condition for the linear independence of two spinors. 

The general 4-vector lets itself be represented as a linear combination of four basis 
vectors. Similarly the general spinor lets itself be represented as a Unear combination 
of two basis spinors: 


u A = \£ A + fin A . (41.80) 

Here it is understood that the term “basis spinor” implies that £ A and tj a satisfy 
the normalization condition 


£aV A = 1 (= -vJ A )- (41.81) 

From this condition one derives simple expressions for the expansion coefficients 
in (41.80): 

A = -il A w A (= w b v b ), 

lt = i A u A (= - Ub $ b ). (41.82) 

Inserting these expansion coefficients back into (41.80) wiU reproduce any arbitrarily 
chosen spinor u A . In other words, the foUowing equation has to be an identity in 
the components of u B : 


u A = = (£A V b _ (41.83) 

From this circumstance, it foUows that the components of the two basic spinors are 
linked by the equations 

i A V B - V A i B = e ab . -—_ (4L84) 

ft 

Given two basis spinors £ A and , one can get two equally good basis spinors by 
writing 

tA _ tA 
» new ^ > 

^^new = V A + (41.85) 

with a any real or complex constant, as one checks at once by substitution into (41.81) 
or (41.84). The most general “spinor mate” to a given spinor ( A , satisfying the 
normalization condition (41.81), has this form (41.85). 
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Figure 41.7. 

Spinor represented by (1) “flagpole” [Penrose terminology; track of pulse of light; 
null vector 6&] plus (2) “flag” [arrow (<? —*■) flashed onto surface of moon by 
laser pulse from earth or, in expanded view in the inset above, a flag itself, substi¬ 
tuted for the arrow] plus (3) the orientation-entanglement relation between the flag 
and its surroundings [symbolized by strings drawn from comers of flag to surround¬ 
ings]. When the spinor itself is multiplied by a factor pe i<J , the components of the 
null vector (flagpole) are multiplied by the factor p 2 and the flag is rotated through 
the angle 2a about the flagpole. 


§41.9. SPINOR VIEWED AS FLAGPOLE PLUS FLAG PLUS 
ORIENTATION-ENTANGLEMENT RELATION 

How can one visualize a spinor? Aim the laser, pull the trigger, and send a megajoule Geometric representation 

pulse from the here and now (event (?) to the there and then (event 9: center of s P inor: 

the crater Aristarchus, 400,000 km from 6 in space, and 400,000 km from 6 in 

light-travel time). The laser has been designed to produce, not a mere spot of light, 

but an illuminated arrow. Following Roger Penrose, speak of the null vector 69 

as a “flagpole,” and of the illuminated arrow as a “flag.” A spinor (Figure 41.7) 

consists of this combination of (1) null flagpole plus (2) flag plus (3) the orientation- 
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entanglement relation between the flag and its surroundings. “Rotate the flag” by 
repeatedly firing the laser, with a bit of rotation of the laser about its axis between 
one firing and the next. When the flag has turned through 360° and has come back 
to its original direction, the spinor has reversed sign. A rotation of the flag about 
the flagpole through any even multiple of 2tt restores the spinor to its original value. 

One goes from a spinor £, a mathematical object with two complex components 
I 1 and £ 2 , to the geometric object of “flagpole plus flag plus orientation-entanglement 
(1) null vector (flagpole), plus relation” in two steps: first the pole, then the flag. Thus, go from the spinor £ A to 

the real null 4-vector of the “pole” by way of the formula 


X Atj = i A (t)V 


lit+ 2) 

(x - iy) 


I 1 ! 1 

^£2 

(x + iy) 

it ~ z) 





(41.86) 

(41.87) 


The matrix on the right has its first row identical up to a factor £V£ 2 with its second 
row. Therefore the determinant of the matrix on the right vanishes. So also for the 
left. Therefore the 4-vector 6$ — (t, x,y, z) is a null vector. One “stretches” this 
vector by a factor p 2 when one multiplies the spinor £ A by the nonzero complex 
number \ — pe i<J (p, a real); however, the vector is unchanged in direction. The 
4-vector is also unaffected by the choice of the angle a. In other words, this null 
4-vector is uniquely fixed by the spinor; but the spinor is not fixed with all uniqueness 
by the 4-vector. To a given 4-vector corresponds a whole family of spinors. They 
differ from one another by a multiplicative phase factor of the form e ia (“flag 
factor”). 

(2) bivectpr (flag) and its Looking further to see the influence of the flag factor showing up, turn from a 

relat'cm' 0 ' 1 ' 611 ^ 0 ^ 1601601 rea * vector (f° ur components) generated out of the spinor £ A to a real bivector (six 

components) generated out of the same spinor: 


ffiv _^ _ £A^B t uv + 

p —AU ) v —BK 


(41.88) 


That this quantity has no more than six distinct components ( F^ ~ —F Vfi ) follows 
from interchanging A with B and U with V, and noting the resultant change in sign 
on the righthand side of (41.88). To unfold the meaning of this bivector, look in 

(41.88) for every appearance of the alternating factor Wherever such a factor 
appears, insert the expression (41.84) for this factor in terms of the starting spinor 
i A and insert the additional spinor r\ A that is needed, along with £ A , to supply a 
basis for all spinors. In this way, find 


F*' —= £^ B (I V - if*?*) + (t A r) B - y A £ B )£ & F 

= + V B P) - (IV + ^ WP (41.89) 

_ x au y bv _ Y AtJ X Bir — xy v - y*x v . 
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Thus the 2,2-spinor built from £ A represents a bivector constructed out of the two 
4-vectors x and y. Of these, the first is the “real null vector of the flagpole,” already 
seen to be determined uniquely by the spinor £ A . The second vector, 


y 


Y A& _ £A~U + 


(41.90) 


is also determined by £ A , but not uniquely, because the “spinor mate,” r] A , to £ A 
is not unique. Go from one choice of mate, t\ a , to a new choice of mate (equation 
41.85), 


4^ new =1 A + (*£ A - 

Then the real 4-vector y* goes to the new real 4-vector 


(41.91) 


y “:new = T" + (« + «)*"• (41.92) 

a 

Were the 4-vector y unique, there would project out from the flagpole, not a flag 
but an arrow. The range of values open for the real constant a + a makes one arrow 
into many arrows, all coplanar; hence the flag of Penrose. Otherwise stated, the 
choice of a spinor £ A fixes no individual arrow, but does fix the totality of the 
collection of arrows, and thus uniquely specifies the flag. 

The 4-vector y (and with it y new ) is orthogonal to the null 4-vector x, 

x-y = x^ = -Lx’ Ai} Y A * 

= - ^S A Itr(& A y u + v A I u ) = 0, (41.93) 

and spacelike, 

V ‘ V — — y HaVo + Va^oX^V 0 + V A i & ) 

= - j(Lv A )(Vu(°) - jCVA^XtcV*) = 1 (41.94) 

(“unit length of flag”). 

Multiplication of the spinor ( A by the “flag factor” e ia rotates the flag about the Rotation of flag about 
-flagpole-by the^rigle 2o, because the spinor mate, r/ A , of $ A is multiplied by the flagpole 
factor e~ ia [see the normalization condition (41.81)]. These changes alter the vector 
y to a rotated vector y rot , with 

y °ot —»- rf* = e 2i n A v° + e-*°7, A l* 

= cos 2a($ A f} l ' : + r] A ^) + sin 2a(i^ A if^ — hj A (^) 

—*- y a cos 2a + z a sin 2a. (41.95) 

Here the 4-vector z shares with the vector y these properties: it is (1) real, 

(2) spacelike, (3) of unit magnitude, (4) orthogonal to the null 4-vector x of the 
flagpole, and (5) uniquely specified by the original spinor £ A up to the additive real 
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Equations relating spinor, 
flagpole, and flag 


multiple (a + a) of x. In addition, z and y are orthogonal. Thus y and z provide 
basis vectors in the two-dimensional space in which—to overpictorialize—the “tip 
of the flag” undergoes its rotation. 

Recapitulate by returning to the laser pulse. Two numbers, such as the familiar 
polar angles 9 (angle with the z-axis) and <J> (azimuth around z-axis from x-axis) 
tell the direction of its flight. A third number, r, gives the distance to the moon 
and also the travel time for light to reach the moon. A fourth number, an angle 

tells the azimuth of the illuminated arrow shot onto the surface of the moon, 
this azimuth to be measured from the e 9 direction (where \p = 0), around the flagpole 
in a righthanded sense. Then the spinor associated with the flagpole plus flag (rotated 
arrow) is 


'€‘\ = (2 1/2 /cos(0/2)e-**'*+ t +'*\ 
£ 2 / \ sin (9/2)e i<t>/2Jri '!' /2 ) 


(41.96) 


according to the conventions adopted here [see (41.87)]. The mate t} A to this spinor, 
unique up to an additive multiple of £ A , is 


(2 . in (-sin(d/2)e-W-W\ 

\tj 2 / 1 ’ V cos (9/2)e i<t ‘ /2 ~ i 'l' /2 )' 

The 4-vector of the flagpole determined by £ A is found from (41.87): 


/x°\ 


/ ' \ 

X 1 


[ r sin 9 cos ^ 

x2 l 


r sin 9 sin <j> 

wl 

f ' 

i r cos 9 1 


(41.97) 


(41.98) 


To determine the flag itself, one requires, in addition to x“, the unit spacelike 4-vector 
y a , normal to x“, and unique up to an additive real multiple of x“. This vector is 
evaluated by use of (41.90) and has the form 


/ y°\ 

/ 0 \ 

y 1 


cos 9 cos </> cos \p -f sin <^> sin \p 

y 2 


cos 9 sin <j> cos \p — cos <f> sin ^ 

\y 3 l 


\ — sin 9 cos \p / 


(41.99) 


From these expressions forx" and y*, one calculates the components of the fyivector 
(“flag”) = x^y” — y^x" by simple arithmetic. 


§41.10. APPEARANCE OF THE NIGHT SKY: 

AN APPLICATION OF SPINORS 

Attention has gone here to extracting all four pieces of information contained in 
a spinor: separation in time (equal to separation in space), direction in space, and 
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Figure 41.8. 

Representation of a direction in space (one of the stars of the Big Dipper, regarded 
as a point on the celestial sphere) as a point in the complex f plane (f = ratio 
of spinor components) by stereographic projection from the South Pole. 


rotation about that direction. Turn now to an application where not all that informa- Spinors used to analyze 
tion is needed. Look at the night sky and ask (1) how to describe its appearance " L °rentz transformations" of 
and (2) how to change that appearance. As one way to describe its appearance, give a PP earance of ni 9 ht sky 
the direction of each star. Abandon any concern about the distance of the star, and 
any concern about any rotation ip about the flagpole. In other words, the complex 
factor 

(. 2r) 1/2 e 

common to I 1 and £ 2 drops from attention. All that is left as significant is the ratio 
f of these spinor components: 

S = fVfi 1 = tan (6/ly*. (41.100) 

To give the one complex number f (“stereographic coordinate;” Figure 41.8) for 
each star in the sky is to catalog the pattern of the stars. 

Let the observer change his stance. The celestial sphere appears to rotate. Or let 
him rocket past his present location in the direction of the North Star with some 
substantial fraction of the velocity of light. To him all that portion of the celestial 
sphere is opened out, as if by a magnifying glass. To compensate, the remaining 
stars are packed into a smaller angular compass. Any such rotation or boost or 
combination of rotation and boost being described in spinor language by a transfor¬ 
mation of the form 


tA _ tA tB 
Snew “* ^ BS » 


(41.101) 





1162 


41. SPINORS 


implies a transformation of the complex stereographic coordinate of any given star 
of the form 


_ ||ew _ L%S + L\ 
£Lw + L\ • 


(41.102) 


In the special case of a boost in the z-direction with velocity parameter a (velocity 
ft = tanh a), the off-diagonal components L l 2 and L 2 l vanish. The magnification 
of the overhead sky then expresses itself in the simple formula 


t — e a K 

* new o 


or 


<>new = 

tan (^new/ 2 ) = tan (#/ 2 )- 


(41.103) 


Contrary to this prediction and false expectation, no magnification at all is 
achieved of the regions around the North Star by moving with high velocity in that 
direction. On the contrary, any photon coming in from a star a little off that direction, 
with a little transverse momentum, keeps that transverse momentum in the new 
frame; but its longitudinal momentum against the observer is augmented by his 
motion. Thus the ratio of the momenta is decreased, and the observed angle relative 
to the North Star is also decreased. The consequence is not magnification, but 
diminution (“looking through the wrong end of a telescope”). The correct formula 
is not (41.103) but 

tan (0 new /2) = tan (8/2) (41.104) 


(reversal of the sign of a). The reason for this correction is not far to seek. The 
spinor analysis so far had dealt with an outgoing light pulse, and a 4-vector with 
positive time component. That feature was built into the formula adopted to tie 
the spinor to the 4-vector, 

rl + (r-a) = X= \\£ A £*\\. (41.105) 


In contrast the 4-vector that reaches back to the origin of an incoming photon has 
a time component that is negative (or, alternatively, sign-reversed space compo¬ 
nents)! For any null 4-vector with negative time component, one employs instead 
of (41.105) the formula 

X = (41.106) 

It is enough to mention here this point of principle without going through the details 
that give the altered sign for a in (41.104). From now on, to preserve the previous 
arithmetic, change the problem. Deal, not with incoming photons, but with outgoing 
photons. Replace the receiving telescope by the projector of a planetarium. It projects 
out into space a separate beam of light for each star of the Big Dipper and also 
one for the North Star itself. Let an observer move in the positive z-direction with 
velocity parameter a. In his frame of reference the beams actually will be widened 
out in full accord with (41.103). 

“The magnification process changes the size of the Big Dipper but not its shape.” 
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This statement is at the same time true and false. It is true of the Dipper and of 
any other constellation to the extent that the angular dimensions of that constellation 
can be idealized to be small compared to the entire compass of the sky. It is false 
in the sense that any well-rounded projected constellation, however small it may 
appear to an observer at rest with respect to the earth, can always be so “opened 
out” by the observer putting on any sufficiently high velocity, the observer still being 
near the earth, that the constellation encompasses a major fraction of the sky. 

That the “Lorentz-transformation-induced magnification” of a small object does 
not change its shape can be seen in three ways. (1) Stereographic projection (Figure 
41.8) and “fractional linear transformation” (41.102) are both known to leave all 
angles unchanged [“conformal invariance;” see for example Penrose (1959)] and 
known even to turn every old circle into a new circle. (2) Consider a given star, 
M, in the constellation and immediate neighbors, L and N, just below it and just 
above** it in the count of the members of that constellation. Consider the flagpole 
pointed at M and the flag pointed first from M to L, then from M to N. The flag 
has turnecUabout the flagpole through an angle xp. The two corresponding spinors 
therefore differ by a phase factor e x ^ /2 . They differ in no other way. After an arbitrary 
Lorentz transformation they still differ by the phase factor e^ /2 , and in no other 
way. The angle between the arcs ML and MN on the celestial sphere therefore 
remains at its original value xp after the Lorentz transformation (again conformal 
invariance of patterns on the celestial sphere!). (3) An even more elementary calcu¬ 
lation shows that infinitesimal arc lengths on the unit celestial sphere in the direction 
of increasing 6 and arc lengths in the direction of increasing <£ are magnified in 
the same proportion, thus leaving unchanged the angle between arc and arc (confor¬ 
mal invariance). Thus, consider a photon shot out from the planetarium projector 
to a point on the celestial sphere (“planetarium version of a Big-Dipper star”) with 
inclination 6 to the z-axis, as seen by an observer at rest relative to the earth. From 
the standard laws of transformation of angles in a Lorentz transformation (“aberra¬ 
tion”; Box 2.4), one has for the sine of the transformed angle 


sin 6 


new 


(1 - / 3 2) 1 ' 2 
1 — /} cos 6 


sin 8 


(41.107) 


and (by differentiating the expression for the cosine of the transformed angle) 


d8 


new 


2 -r>‘> 

1 — /? cos 6 


(41.108) 


From these expressions it follows at once that the inclination, relative to a meridian 
line, on the transformed celestial sphere is identical to the direction, relative to the 
same meridian line, on the original celestial sphere: 


new \ 
inclination/ 


sin fl Dew <fo new sin 8_ d<? 

^new d8 


tan 


( original 
inclination 


(41.109) 


Lorentz transformations leave 
angles on sky unchanged 
("conformal invariance") 


(again conformal invariance!). 
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So much for the elementary spinor and what it has to do with a null vector, with 
a “flagpole” pointed to the celestial sphere, and with rotation of a “flag” about such 
a flagpole. 


§41.11. SPINORS AS A POWERFUL TOOL IN 
GRAVITATION THEORY 


Spinor formalism in curved 
spacetime 


Spinors needed when 
analyzing fermions in 
gravitational fields 


Equivalence of spinor and 
tensor formalisms 


Just as vectors, tensors, and differential forms are easily generalized from flat space- 
time to curved, so are spinors. 

Each event 9 in curved spacetime possesses a tangent space. In that tangent space 
reside and operate all the vectors, tensors, and forms located at 9, The geometry 
of the tangent space is Lorentzian (“local Lorentz geometry at fP”), since the scalar 
product of any two vectors u and v at 9, expressed in an orthonormal frame at 
9, is 

u v= g(u, v ) = 

Thus, there is no mathematical difference between the tangent space at 9 on the 
one hand, and flat spacetime on the other. Whatever mathematical can be done 
in the one can also be done in the other. In particular, the entire formalism of spinors , 
developed originally in flat spacetime , can be carried over without change to the tangent 
space at the arbitrary event 9 in curved spacetime. 

Let it be done. Now spinors reside at every event in curved spacetime; and at 
each event one can translate back and forth between spinor language and tensor 
language, using the equations (valid in orthonormal frames) of §§41.6 and 41.7. 

Spinors in curved spacetime are an indispensible mathematical tool, when one 
wishes to study the influence of gravity on quantized particles of half-integral spin 
(neutrinos, electrons, protons, . . .). Consider, for example, Hartle’s (1971) proof that 
a black hole cannot exert any long-range, weak-interaction forces on external matter 
(i.e., that a black hole has no “weak-interaction hair”). His proof could not function 
without a spinor description of neutrino fields in curved spacetime. Similarly for 
Wheeler’s (1971b) analysis of the quasibound states of an electron in the gravitational 
field of a small black hole (gravitational radius ~~10“ 13 cm): it requires solving the 
Dirac equation for a spin-| particle in the curved spacetime geometry of Schwarzs- 
child. For a detailed discussion of the Dirac equation in curved spacetime see, e.g., 
Brill and Wheeler (1957). 

To use the mathematics of spinors, one need not be dealing with quanturri theory 
or with particles of half-integral spin. The spinor formalism is perfectly applicable 
in situations where only integral-spin entities (scalars, vectors, tensors) are in view, 
and where in fact, the spinor formalism is fully equivalent to the tensor formalism 
that pervades earlier chapters of this book. Equations (41.77) and (41.78) provide 
the translation from one formalism to the other, once an orthonormal frame has 
been chosen at each event in spacetime. 

Certain types of problems in gravitation theory are far more tractable in the 
language of spinors than in the language of tensors. Examples are as follows. 
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(1) Geometric Optics 

(the theory of “null congruences of geodesics") 

Here spinors make almost trivial the lengthy tensor algebra needed in derivations 
of the “focusing theorem” [equation (22.37)]; and they yield an elegant, simple 
formalism for discussing and calculating how, with increasing affine parameter, a 
bundle of rays alters its size (“expansion”), its shape (“shear”), and its orientation 
( rotation ). See, e.g., Sachs (1964), Pirani (1965), or Penrose (1968a) for a review 
and the original references. 


(2) Radiation Theory in Curved Spacetime 
^ (both gravitational and electromagnetic) 

Spinors provide the most powerful of all formalisms for decomposing radiation fields 
into spherical harmonics and for manipulating their decomposed components. See, 
for example, Price’s (1972a,b) analysis of how a perturbed Schwarzschild black hole 
radiates away all its radiatable perturbations, be they electromagnetic perturbations, 
gravitational perturbations, or perturbations in a fictitious field of spin 17; see, 
similarly, the analysis by Fackerell and Ipser (1972) and by Ipser (1971) of electro¬ 
magnetic perturbations of a Kerr black hole, and the analysis by Teukolsky (1972) 
of gravitational perturbations of a Kerr hole. Spinors also yield an elegant and 
powerful analysis of the “1/r” expansion of a radiation field flowing out from a 
source into asymptotically flat space. Among its results is a “peeling theorem which 
describes the algebraic properties of the coefficients in a 1/r expansion of the 
Riemann tensor. See, e.g., Sachs (1964) or Pirani (1965) for reviews and original 
references. 


(3) Algebraic Properties of Curvature Tensors 


The spinor formalism is a more powerful method than any other for deriving the 
Petrov-Pirani algebraic classification of the conformal curvature tensor,” and for 
proving theorems about algebraic properties of curvature tensors. See, e.g., Sachs 
(1964) or Pirani (1965) or Penrose (1968a) for reviews and references. 

Of cours e, the spinor formalism, like any formalism, has its limitations. For 
example, many of the elementary problems of gravitation theory, and a large fraction 
of the most difficult ones, would be more difficult in the language of spinors than 
in the language of tensors! But for certain classes of problems, especially those where 
null vectors play a central role, spinors are a most valuable tool. 

Cartan gave spinors to the world’s physics and mathematics. His text (American 
edition, 1966) is an important reference to the subject. 


Applications of spinor 
formalism in classical 
gravitation theory 



CHAPTER 


REGGE CALCULUS 


This chapter is entirely Track 2. 
As preparation for it, Chapter 
21 (variational principle and 
initial-value formalism) is 
needed. It is not needed as 
preparation for any later 
chapter, though it will be 
helpful in Chapter 43 
(dynamics of geometry). 


The need for Regge calculus 
as a computational tool 


§42.1. WHY THE REGGE CALCULUS? 

Gravitation theory is entering an era when situations of greater and greater com¬ 
plexity must be analyzed. Before about 1965 the problems of central interest could 
mostly be handled by idealizations of special symmetry or special simplicity or both. 
The Schwarzschild geometry and its generalizations, the Friedmann cosmology and 
its generalizations, the joining together of the Schwarzschild geometry and the 
Friedmann geometry to describe the collapse of a bounded collection of matter, the 
vibrations of relativistic stars, weak gravitational waves propagating in an otherwise 
flat space: all these problems and others were solved by elementary means. 

But today one is pressed to understand situations devoid of symmetry and not 
amenable to perturbation theory: How do two black holes alter in shape, and how 
much gravitational radiation do they emit when they collide and coalesce? What 
are the structures and properties of the singularities at the endpoint of gravitational 
collapse, predicted by the theorems of Penrose, Hawking, and Geroch? Can a 
Universe that begins completely chaotic smooth itself out quickly by processes such 
as inhomogeneous mixmaster oscillations? 

To solve such problems, one needs new kinds of mathematical tools—and in 
response to this need, new tools are being developed. The “global methods” of 
Chapter 34 provide one set of tools. The Regge Calculus provides anothei 5 [Regge 
(1961); see also pp. 467-500 of Wheeler (1964a)]. 


§42.2. REGGE CALCULUS IN BRIEF 

Consider the geodesic dome that covers a great auditorium, made of a multitude 
of flat triangles joined edge to edge and vertex to vertex. Similarly envisage space- 
time, in the Regge calculus, as made of flat-space “simplexes” (four-dimensional 


Approximation of smooth 
geometries by skeleton 
structures 
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item in this progression: two dimensions, triangle; three dimensions, tetrahedron; 
four dimensions, simplex) joined face to face, edge to edge, and vertex to vertex. 
To specify the lengths of the edges is to give the engineer all he needs in order to 
know the shape of the roof, and the scientist all he needs in order to know the 
geometry of the spacetime under consideration. A smooth auditorium roof can be 
approximated arbitrarily closely by a geodesic dome constructed of sufficiently small 
triangles. A smooth spacetime manifold can be approximated arbitrarily closely by 
a locked-together assembly of sufficiently small simplexes. Thus the Regge calculus, 
reaching beyond ordinary algebraic expressions for the metric, provides a way to 
analyze physical situations deprived, as so many situations are, of spherical symme¬ 
try, and systems even altogether lacking in symmetry. 

If the designer can give the roof any shape he pleases, he has more freedom than 
the analyst who is charting out the geometry of spacetime. Given the geometry of 
spacetime up to some spacelike slice that, for want of a better name, one may call 
“now,” one has no freedom at all in the geometry from that instant on. Einstein’s 
geometrodynamic law is fully deterministic. Translated into the language of the 
Re 8S e calculus, it provides a means to calculate the edge lengths of new simplexes 
from the dimensions of the simplexes that have gone before. Though the geometry 
is deterministically specified, how it will be approximated is not. The original 
spacelike hypersurface (“now”) is approximated as a collection of tetrahedrons joined 
together face to face; but how many tetrahedrons there will be and where their 
vertices will be placed is the option of the analyst. He can endow the skeleton more 
densely with bones in a region of high curvature than in a region of low curvature 
to get the most “accuracy profit” from a specified number of points. Some of this 
freedom of choice for the lengths of the bones remains as one applies the geometro¬ 
dynamic law in the form given by Regge (1961) to calculate the future from the 
past. This freedom would be disastrous to any computer program that one tried 
to write, unless the programmer removed all indefiniteness by adding supplementary 
conditions of his own choice, either tailored to give good “accuracy profit,” or 
otherwise fixed. 

Having determined the lengths of all the bones in the portion of skeletonized 
spacetime of interest, one can examine any chosen local cluster of bones in and 
by themselves. In this way one can find out all there is to be learned about the 
geometry in that region. Of course, the accuracy of one’s findings will depend on 
the fineness with which the skeletonization has been carried out. But in principle 
that is no limit to the fineness, or therefore to the accuracy, so long as one is working 
in the context of classical physics. Thus one ends up with a catalog of all the bones, 
showing the lengths of each. Then one can examine the geometry of whatever 
spacelike surface one pleases, and look into many other questions besides. For this 
purpose one has only to pick out the relevant bones and see how they fit together. 

§42.3. SIMPLEXES AND DEFICIT ANGLES 

Figure 42.1 recalls how a smoothly curved surface can be approximated by flat 
triangles. All the curvature is concentrated at the vertices. No curvature resides at 


Role of Einstein field 
equation in fixing the 
skeleton structure 
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Deficit angle as a 
skeletonized measure of 
curvature: 

(1) in two dimensions 


(2) in n (or four) dimensions 



Figure 42.1. 

A 2-geometry with continuously varying curvature can be approximated arbitrarily closely by a polyhedron 
built of triangles, provided only that the number of triangles is made sufficiently great and the size of 
each sufficiently small. The geometry in each triangle is Euclidean. The curvature of the surface shows 
up in the amount of deficit angle at each vertex (portion ABCD of polyhedron laid out above on a 
flat surface). 


the edge between one triangle and the next, despite one’s first impression. A vector 
carried by parallel transport from A through B and C to D , and then carried back 
by another route through C and B to A returns to its starting point unchanged in 
direction, as one sees most easily by laying out this complex of triangles on a flat 
surface. Only if the route is allowed to encircle the vertex common to A , B , C, and 
D does the vector experience a net rotation. The magnitude of the rotation is equal 
to the indicated deficit angle, S, at the vertex. The sum of the deficit angles over 
all the vertices has the same value, 4-77, as does the half-integral of the continuously 
distributed scalar curvature ( {2) R = 2/a 2 for a sphere of radius a) taken over the 
entirety of the original smooth figure, 


2 


skeleton 

geometry 


actual smooth 
geometry 


(2) R (/(surface) = 477. 


(42.1) 


Generalizing from the example of a 2-geometry, Regge calculus approximates a 
smoothly curved ^-dimensional Riemannian manifold as a collection of «-dimen- 
sional blocks, each free of any curvature at all, joined by (n — 2)-dimensional regions 
in which all the curvature is concentrated (Box 42.1). For the four-dimensional 
spacetime of general relativity, the “hinge” at which the curvature is concentrated 
has the shape of a triangle, as indicated schematically in the bottom row of Figure 
42.2. In the example illustrated there, ten tetrahedrons have that triangle in common. 
Between one of these tetrahedrons and the next fits a four-dimensional simplex. 
Every feature of this simplex is determined by the lengths of its ten edges. One 
of the features is the angle a between one of the indicated tetrahedrons or “faces” 
of the simplex and the next. Thus a represents the angle subtended by this simplex 
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Box 42.1 THE HINGES WHERE THE CURVATURE IS CONCENTRATED IN THE 

"ANGLE OF RATTLE" BETWEEN BUILDING BLOCKS IN A SKELETON MANIFOLD 


Dimensionality of manifold 

-2 

3 

4 

Elementary flat-space 
building block: 

triangle 

tetrahedron 

simplex 

Edge lengths to define it: 

3 

4 

5 

Hinge where cycle of such 
blocks meet with a deficit 
angle or “angle of rattle” 5: 

vertex 

edge 

triangle 

Dimensionality of hinge: 

A 

0 

1 

2 

a 

“Content” of such a hinge: 

1 

length / 

area A 

Contribution from all hinges 
within a given small region 
to curvature of manifold: 





region 

region 

region 

Continuum limit of this quantity 
expressed as an integral over 
the same small region: 

- f ,2 >K( (2 >g) 1/2 d 2 X 

i J ( 3 )^( 3 )^ 1/2 

1 f «>*(_<4>£)l/2 d 4 x 



J 


at the hinge. Summing the angles a for all the simplexes that meet on the given 
hinge b J2. 6 Jl, and subtracting from 2n, one gets the deficit angle associated with that 
hinge. And by then summing the deficit angles in a given small n-volume with 
appropriate weighting (Box 42.1), one obtains a number equal to the volume integral 
of the scalar curvature of the original smooth ^-geometry. See Box 42.2. 


§42.4. SKELETON FORM OF FIELD EQUATIONS 

Rather than translate Einstein’s field equations directly into the language of the 

skeleton calculus, Regge turns to a standard variational principle from which Ein- Einstein-Hilbert variational 
stein’s law lets itself be derived. It says (see §§21.2 and 43.3) adjust the 4-geometry principle reduced to skeleton 
throughout an extended region of spacetime, subject to certain specified conditions form 
on the boundary, so that the dimensionless integral (action in units of ft\), 

I = (c 3 /\6irhG) f R( — g) 1/2 d*x, (42.2) 

is an extremum. This statement applies when space is free of matter and electromag- 





Figure 42.2. 

Cycle of building blocks associated with a single hinge. Top row, two dimensions: left, schematic 
association of vertices 5, Jf, *?/, T, <2ff with “hinge” at the vertex right, same, but with elementary 
triangles indicated in full. Middle row, three dimensions: left, schematic; right, perspective representation 
of the six tetrahedrons that meet on the “hinge” Bottom row, four dimensions; shown only 
schematically. The five vertices belong to one simplex, a four-dimensional region throughout 

the interior of which space is flat. The five vertices if £$$£ belong to the next simplex; and so on around 
the cycle of simplexes. The two simplexes just named interface at the tetrahedron ifjSfM, inside which 
the geometry is also flat. Between that tetrahedron and the next, ifi£i#<*>, there is a certain hyperdihedral 
angle a subtended at the “hinge” ifipftf. The value of this angle is completely fixed by the ten edge 
lengths of the intervening simplex This dihedral angle, plus the corresponding dihedral angles 

subtended at the hinge if£fl? by the other simplexes of the cycle, do not in general add up to 2tt. The 
deficit, the “angle of rattle” or deficit angle 5, gives the amount of curvature concentrated at the hinge 
ifi?$. There is no actual rattle or looseness of fit, unless one tries to imbed the cycle into an over-all 
flat four-dimensional space (analog of “stamping on” the collection of triangles, and seeing them open 
out by the amount of the deficit angle, as indicated in inset in Figure 42,1). 


netic fields,* a simplification that will be made in the subsequent discussion to keep 
it from becoming too extended. When in addition all lengths are expressed in units 
of the Planck length * 


L* = (hG/c 3 ) 1/2 = 1.6 x 10~ 33 cm, (42.3) 

and the curvature integral is approximated by its expression in terms of deficit angles, 
Regge shows that the statement SI = 0 (condition for an extremum!) becomes 

H 

(W«2 A h s h = 0. 

hinges 
h = i 


(42.4) 
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Box 42.2 FLOW DIAGRAMS FOR REGGE CALCULUS 



A skeleton 4-geometry is completely determined by all its edge lengths. From the 
edge lengths one gets the integrated curvature by pursuing, for each hinge in the 
4-geometry, the following flow diagram: 



One finds it natural to apply this analysis in either of two ways. First, one can probe 
a given 4-geometry (given set of edge lengths!) in the sense 
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Box 42.2 { continued ) 


Second—and this is the rationale of Regge calculus—one can use the skeleton 
calculus to deduce a previously unknown 4-geometry from Einstein’s geometrody- 
namic law, proceeding in the direction 






In the changes contemplated in this variational principle, certain edge lengths are 
thought of as being fixed. They have to do with the conditions specified at the 
boundaries of the region of spacetime under study. It is not necessary here to enter 
into the precise formulation of these boundary conditions, fortunately, since some 
questions of principle still remain to be clarified about the precise formulation of 
boundary conditions in general relativity (see §21.12). Rather, what is important 
is the effect of changes in the lengths of the edges of the blocks in the interior of 
the region being analyzed, as they augment or decrease the deficit angles at the 
various hinges. In his basic paper on the subject, Regge (1961) notes that the typical 
deficit angle S h depends in a complicated trigonometric way on the values of numer¬ 
ous edge lengths i p . However, he proves (Appendix of his paper) that “quite re¬ 
markably, we can carry out the variation as if the 8 h were constants,” thus reducing 
the variational principle to the form 3 

H 

(I/ 877 ) 2 = (42.5) 

hinges 
h = 1 


Here the change in area of the /i-th triangle-shaped hinge, according to elementary 
trigonometry, is 

p cotan 0*. 

P 


(42.6) 
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In this equation 8 ph is the angle opposite to the p-th edge in the triangle. Conse¬ 
quently, Einstein s equations in empty space reduce in skeleton geometry to the form 

2 cotan 8 ph = 0, (p = 1,2,...), (42.7) 

hinges that 
have the 
given edge 
p in common 

one equation for each edge length in the interior of the region of spacetime being 
analyzed. 

§42.5. THE CHOICE OF LATTICE STRUCTURE 

Two questions arise in the actual application of Regge calculus, and it is not clear 
that either has yet received the resolution which is most convenient for practical 
applications of this skeleton analysis: What kind of lattice to use? How best to 
capitalize on the freedom that exists in the choice of edge lengths? The first question 
is discussed in this section, the second in the next section. 

It might seem most natural to use a lattice made of small, nearly rectangular 
blocks, the departure of each from rectangularity being conditioned by the amount 
and directionality of the local curvature. However, such building blocks are “floppy.” 
One could give them rigidity by specifying certain angles as well as the edge lengths. 
But then one would lose the cleanness of Regge’s prescription: give edge lengths, 
and give only edge lengths, and give each edge length freely and independently, 
in order to define a geometry. In addition one would have to rederive the Regge 
equations, including new equations for the determination of the new angles. There¬ 
fore one discards the quasirectangle in favor of the simplex with its 5 • 4/2 = 10 edge 
lengths. This decided, one also concludes that even in flat spacetime the simplexes 
cannot all have identical edge lengths. Two-dimensional flat space can be filled with 
identical equilateral triangles, but already at three dimensions it ceases to be possible 
to fill out the manifold with identical equilateral tetrahedrons. One knows that a 
given carbon atom in diamond is joined to its nearest neighbors with tetrahedral 
bonds, but a little reflection shows that the cell assignable to the given atom is far 
from having the shape of an equilateral tetrahedron. 

Synthesis would appear to be a natural way to put together the building blocks: 
first make one-dimensional structures; assemble these into two-dimensional struc¬ 
tures; these, into three-dimensional ones; and these, into the final four-dimensional 
structure. The one-dimensional structure is made of points, 1, 2, 3,..., alternating 
with line segments, 12, 23, 34,.... To start building a two-dimensional structure, 
pick up a second one-dimensional structure. It might seem natural to label its points 
1)2,3,..., etc. However, that labeling would imply a cross-connection between 
1 and 1 , between 2 and 2 , etc., after the fashion of a ladder. Then the elementary 
cells would be quasirectangles. They would have the “floppiness” that is to be 
excluded. Therefore relabel the points of the second one-dimensional structure as 
H > 2J, 3^, etc. The implication is that one cross-connects 2%' with points 2 and 3 
of the original one-dimensional structure, etc. One ends up with something like the 


Einstein field equation 
reduced to skeleton form 


The choice of lattice structure: 
(1) avoiding floppiness 


(2) necessity for unequal 
edge lengths 


(3) construction of two- 
dimensional structures 
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(4) 3-D structures built from 
2-D structures by 
"method of blocks" 


(5) 3-D structures from 2-D 
by "method of spheres" 


girder structure of a bridge, fully rigid in the context of two dimensions, as desired. 
The same construction, extended, fills out the plane with triangles. One now has 
a simple, standard two-dimensional structure. One might mistakenly conclude that 
one is ready to go ahead to build up a three-dimensional structure: the mistake 
lies in the tacit assumption that the flat-space topology is necessarily correct. 

Let it be the problem, for example, to determine the development in time of a 
3-geometry that has the topology of a 3-sphere. This 3-sphere is perhaps strongly 
deformed from ideality by long-wavelength gravitational waves. A right arrangement 
of the points is the immediate desideratum. Therefore put aside for the present any 
consideration of the deformation of the geometry by the waves (alteration of edge 
lengths from ideality). Ask how to divide a perfect 3-sphere into two-dimensional 
sheets. Here each sheet is understood to be separated from the next by a certain 
distance. At this point two alternative approaches suggest themselves that one can 
call for brevity “blocks” and “spheres.” 

(1) Blocks. Note that a 3-sphere lets itself be decomposed into 5 identical, tetra¬ 
hedron-like solid blocks (5 vertices; 5 ways to leave out any one of these vertices!) 
Fix on one of these “tetrahedrons.” Select one vertex as summit and the face through 
the other three vertices as base. Give that base the two-dimensional lattice structure 
already described. Introduce a multitude of additional sheets piled above it as evenly 
spaced layers reaching to the summit. Each layer has fewer points than the layer 
before. The decomposition of the 3-geometry inside one “tetrahedron” is thereby 
accomplished. However, an unresolved question remains; not merely how to join 
on this layered structure in a regular way to the corresponding structure in the 
adjacent “tetrahedrons”; but even whether such a regular joinup is at all possible. 
The same question can be asked about the other two ways to break up the 3-sphere 
into identical “tetrahedrons” [Coxeter (1948), esp. pp. 292-293: 16 tetrahedrons 
defined by a total of 8 vertices or 600 tetrahedrons defined by a total of 120 vertices]. 
One can eliminate the question of joinup of structure in a simple way, but at the 
price of putting a ceiling on the accuracy attainable: take the stated number of 
vertices (5 or 8 or 120) as the total number of points that will be employed in the 
skeletonization of the 3-geometry (no further subdivision required or admitted). 
Considering the boundedness of the memory capacity of any computer, it is hardly 
ridiculous to contemplate a limitation to 120 tracer points in exploratory calculations! 

(2) Spheres . An alternative approach to the “atomization” of the 3-sphere begins 
by introducing on the 3-sphere a North Pole and a South Pole and the hyperspherical 
angle x (x = 0 at the first pole, X = * at the second, x = w/2 at the equator; see 
Box 27.2). Let each two-dimensional layer lie on a surface of constant x (X equal 
to some integer times some interval Ax). The structure of this 2-sphere is already 
to be regarded as skeletonized into elementary triangles (“fully complete Buckminster 
Fuller geodesic dome”). Therefore the number of “faces” or triangles F, the number 
of edge lengths E , and the number of vertices V must be connected by the relation 
of Euler: 

F _ F , v / a topology-dependent \ _ [2 for 2-sphere, ^ g . 

\number or “Euler character”/ 10 for 2-torus. 


It follows from this relation that it is impossible for each vertex to sit at the center 
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of a hexagon (each vertex the point of convergence of 6 triangles). This being the 
case, one is not astonished that a close inspection of the pattern of a geodesic dome 
shows several vertices where only 5 triangles meet. It is enough to have 12 such 
5-triangle vertices among what are otherwise all 6-triangle vertices in order to meet 
the requirements of the Euler relation: 

n 5-triangle vertices 

V — n 6-triangle vertices 

F = (V — n)( 6/3) + n( 5/3) triangles 
E = (V — ri){6/2) + n(5/2) edges (42.9) 

V = (V — n)(6 /6) + n vertices 
( ^ — F — E + V = n/6 Euler characteristic 

n = 12 

Among all figures with triangular faces, the icosahedron is the one with the smallest 
number of faces that meets this condition (5-triangle vertices exclusively!) 

If each 2-surface has the pattern of vertices of a geodesic dome, how is one dome 
to be joined to the next to make a rigid skeleton 3-geometry? Were the domes 
imbedded in a flat 3-geometry, rigidity would be no issue. Each dome would already 
be rigid in and by itself. However, the 3-geometry is not given to be flat. Only by 
a completely deterministic skeletonization of the space between the two 2-spheres 
will they be given rigidity in the context of curved space geometry. (1) Not by running 
a single connector from each vertex in one surface to the corresponding vertex in 
the next (“floppy structure”!) (2) Not by displacing one surface so each of its vertices 
comes above, or nearly above, the center of a triangle in the surface “below.” First, 
the numbers of vertices and triangles ordinarily will not agree. Second, even when 
they do, it will not give the structure the necessary rigidity to connect the vertex 
of the surface above to the three vertices of the triangle below. The space between 
will contain some tetrahedrons, but it will not be throughout decomposed into 
tetrahedrons. (3) A natural and workable approach to the skeletonization of the 
3-geometry is to run a connector from each vertex in the one surface to the corre¬ 
sponding vertex in the next, but to flesh out this connection with additional structure 
that will give rigidity to the 3-geometry: intervening vertices and connectors as 
illustrated in Box 42.3. 

In working up from the skeletonization of a 3-geometry to the skeletonization ( 6 ) 4 -D structures built from 
o a geometry, it is natural to proceed similarly. (1) Use identical patterns of points 3-D structures 
in the two 3-geometries. (2) Tie corresponding points together by single connectors. 

(3) Halfway, or approximately half way between the two 3-geometries insert a whole 
additional pattern of vertices. Each of these supplementary vertices is “dual” to and 
lies nearly “below” the center of a tetrahedron in the 3-geometry immediately above. 

(4) Connect each supplementary vertex to the vertices of the tetrahedron immediately 
above, to the vertices of the tetrahedron immediately below, and to those other 
supplementary vertices that are its immediate neighbors. (5) In this way get the edge 
lengths needed to divide the 4-geometry into simplexes, each of rigidly defined 
dimensions. 






1176 


42. REGGE CALCULUS 
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Box 42.3 SYNTHESIS OF HIGHER-DIMENSIONAL SKELETON GEOMETRIES OUT OF 
LOWER-DIMENSIONAL SKELETON GEOMETRIES 



(1) One-dimensional structure as alternation of 
points and line segments. (2) Two-dimensional 
structure (a) “floppy” (unacceptable) and (b) rigid- 
ified (angles of triangles fully determined by edge 
lengths). When this structure is extended, as at 
right, the “normal” vertex has six triangles hinging 
on it. However, at least twelve 5-triangle vertices 
of the type indicated at d are to be interpolated 
if the 2-geometry is to be able to close up into 
a 2-sphere. (3) Skeleton 3-geometry obtained by 
filling in between the skeleton 2-geometry... 
d9 ... ff 96 and the similar structure 

... d '$'... 9'9'6 '... £'<%}'... as follows, (a) In¬ 
sert direct connectors such as 99* between 
corresponding points in the two 2-geometries, (b) 
Insert an intermediate layer of “supplementary 
vertices” such as ... Each of these 

supplementary vertices lies roughly halfway be¬ 
tween the center of the triangle “above” it and the 
center of the corresponding triangle “below” it. (c) 


Connect each such “supplementary vertex” with 
its immediate neighbors above, below, and in the 
same plane, (d) Give all edge lengths, (e) Then 
the skeleton 3-geometry between the two 2-ge¬ 
ometries is rigidly specified. It is made up of five 
types of tetrahedrons, as follows. (1) “Right- 
through blocks,” such as 99'S?F (six of these 
hinge on 99* when 9 is a normal vertex; five, 
when it is a 5-fold vertex, such as indicated by d 
at the upper right). (2) “Lower-facing blocks,” such 
as d&93~. (3) “Lower-packing blocks,” such as 
d9S?T. (4, 5) Corresponding “upper-facing blocks” 
and “upper-packing blocks” (not shown). The 
number of blocks of each kind is appropriately 
listed here for the two extreme cases of a 2-geom- 
etry that consists (a) of a normal hexagonal lattice 
extending indefinitely in a plane and (b) of a lat¬ 
tice consisting of the minimum number of 5-fold 
vertices (“type d vertices”) that will permit close- 
up into a 2-sphere. 
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r 


2-geometry of upper Hexagonal pattern Icosahedron 

_ (or lower) face _ of triangles ma de of triangles 

Its topology 
Vertices on upper face 
Nature of these vertices 
Edge lengths on upper face 
Triangles on upper face 
Number of “supplementary vertices” 


Infinite 2-plane 

2-sphere 

V 

12 

6-fold 

5-fold 

3 V 

\V= 30 

2V 

20 

2V 

20 



Outer facing blocks 
Outer packing blocks 
Right through blocks 
Inner packing blocks 
Inner facing blocks 

I 


2V 

20 

3V 

30 

6V 

60 

3V 

30 

2V 

20 




§42.6. THE CHOICE OF EDGE LENGTHS 

So much for the lattice structure of the 4-geometry; now for the other issue, the 
freedom that exists in the choice of edge lengths. Why not make the simplest choice 
and let all edges be light rays? Because the 4-geometry would not then be fully 
determined. The geometry g aP {x*) differs from the geometry \(x») g aj8 (x #1 ), even 
though the same points that are connected by light rays in the one geometry are 
also connected by light rays in the other geometry. 

If none of the edges is null, it is nevertheless natural to take some of the edge 
lengths to be spacelike and some to be timelike. In consequence the area A of the 
triangle in some cases will be real, in other cases imaginary. In 3-space the parallelo¬ 
gram (double triangle) spanned by two vectors B and C is described by a vector 

2 A = B x C 

perpendicular to the two vectors. One obtains the magnitude of A from the formula 

4 A 2 = B 2 C 2 - (B • C) 2 . 


In 4-space, let B and C be two edges of the triangle. Then, as in three dimensions, 
2 A is dual to the bivector built from B and C. In other words, if B goes in the t 
direction and C in the z direction, then A is a bivector lying in the (x,y) plane. 
Consequently its magnitude A is to be thought of as a real quantity. Therefore the 
appropriate formula for the area A is (Tullio Regge) 

4A 2 = (B • C) 2 — B 2 C 2 . (42.10) 

The quantity A is real when the deficit angle 8 is real. Thus the geometrically 
important product AS is also real. 


The choice of edge lengths 


(1) choose some timelike, 
others spacelike 
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(2) choose timelike lengths 
comparable to spacelike 
lengths 


(3) why some lengths must 
be chosen arbitrarily 


Deficit angles in terms of 
edge lengths 


Past applications of Regge 
calculus 


When the hinge lies in the (x,y) plane, on the other hand, the quantity A is purely 
imaginary. In that instance a test vector taken around the cycle of simplexes that 
swing on this hinge has undergone change only in its z and t components; that is, 
it has experienced a Lorentz boost; that is, the deficit angle 5 is also purely imaginary. 
So again the product AS is a purely real quantity. 

Turn now from character of edge lengths to magnitude of edge lengths. It is 
desirable that the elementary building blocks sample the curvatures of space in 
different directions on a roughly equal basis. In other words, it is desirable not to 
have long needle-shaped building blocks nor pancake-shaped tetrahedrons and 
simplexes. This natural requirement means that the step forward in time should be 
comparable to the steps “sidewise” in space. The very fact that one should have 
to state such a requirement brings out one circumstance that should have been 
obvious before: the “hinge equations” 

2 8 h cotan 9 ph = 0 (/? = 1,2,.. .), (42.7) 

hinges h that 
have edge p 
in common 

though they are as numerous as the edges, cannot be regarded as adequate to 
determine all edge lengths. There are necessarily relations between these equations 
that keep them from being independent. The equations cannot determine all the 
details of the necessarily largely arbitrary skeletonization process. They cannot do 
so any more than the field equations of general relativity can determine the coordi¬ 
nate system. With a given pattern of vertices (four-dimensional generalization of 
drawings in Box 42.3), one still has (a) the option how close together one will take 
successive layers of the structure and (b) how one will distribute a given number 
of points in space on a given layer to achieve the maximum payoff in accuracy 
(greater density of points in regions of greater curvature). To prepare a practical 
computer program founded on Regge calculus, one has to supply the machine not 
only with the hinge equations and initial conditions, but also with definite algorithms 
to remove all the arbitrariness that resides in options (a) and (b). 

Formulas from solid geometry and four-dimensional geometry, out of which to 
determine the necessary hyperdihedral angles a and the deficit angles 8 in terms 
of edge lengths and nothing but edge lengths, are summarized by Wheeler (1964a, 
pp. 469, 470, and 490) and by C. Y. Wong (1971). Regge (1961) also gives a formula 
for the Riemann curvature tensor itself in terms of deficit angles and number of 
edges running in a given direction [see also Wheeler (1964a, p. 471)]. 


§42.7. PAST APPLICATIONS OF REGGE CALCULUS 

Wong (1971) has applied Regge calculus to a problem where no time development 
shows itself, where the geometry can therefore be treated as static, and where in 
addition it is spherically symmetric. He determined the Schwarzschild and Reissner- 
Nordstrom geometries by the method of skeletonization. Consider successive spheres 
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surrounding the center of attraction. Wong approximates each as an icosahedron. 
The condition 

<3 ) R = I6 w ( ener §y den sity\ 

Von the 3-space/ 

(§21.5) gives a recursion relation that determines the dimension of each icosahedron 
m terms of the two preceding icosahedra. Errors in the skeleton representation of 
the exact geometry range from roughly 10 percent to less than 1 percent, depending 

on the method of analysis, the quantity under analysis, and the fineness of the 
subdivision. 

Skeletonization of geometry is to be distinguished from mere rewriting of partial 
differential equations as difference equations. One has by now three illustrations 
that on^ can capitalize on skeletonization without fragmenting spacetime all the way 
to the level of individual simplexes. The first illustration is the first part of Wong’s 
work, where the time dimension never explicitly makes an appearance, so that the 
building blocks are three-dimensional only. The second is an alternative treatment, 
also given by Wong, that goes beyond the symmetry in t to take account of the 
symmetry in 6 and <f>. It divides space into spherical shells, in each of which the 
geometry is pseudo-flat” in much the same sense that the geometry of a paper cone 
is flat. The third is the numerical solution for the gravitational collapse of a spherical 
star by May and White (1966), in which there is symmetry in 0 and <j>, but not in 
r or t. This zoning takes place exclusively in the r, t-plane. Each zone is a spherical 
shell. The difference as compared to Regge calculus (flat geometry within each 
building block) is the adjustable “conicity” given to each shell. The examples show 
that the decision about skeletonizing the geometry in a calculation is ordinarily not 
“whether” but “how much.” 


§42.8. THE FUTURE OF REGGE CALCULUS 

In summary, Regge’s skeleton calculus puts within the reach of computation prob¬ 
lems that in practical terms are beyond the power of normal analytical methods. 
It affords any desired level of accuracy by sufficiently fine subdivision of the space- 
time region under consideration. By way of its numbered building blocks, it also 
offers a practical way to display the results of such calculations. Finally, one can 
hope that Regge’s truly geometric way of formulating general relativity will someday 
make the content of the Einstein field equations (Cartan’s “moment of rotation”; 
see Chapter 15) stand out sharp and clear, and unveil the geometric significance 
of the so-called “geometrodynamic field momentum” (analysis of the boundary-value 
problem associated with the_yariatitmal problem of general relativity in Regge 
calculuij'sce §2T.12). 


Partial skeletonization 


Hopes for the future 


CHAPTER 



SUPERSPACE: ARENA FOR 
THE DYNAMICS OF GEOMETRY 


Traveler , there are no paths. 
Paths are made by walking. 

ANTONIO MACHADO (1940) 




- \ 

This chapter is entirely Track 2. 
Chapter 21 (initial-value 
formalism) is needed as 
preparation for it. in reading it, 
one will be helped by Chapter 
42 (Regge calculus), it is not 
needed as preparation for any 
later chapter, but it will be 
helpful in Chapter 44 (vision of 
the future). 

__ J 


Superspace is the arena for 
geometrodynamics 


§43.1. SPACE, SUPERSPACE, AND SPACETIME 
DISTINGUISHED 

Superspace [Wheeler (1964a), pp. 459 ff] is the arena of geometrodynamics. The 
dynamics of Einstein’s curved space geometry runs its course in superspace as the 
dynamics of a particle unfolds in spacetime. This chapter gives one simple version 
of superspace, and a little impression of alternative versions of superspace that also 
have their uses. It describes the classical dynamics of geometry in superspace in terms 
of the Hamilton-Jacobi principle of Boxes 25.3 and 25.4. No version of mechanics 
makes any shorter the leap from classical dynamics to quantum. Thus it provides 
a principle (“Einstein-Hamilton-Jacobi or EHJ equation”) for the propagation of 
wave crests in superspace, and for finding where those wave crests give one the 
classical equivalent of constructive interference (“envelope formation”). In this way 
one finds the track of development of 3-geometry with time expressed as a sharp, 
thin “leaf of history” that slices through superspace. The quantum principle^replaces 
this deterministic account with a fuzzed-out leaf of history of finite thickness. In 
consequence, quantum fluctuations take place in the geometry of space that dominate 
the scene at distances of the order oTrhe Planck length, L* = ( hG/c 3 ) v2 = 1.6 X 
10 -33 cm, and less. The present survey simplifiesT>y considering-onlyjhe^dynamics 
of curved empty space. When sources are present and are to be taken into account, 
supplementary terms are to be added, some of the literature on which is listed. 

In all the difficult investigations that led in the course of half a century to some 
understanding of the dynamics of geometry, both classical and quantum, the most 
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Box 43.1 GEOMETRODYNAMICS COMPARED WITH PARTICLE DYNAMICS 


Concept 


Dynamic entity 

Descriptors of momentary 
configuration 

Classical history 

History is a stockpile of 
configurations? 


Dynamic arena 


Particle dynemics 


Particle 

/ (“event”) 

* = *(t) 

Yes. Every point on world 
line gives a momentary 
configuration of particle 


Spacetime (totality of all 
points jc, t ) 


Geo/?? e tro dynamics 


Space 

(3) & (“3-geometry”) 

(4) & (“4-geometry”) 

Yes. Every spacelike 
slice through (4) & gives 
a momentary configura¬ 
tion of space 

Superspace (totality of all 
(3, £’s) 


j 


difficult point was also the simplest: The dynamic object is not spacetime. 1, is space. 3-geo m .„ y i, the 
The geometric configuration of space changes with time. But it is space, three-di- object 
mensional space, that does the changing (see Box 43.1). 

Space will be treated here as “closed” or, in mathematical language, “compact« 
either because physics adds to Einstein’s second-order differential equations the 
requirement of closure as a necessary and appropriate boundary condition [Einstein 
(1934, p. 52; 1950); Wheeler (1959; 1964c). Honl (1962); see also §21.12] or because 
that requirement simplifies the mathematical analysis, or for both reasons together. 

out “j :::rr: imatea r oot > ci r ed 3 : seometrybya skeiet ° n ^ Fim,e-dimension 

rons, as indicated schematically in Figure 43.1 (see Chapter 42 on superspace" 
the Regge calculus). Specify the 98 edge-lengths in this example to fix all the features 
of the geometry; and fix these 98 edge-lengths by giving the location of a single 
point m a space of 98 dimensions. This 98-dimensional manifold, this “truncated 
superspace,” goes over into superspace [Wheeler (1964a), pp. 453, 459, 463, 495] 
in the idealization in which the tracer points increase in density of coverage without 
limit. Accounts of superspace with more mathematical detail are given by DeWitt 
(1967a,b), Wheeler (1970), and Fischer (1970). 6 ^ 

Let the representative point move from one location to a nearby location either 
m truncated superspace or in full superspace. Then all edge-lengths alter, and the 
3-geometry of Figure 43.1 moves as if alive. No better illustration can one easily 
supply of what it means to speak of the “dynamics of space.” 

The term 3-geometry makes sense as well in quantum geometrodynamics as 
m classical theory. So does superspace. But spacetime does not. Give a 3-geometry 
and g.ve its time rate of change. That is enough, under typical circumstances (see 
Chapter 21) to fix the whole time-evolution of the geometry; enough in other words, 
to determine the entire four-dimensional spacetime geometry, provided one is 
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Figure 43.1. 

Superspace in the simplicial approximation. Upper left, space (depicted as two-dimensional but actually 
three-dimensional). Upper right, simplical approximation to space. The approximation can be made 
arbitrarily good by going to the limit of an arbitrarily fine decomposition. The curvature at a typical 
location is measured by a deficit angle. This angle is completely determined by the edge lengths (L v 
Z. 2 , . . . L s in the figure) of the simplexes that meet at that location. When there are 98 edge lengths 
altogether in the simplicial representation of the geometry, then this geometry is completely specified 
by a single point in a 98-dimensional space (lower diagram; “superspace”). 


The concept of spacetime is 
incompatible with the 
quantum principle 


considering the problem in the context of classical physics. In the real $orld of 
quantum physics, however, one cannot give both a dynamic variable and its time-rate 
of change. The principle of complementarity forbids. Given the precise 3-geometry 
at one instant, one cannot also know at that instant the time-rate of change of the 
3-geometry. In other words, given the geometrodynamic field coordinate, one cannot 
know the geometrodynamic field momentum. If one assigns the intrinsic 3-geometry, 
one cannot also specify the extrinsic curvature. 

The uncertainty principle thus deprives one of any way whatsoever to predict, 
or even to give meaning to, “the deterministic classical history of space evolving 
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m time.” No prediction of spacetime, therefore no meaning for spacetime, is the verdict 
of the quantum principle. That object which is central to all of classical general 
relativity, the four-dimensional spacetime geometry, simply does not exist, except 
m a classical approximation. 

These considerations reveal that the concepts of spacetime and time are not 
primary but secondary ideas in the structure of physical theory. These concepts are 
valid in the classical approximation. However, they have neither meaning nor 
application under circumstances where quantum geometrodynamic effects become 
important. Then one has to forego that view of nature in which every event, past, 
present, or,future, occupies its preordained position in a grand catalog called 
spacetime,” with the Einstein interval from each event to its neighbor eternally 
established. There is no spacetime, there is no time, there is no before, there is no 
after. The question of what happens “next” is without meaning. 

Tha&pacetime is not the right way does not mean there is no right way to describe 
the dynamics of geometry consistent with the quantum principle. Superspace is the 
Key to one right way to describe the dynamics (see Figure 43.2). 


Spacetime 

Space 



Figure 43.2. 

Space, spacetime, and superspace. Upper left: Five sample configurations, A, B, C, D, E, attained by 
space in the course of its expansion and recontraction. Below: Superspace and these five sample 
configurations, each represented by a point in superspace. Upper right: Spacetime. A spacelike cut lfke 
A, through spacetime gives a momentary configuration of space. There is no compulsion to limit attention 

s°a a ZTIT^ family ° f SliC , eS ’ A ’ B ’ C • £ thr0Ugh S P aCetime - The P hrase “ntany-fingered time” 

is slogan telling one not to so limit one’s slices, and B' is an example of this freedom in action The 

-geometries B and A, B, C, D, £, like all 3-geometries obtained by all spacelike slices whatsoever through 
he given classical spacetime, lie on a single bent leaf of history, indicated in the diagram, and cutting 
in slice through superspace. A different spacetime, in other words, a different solution of Einstein’s 
field equation, means a different leaf of history (not indicated) slicing through superspace 
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Spacetime is a classical leaf 
of history slicing through 
superspace 


§43.2. THE DYNAMICS OF GEOMETRY DESCRIBED 
IN THE LANGUAGE OF THE SUPERSPACE 

OF THE (3) £'s 

Given a spacetime, one can construct the corresponding leaf of history slicing through 
superspace. Conversely, given the leaf of history, one can reconstruct the spacetime. 

Consider the child’s toy commonly known as “Chinese boxes.” One opens the 
outermost box only to reveal another box; when the second box is opened, there 
is another box, and so on, until eventually there are dozens of boxes scattered over 
the floor. Or conversely the boxes can be put back together, nested one inside the 
other, to reconstitute the original package. The packaging of (3) £>’s into a is much 
more sophisticated. Nature provides no monotonic ordering of the (3) £>’s. Two of the 
dynamically allowed (3) £>’s, taken at random, will often cross each other one or more 
times. When one shakes the {4) h apart, one therefore gets enormously more (3) &’s 
“spread out over the floor” than might have been imagined. Conversely, when one 
puts back together all of the (3) £>’s lying on the leaf of history, one gets a structure 
with a rigidity that might not otherwise have been foreseen. This rigidity arises from 
the infinitely rich interleaving and intercrossing of clear-cut, well-defined (3) &’s one 
with another. 

In summary: (1) Classical geometrodynamics in principle constitutes a device, an 
algorithm, a rule for calculating and constructing a leaf of history that slices through 
superspace. (2) The (3) £>’s that lie on this leaf of history are YES 3-geometries; the 
vastly more numerous (3) £fs that do not are NO 3-geometries. (3) The YES (3) &’s 
are the building blocks of the {4) h that is classical spacetime. (4) The interweaving 
and interconnections of these building blocks give the {4) h its existence, its dimen¬ 
sionality, and its structure. (5) In this structure every i3) h has a rigidly fixed location 
of its own. (6) In this sense one can say that the “many-fingered time” of each 
3-geometry is specified by the very interlocking construction itself. Baierlein, Sharp 
and Wheeler (1962) say a little more on this concept of “3-geometry as carrier of 
information about time ” 

How different from the textbook concept of spacetime! There the geometry of 
spacetime is conceived as constructed out of elementary objects, or points, known 
as “events.” Here, by contrast, the primary concept is 3-geometry, in abstracto , and 
out of it is derived the idea of event. Thus, (1) the event lies at the intersection 
of such and such (3) £’s; and (2) it has a timelike relation to (earlier or later than, 
or synchronous with) some other (3) £>, which in turn (3) derives from the intercrossings 
of all the other (3) £)’s. $ 

When one turns from classical theory to quantum theory, one gives up the concept 
of spacetime, except in the semiclassical approximation. Therefore, one gives up any 
immediate possibility whatsoever of defining the concept, normally regarded as so 
elemental, of an “event.” The theory itself, however, here as always [Bohr and 
Rosenfeld (1933)], defines in and by itself, in its own natural way, the procedures- 
in-principle for measuring all those quantities that are in principle measurable. 

Quantum theory upsets the sharp distinction between YES 3-geometries and NO 
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am^mdf S ’ * &SSignS *° 3 ' ge ° metry n0t a YES or a NO, but a probability Probability amplitude for 
* ’ 3-geometry 

'P = >K (3, £). (43.1) 

This probability amplitude is highest near the classically forecast leaf of history and 

falls off steeply outside a zone of finite thickness extending a little way on either 
side of the leaf. 

Were one to take, instead of a physically relevant probability amplitude function, 
a typical solution of the relevant wave equation, one would have to expect to see 
not one trace of anything like classical geometrodynamics. The typical probability 
amplitude function is spread all over superspace. No surprise! Already in classical 
theory one has to reckon with a Hamilton-Jacobi function, 


s S=S(< 3 *), (43.2) 

spread out over superspace. Moreover, this “dynamic phase function” of classical 
geometrodynamics gives at once the phase of i>, according to the formula 


^ = /s lowl y varying \ 

V amplitude function/ ’ 


(43.3) 


indication enough that ^ and S are both unlocalized. 

Dynamics first clearly becomes recognizable when sufficiently many such spread- 
out probability amplitude functions are superposed to build up a localized wave 
packet, as in the elementary examples of Boxes 25.3 and 25.4; thus, 


Wave packet recovers 
classical geometrodynam' 


$ = eWq + c 2 4> 2 + .... (43_ 4) 

Constructive interference occurs where the phases of the several individual waves 
agree: 

S 1 ( (3) *) = 5 2 0 3 »*)=.... (43.5) 

This is the condition that distinguishes YES 3-geometries from NO 3-geometries. 
It is the tool for constructing a leaf of history in superspace. It is the key to the 
dynamics of geometry. Moreover, it is an equation that says not one word about 
the quantum principle. It is not surprising that the equation of constructive interfer¬ 
ence in (43.5) makes the leap from classical theory to quantum theory the shortest. 


§43.3. THE EINSTEIN-HAMILTON-JACOBI EQUATION 

Should one write down a differential equation for the Hamilton-Jacobi function 
SO 3 '*), solve it, and then analyze the properties of the solution? The exact opposite 
is simpler: look at the properties of the solution, and from that inspection find out 
what equation the dynamic phase or action S must satisfy. 
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Hilberts’ principle of least action reads 

Libert = (l/16w) f ( 4 > R(-g ) 1/2 d 4 x = extremum. (43.6) 

After one separates off complete derivatives in the integrand, what is left [see 
equations (21.13) and (21.95)] becomes 

(1/16t7)/ adm 

Arue (1/16 -tt) f |w y dgij/dt + Ng 1/2 R 

+ Ng ~ 1/2 [y (Tr n) 2 - Tr (/7 2 )] + 2N j w w li ] d 4 x. (43.7) 

In (43.7), but not in (43.6), g stands for the determinant of the three-dimensional 
metric tensor, and R for the scalar curvature invariant of the 3-geometry; the 
suffix (3) is omitted for simplicity. The integral is extended from (1) a spacelike 
hypersurface on which a 3-geometry is given with metric g {j \x,y, z) to (2) a spacelike 
hypersurface on which a 3-geometry is given with metric &/'(*> z ). Whatever is 
adjustable in the chunk of spacetime between is now to be considered as having 
been adjusted to extremize the integral. Therefore the value of the integral / ADM 
becomes a functional of the metrics on the two hypersurfaces and nothing more. 

Next, holding fixed the metric g / i j(x,y,z) on the earlier hypersurface, change 
slightly or even more than slightly the metric on the later hypersurface. Solve the 
new variation problem and get a new value of / ADM . Proceeding further in this way, 
for each new gy" one gets a new value of / ADM . Call the functional / ADM of the 
metric defined in this way “Hamilton’s principal function,” or the “action” or the 
“dynamic path length,*” S(g {j (x,y, z)) of the “history of geometry” that connects 
the two given 3-geometries. The double prime suffix is dropped from g {j " here and 
hereafter to simplify the notation. One knows from other branches of mechanics 
that the quantity defined in this way, Sig^), when it exists, even though it is a special 
solution, nevertheless is always a solution of the Hamilton-Jacobi equation. Jacobi 
could look for more general solutions, but Hamilton already had one! 

For (43.7) to be an extremal with respect to variations of the lapse N and the 
shift components N { , it was necessary (see Chapter 21) that the coefficients of these 
four quantities should vanish; thus, 

£~ 1/2 [y(Tr/7) 2 - Tr/7 2 ] + g l ' 2 R = 0 (43.8) 

and 5 

A = 0. (43.9) 

In the expression for the extremal value of the action, only one term, the first, is 
left: 

S(g(x,y, z)) 

/aDM, extremal f ’ {^’dgij/dt} d 4 x. ( 43 . 10 ) 

«'n 


Actually S = ~ 16775^ = 1677 (true dynamic path length). 
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The effect of a slight change, 5 g ijt in the 3-metric at the upper limit is therefore 
easy to read off: 

ss = f 7, z) 8gij(x, y, z) d 3 x. (43.11) 


The language of “functional derivative” [see, for example, Bogoliubov and Shirkov 
(1959)] allows one to speak in terms of a derivative rather than an integral: 

SS „ 

wr (43 - I2) 

The “field momenta” acquire a simple meaning: they give the rate of change of 
the action with respect to the continuous infinitude of “field coordinates,” gi(x,y, z). 
(Here tjie x,y, z, as well as the / and j, serve as mere labels.) 

Although the phase function S appears to depend on all six metric coefficients 
Si) individually, it depends in actuality only on that combination of the g J; . which 
is locked to the 3-geometry. To verify this point, express a particular 3-geometry 
ii throughout one local coordinate patch in terms of one set of coordinates x p 
by one set of metric coefficients g pq . Reexpress the same 3-geometry in terms of 
coordinates x p shifted by the small amount 


Geometrodynamic 
momentum as rate of change 
of dynamic path length with 
respect to 3-geometry of 
terminal hypersurface 


Action depends on 
3-geometry, not on metric 
coefficients individually 


x p - x p - £p. (43.13) 

To keep the 3-geometry the same, that is, to keep unchanged the distance ds from 
one coordinate-independent point to another, the metric coefficients have to change: 

_ Spq Spq ~b £p|g + £g|p- (43.14) 

Let the phase function S (or in quantum mechanics, let the probability amplitude 
ip) be considered to be expressed as a functional of the metric coefficients g u (x), 
g 12 (x ),..., g 33 (x). Changes Sg pQ (x) in these coefficients alter the H-J phase function 
and the probability amplitude by the amounts 


SS = f (SS/Sg pq )Sg M d 3 x, 
W = / (fy/ S gpq) Sgpq d 3 X, 


(43.15) 


according to the standard definition of functional derivative. Therefore the coordi¬ 
nate change produces an ostensible change in the dynamic path length or phase 
S given by 


SS = f (SS/Sg pq )(i plq + i qlp ) d 3 x 
= -2 f (6S/Sg pq ) lq t p d 3 x. 


(43.16) 


This change must vanish if S’ is to depend on the 3-geometry alone, and not on 
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the coordinates in terms of which that 3-geometry is expressed; and must vanish, 
moreover, for arbitrary choice of the £ p . From this condition, one concludes 



(43.17) 


Likewise, one finds three equations on the wave function \p itself, as distinguished 
from its phase S/fr, thus, 



(43.18) 


But (43.17), by virtue of (43.12), is identical with (43.9). In this sense (43.9) merely 
verifies what one already knew had to be true: the classical Hamilton-Jacobi function 
S (like the probability amplitude function \p of quantum theory) depends on 3-ge¬ 
ometry, not on individual metric coefficients, and not on choice of coordinates. 

All the dynamic content of geometrodynamics is summarized in the sole remaining 
equation (43.8), which takes the form 


Law of propagation of wave 
crests in superspace 


S~ 1 / 2 \jS Pg Srs - SprS q s)jf-j~- + g 1/2 R = 0. (43.19) 

L ^ J °Spq °5rs 


This is the Einstein-Hamilton-Jacobi equation, first given explicitly in the literature 
by Peres (1962) on the foundation of earlier work by himself and others on the 
Hamiltonian formulation of geometrodynamics. This equation tells how fronts of 
constant S (“wave crests”) propagate in superspace. 

That the one EHJ equation (43.19) contains as much information as all ten 
components of Einstein’s field equation has been demonstrated by Gerlach (1969). 
The central point in the analysis is the principle of constructive interference, and 
the main requirement for a proper treatment of this point is the concept of a 
completely parametrized solution of the EHJ equation. 

The problem of a particle moving in two-dimensional space, as treated by the 
Hamilton-Jacobi method in Boxes 25.3 and 25.4, required for complete analysis a 
solution containing two distinct and independently adjustable parameters, the energy 
per unit mass, E , and angular momentum per unit mass, Z; thus 

S(r, 6 , (; E , Z) = —Et + L6 


+ f r [E 2 ~ 0 - 2M/r)(l + L 2 /r 2 )Y' 2 - + S(E, L ). 5 (43.20) 


Here the additive phase 8(E , L) is required if one is to be able to arrange for the 
particle to arrive at a given r-value at a specified t value and at a specified value 
of 6. One thinks of superposing four probability amplitudes, as in (43.4), with 
dynamic phases, S, given by (43.20) and the parameters taking on, respectively, the 
following four sets of values: (E , Z); (E + AE , Z); ( E , Z + AL); and (E + AE , 
L + AL), The principle of constructive interference leads to the conditions 
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dS/dE = 0, 
dS/dL = 0. 


(4321) 


The points in the spacetime ( r , 6, t) that satisfy these conditions are the YES points; 
they he on the world line. The ones that don’t are the NO points. 

The desired solution of the EHJ equation (43.19) contains not two parameters 
(plus an additive phase, <5, depending on these two parameters), but an infinity of 
parameters, and even a continuous infinity of parameters. Thus the parameters are 
not to be designated as a x , a 2 ,...; /J x , P 2 ,... (parameters labeled by a discrete index), 
but as 

a(u, v, w) 


and 

* £(«, v, w) 


Condition of constructive 
interference gives classical 
“leaf of history" or 
spacetime 


is identical with the leaf of history, or equivalent 4-geometry, given by the ten 
components of Einstein’s geometrodynamic law. 

From the Hamilton-Jacobi equation for a problem in elementary mechanics, it 
is a short step to the corresponding Schroedinger equation; similarly in geometro- 
dynamics. No one has done more than Bryce DeWitt to explore the meaning and 
consequences of this “Einstein-Schroedinger equation” [DeWitt (1967a,b)]. One of 
the most interesting consequences is the existence of a conserved current in super¬ 
space, analogous to the conserved current 



that one encounters in the Klein-Gordon wave equation for a particle of spin zero. 

It is an unhappy feature of this Einstein-Schroedinger wave equation that it 
contains second derivatives; so one has to specify both the probability amplitude, 
and the normal derivative of the probability amplitude, on the appropriate “super- 


(two parameters “labeled” by three continuous indices w, y, w). Accidentally omit 
one of this infinitude of parameters? How could one ever hope to know that what 
purported to be a complete solution of the EHJ equation was not in actuality 
complete? Happily Gerlach provides a procedure to test the parameters for com¬ 
pleteness. 

Granted completeness, Gerlach goes on to show that the “leaf of history in 
superspace” or collection of 3-geometries that satisfy the conditions of constructive 
interference, 


8S( i3) h; a(u , y, w), /3(w, y, w)) 
5a 

5S( (3) £>; a(«, y, w), /?(«, y, w)) 
_ 


= 0 , 


= 0 , 


(43.22) 
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hypersurface” in superspace, in order to be able to predict the evolution of this state 
function elsewhere in superspace. One suggested way out of this situation—it is at 
least an inconvenience, possibly a real difficulty—has been proposed by Leutwyler 
(1968): impose a natural boundary condition that reduces the number of independent 
solutions from the number characteristic of a second-order equation to the number 
characteristic of a first-order equation. Another way out is to formulate the dynamics 
quite differently, in the way proposed by Kuchar (see Chapter 21), in which the 
resulting equation is of first order in the variable analogous to time. 

The exploration of quantum geometrodynamics is simplified when one treats most 
of the degrees of freedom of the geometry as frozen out, by imposition of a high 
degree of symmetry. Then one is left with one, two. or three degrees of freedom 
(see Chapter 30, on mixmaster cosmology), or even infinitely many, and is led to 
manageable problems of quantum mechanics [Misner (1972a, 1973)]. 

§43.4. FLUCTUATIONS IN GEOMETRY 

Of all the remarkable developments of physics since World War II, none is more 
impressive than the prediction and verification of the effects of the vacuum fluctua¬ 
tions in the electromagnetic field on the motion of the electron in the hydrogen atom 
(Figure 43.3). That development made it impossible to overlook the effects of such 
fluctuations throughout all physics and, not least, in the geometry of spacetime itself. 



Figure 43.3. 

Symbolic representation of motion of electron in hydrogen atom as affected by fluctuations 
in electric field in vacuum (“vacuum” or “ground state” or “zero-point” fluctuations). The 
electric field associated with the fluctuation, E T (t) = fE x (a 2 )e~ iut du, adds to the static 
electric field provided by the nucleus itself. The additional field brings about in the most 
elementary approximation the displacement Ax = f(e/nto3 2 )E x (o3)e~ iu>t do>. The average 
vanishes but the root mean square <(4x) 2 > does not. In consequence the electron feels 
an effective atomic potential altered from the expected value V(x,y,z) by the amount 

AV(x,y, z) = j ((Ax) 2 ) V 2 V(x,y, z). 

The average of this perturbation over the unperturbed motion accounts for the major part 
of the observed Lamb-Retherford shift AE = (AV(x y y y z)) in the energy level. Con¬ 
versely, the observation of the expected shift makes the reality of the vacuum fluctuations 
inescapably evident. 
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From the zero-point fluctuations of a single oscillator to the fluctuations of the 

electromagnetic field to geometrodynamic fluctuations is a natural order of progres- 
sion. r & 

A harmonic oscillator in its ground state has a probability amplitude of 


Hx) = 


t\ 1/4 e -(ma/2f,)x 2 
■nti) 


(43.23) 



Fluctuations for oscillator and 
for electromagnetic field 


to be displaced by the distance * from its natural classical position of equilibrium. 

In this sense, it may be said to “resonate” or “fluctuate” between locations in space 
ranging over a region of extent r 






ele ^ troma 8 netlc field can be treated as an infinite collection of independent 
field oscillators, ’ with amplitudes £ lf .... When the Maxwell field is in its state 
of lowest energy, the probability amplitude-for the first oscillator to have amplitude 
and simubane ously the second oscillator to have amplitude the third $, and 
so on is the product of functions of the form (43.23), one for each oscillator. When 

the scale of amplitudes for each oscillator is suitably normalized, the resulting infinite 
product takes the form 


£ 2 ,...) - A exp [-(£j 2 + £ 2 2 + - •.)]. (43.25) 

This expression gives the probability amplitude rf, for a configuration B(x> y,z) of 
e magnetic field that is described by the Fourier coefficients L, L One can 
forgo any mention of these Fourier coefficients if one desires, however, and rewrite 
) xrectly in terms of the magnetic field configuration itself [Wheeler (1962)]: 

H B (x,y, z)) = % exp ( ~ f d % )• (43.26) 


One no longer speaks of “the” magnetic field, but instead of the probability of this, 
that, or the other configuration of the magnetic field, even under circumstances, as 
here, where the electromagnetic field is in its ground state. [See Kuchar (1970) for 
a similar expression for the “ground state” functional of the linearized gravitational 
field.] 6 

It is reasonable enough under these circumstances that the configuration of greatest 
probability is B{x,y, r) = 0. Consider for comparison a configuration where the 
magnetic field is again everywhere zero except in a region of dimension L. There 
et the field, subject as always to the condition div B = 0, be of the order of magni¬ 
tude AB. The probability amplitude for this configuration will be reduced relative 
to the nil configuration by a factor exp (-/). Here the quantity / in the exponent 
is of the order (AB)-L 4 /tic. Configurations for which / is large compared to 1 occur 
with negligible probability. Configurations for which I is small compared to 1 occur 
with practically the same probability as the nil configuration. In this sense, one can 
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say that the fluctuations in the magnetic field in a region of extension L are of the 
order of magnitude 


AB~~ 


(he) 1 ' 2 
L 2 ’ 


(43.27) 


In other words, the field “resonates” between one configuration and another with 
the range of configurations of significance given by (43.27). Moreover, the smaller 
is the region of space under consideration, the larger are the field magnitudes that 
occur with appreciable probability. 

Still another familiar way of speaking about electromagnetic field fluctuations gives 
additional insight relevant to geometrodynamics. One considers a measuring device 
responsive in comparable measure to the magnetic field at all points in a region 
of dimension L . One asks for the effect on this device of electromagnetic disturbances 
of various wavelengths. A disturbance of wavelength short compared to L will cause 
forces to act one way in some parts of the detector, and will give rise to nearly 
compensating forces in other parts of it. In contrast, a disturbance of a long wave¬ 
length A produces forces everywhere in the same direction, but of a magnitude too 
low to have much effect. Thus the field, estimated from the equation 


( energy of electromagnetic N 
wave of wavelength A in a 
domain of volume A 3 j 


/energy of one quantum 
\of wavelength A 


or 

B 2 X 3 _ f>£_ 
A 


or 




(he) 1 ' 2 

~x 2 ~ 


(43.28) 


Fluctuations in geometry 
dominate at the Planck scale 
of distances 


is very small if A is large compared to the domain size L. The biggest effect is caused 
by a disturbance of wavelength A comparable to L itself. This line of reasoning leads 
directly from (43.28) to the standard fluctuation formula (43.27). 

Similar considerations apply in geometrodynamics. Quantum fluctuations in the 
geometry are superposed on and coexist with the large-scale, slowly varying curvature 
predicted by classical deterministic general relativity. Thus, in a region of dimension 
L, where in a local Lorentz frame the normal values of the metric coefficieifts will 
be — 1, 1, 1, 1, there will occur fluctuations in these coefficients of the order 



(43.29) 


fluctuations in the first derivatives of the g ik s of the order 



L* 

U’ 


(43.30) 



§43.4. FLUCTUATIONS IN GEOMETRY 


1193 


and fluctuations in the curvature of space of the order 



L* 

L 3 ' 


(43.31) 


Here 

L* = Y 2 = 1.6 X 10 - 33 cm (43.32) 

is the so-called Planck length [Planck (1899)]. 

It is appropriate to look at orders of magnitude. The curvature of space within 
and near the earth, according to classical Einstein theory, is of the order 

R ~~ ^ j p ~ (0.7 x 10~ 28 cm/g)(5 g/cm 3 ) 

' ~4x 10-cnr*. <43 ' 33) 


This quantity has a very direct physical significance. It measures the “tide-producing 
component of the gravitational field” as sensed, for example, in a freely falling 
elevator or in a spaceship in free orbit around the earth. By comparison, the quantum 
fluctuations in the curvature of space are only 

AR — 10 -33 cm -2 , (43.34) 

even in a domain of observation as small as 1 cm in extent. Thus the quantum 
fluctuations in the geometry of space are completely negligible under everyday 
circumstances. 

Even in atomic and nuclear physics the fluctuations in the metric, 


A g- 


10 33 cm 
10~ 8 cm 


lO" 25 


and 

A 10" 33 cm 10 - 2 Q M 3 35 ) 

8 10“ 13 cm ’ 1 ’ 

are so small that it is completely in order to idealize the physics as taking place 
in a flat Lorentzian spacetime manifold. 

The quantum fluctuations in the geometry are nevertheless inescapable, if one 
is to believe the quantum principle and Einstein’s theory. They coexist with the 
geometrodynamic development predicted by classical general relativity. The fluctua¬ 
tions widen the narrow swathe cut through superspace by the classical history of 
the geometry. In other words, the geometry is not deterministic, even though it looks 
so at the everyday scale of observation. Instead, at a submicroscopic scale it “reson¬ 
ates” between one configuration and another and another. This terminology means 
no more and no less than the following: (1) Each configuration <3) £) has its own 
probability amplitude^ = ^( ( 3 ) £i). (2) These probability amplitudes have comparable 
magnitudes for a whole range of 3-geometries included within the limits (43.29) on 
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either side of the classical swathe through superspace. (3) This range of 3-geometries 
is far too variegated on the submicroscopic sale to fit into any one 4-geometry, or 
any one classical geometrodynamic history. (4) Only when one overlooks these 
small-scale fluctuations (~~10 -33 cm) and examines the larger-scale features of the 
3-geometries do they appear to fit into a single space-time manifold, such as comports 
with the classical field equations. 

These small-scale fluctuations tell one that something like gravitational collapse 
is taking place everywhere in space and all the time; that gravitational collapse is 
in effect perpetually being done and undone; that in addition to the gravitational 
collapse of the universe, and of a star, one has also to deal with a third and, because 
it is constantly being undone, most significant level of gravitational collapse at the 
Planck scale of distances. 


EXERCISES 


Exercise 43.1. THE ACTION PRINCIPLE FOR A FREE PARTICLE 
IN NONRELATIVISTIC MECHANICS 

Taking as action principle 1 = fLdt = extremum, with specified x', t' and x", t" at the two 
limits, and with L = \m{dx/dt ) 2 , find (1) the extremizing history x = x(t) and (2) the dynam¬ 
ical path length or action S(x", x', /') = ^extremum * n * ts dependence on the end points. 

Also (3) write down the Hamilton-Jacobi equation for this problem, and (4) verify that ^(x, r; 
x\ O satisfies this equation. Finally, imagining the Hamilton-Jacobi equation not to be 
known, (5) derive it from the already known properties of the function S itself. 

Exercise 43.2. THE ACTION FOR THE HARMONIC OSCILLATOR 

The kinetic energy is \m(dx/dt ) 2 and the potential energy is \mu 2 x 2 . Carry through the 
analysis of parts (1), (2), (3), (4) of the preceding exercise. Partial answer: 

S - (* 2 + x ' 2 ) cos — £) - 2xx' 

2 sin — /') 


Verify that dS/dx gives momentum and — dS/dt gives energy. 

Exercise 43.3. QUANTUM PROPAGATOR FOR HARMONIC OSCILLATOR 

Show that the probability amplitude for a simple harmonic oscillator to transit from x', t f 
to x", t" is 


<x", t x t ') 

(277/1 


277/7? sin w(/' 


-) 


1/2 v ™ irno)[(x" 2 + x' 2 ) cos w(r" - t') - 2x / 'x / ] 
X CXp ———- 


2/? sin c o(t" — /') 


and that it reduces for the case of a free particle to 


.. m V n irn{ x " - x'f 

' ’ ’ ’ } -\ 2 *iHt"-n) eXp 2h(i" - /') 


Note that one can derive all the harmonic-oscillator wave functions from the solution by 
use of the formula 


<*">O = 2 u n (x")u n \x') exp iE n (i' - t")/h. 
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Exercise 43.4. QUANTUM PROPAGATOR FOR FREE 
ELECTROMAGNETIC FIELD 

In flat spacetime, one is given on the spacelike hypersurface t = t' the divergence-free 
magnenc fle d B'(x,y, z) and on the spacelike hypersurface t = t" the diveSce-free 

oroMeTt ^ F ° Uner analyS ' S (reducin S this P^blem to the preceding 

problem) or otherwise, find the probability amplitude to transit from B' at t' to B" at t". 


Exercise 43.5. HAMILTON-JACOBI FORMULATION 
OF MAXWELL ELECTRODYNAMICS 

Regard the four components of the electromagnetic 4-potential as the primary quantities; 
split them into a space part A { and a scalar potential <p. (1) Derive from the action principle 
(in flat spacetime) 


/ = (1/8tt) f ( E 2 - B 2 ) d 4 x. 


by splitting off an appropriate divergence, an expression qualitatively similar in character 
to (43.7). (2) Show that the appropriate quantity to be fixed on the initial and final spacelike 
hypersurface is not really A t itself, but the magnetic field, defined by B — curl/l. (3) Derive 
the Hamilton-Jacobi equation for the dynamic phase or action S(B, S ) in its dependence 
on the choice of hypersurface 5, and the choice of magnetic field B on this hypersurface, 


SS2 ~ + 


(4^) 2 

my 

8tt 1 

\sa / 


Box rr ° n the left 1S Tom0na8a s “ bubble time ” derivative [Tomonaga (1946); see also 
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44 


BEYOND THE END OF TIME 


"tHeaven wheels above you 
Displaying to you her eternal glories 
And still your eyes are on the ground “ 

DANTE 

The world ",stands before us as a great, eternal riddle" 

EINSTEIN (1949a) 


V 


§44.1. GRAVITATIONAL COLLAPSE AS THE GREATEST 
CRISIS IN PHYSICS OF ALL TIME 


r 'n 

j 

j This chapter is entirely Track 
I 2. No previous Track-2 material 
is needed as preparation for it, 
but Chapter 43 will be helpful. 

v__; 


The universe starts with a big bang, expands to a maximum dimension, then recon¬ 
tracts and collapses: no more awe-inspiring prediction was ever made. It is prepos¬ 
terous. Einstein himself could not believe his own prediction. It took Hubble’s 
observations to force him and the scientific community to abandon the concept of 
a universe that endures from everlasting to everlasting. 

Later work of Tolman (1934a), Avez (I960), Geroch (1967), and Hawking and 
Penrose (1969) generalizes the conclusion. A model universe that is closed, that obeys 
Einstein’s geometrodynamic law, and that contains a nowhere negative density of 
mass-energy, inevitably develops a singularity. No one sees any escape frpm the 
density of mass-energy rising without limit. A computing machine calculating ahead 
step by step the dynamical evolution of the geometry comes to the point where it 
can not go on. Smoke, figuratively speaking, starts to pour out of the computer. 
Yet physics surely continues to go on if for no other reason than this: Physics is 
by definition that which does go on its eternal way despite all the shadowy changes 
in the surface appearance of reality. 


The Marchon lecture given by J. A. W. at the University of Newcastle-upon-Tyne, May 18, 1971, 
and the Nuffield lecture given at Cambridge University July 19, 1971, were based on the material 
presented in this chapter. 
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Some day a door will surely open and expose the glittering central mechanism 
of the world in its beauty and simplicity. Toward the arrival of that day, no develop¬ 
ment holds out more hope than the paradox of gravitational collapse. Why paradox? 
Because Einstein’s equation says “this is the end” and physics says “there is no end.” 
Why hope? Because among all paradigms for probing a puzzle, physics proffers none 
with more promise than a paradox. 

No previous period of physics brought a greater paradox than 1911 (Box 44.1). 
Rutherford had just been forced to conclude that matter is made up of localized 
positive and negative charges. Matter as so constituted should undergo electric 
collapse in ~10 17 sec, according to theory. Observation equally clearly proclaimed 
that matter is stable. No one took the paradox more seriously than Bohr. No one 
worked around and around the central mystery with more energy wherever work 
was possible. No one brought to bear a more judicious combination of daring and 
conseA'ativeness, nor a deeper feel for the harmony of physics. The direct opposite 


The paradox of collapse: 
physics stops, but physics 
must go on 


The 1911 crisis of electric 
collapse 



Box 44.1 COLLAPSE OF UNIVERSE PREDICTED BY CLASSICAL THEORY, COMPARED 
AND CONTRASTED WITH CLASSICALLY PREDICTED COLLAPSE OF ATOM 



System 


Atom {1911) 


Universe ( 1970's ) 


Dynamic entity System of electrons 


Geometry of space 


Nature of classi¬ 
cally predicted 
collapse 


Electron headed toward point-center of 
attraction is driven in a finite time 
to infinite energy 


Not only matter but space itself arrives 
in a finite proper time at a condition 
of infinite compaction 


One rejected “way Give up Coulomb law of force 
out” 


Give up Einstein’s field equation 


Another proposal “Accelerated charge need not radiate” 

for a “cheap way 
out” that has to 
be rejected 


“Matter cannot be compressed beyond a 
certain density by any pressure, how¬ 
ever high 


How this proposal Coulomb field of point-charge cannot re¬ 
violates principle adjust itself with infinite speed out 

of causality to indefinitely great distances to 

sudden changes in velocity of charge 


A major new consider¬ 
ation introduced 
by recognizing 
quantum principle 
as overarching or¬ 
ganizing principle 
of physics 


Uncertainty principle; binding too close 
to center of attraction makes zero-point 
kinetic energy outbalance potential 
energy; consequent existence of a 
lowest quantum state; can’t radiate 
because no lower state available to 
drop to 


Speed of sound cannot exceed speed of 
light; pressure cannot exceed density 
of mass-energy 


Uncertainty principle; propagation of 
representative wave packet in super¬ 
space does not lead deterministically 
to a singular configuration for the 
geometry of space; expect rather a 
probability distribution of outcomes, 
each outcome describing a universe 
with a different size, a different set of 
particle masses, a different number of 
particles, and a different length of 
time required for its expansion and 
recontraction. 
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of harmony, cacophony, is the impression that comes from sampling the literature 
of the ’teens on the structure of the atom. (1) Change the Coulomb law of force 
between electric charges? (2) Give up the principle that an accelerated charge must 
radiate? There was little inhibition against this and still wilder abandonings of the 
well-established. In contrast, Bohr held fast to these two principles. At the same 
time he insisted on the importance of a third principle, firmly established by Planck 
in quite another domain of physics, the quantum principle. With that key he opened 
the door to the world of the atom. 

Great as was the crisis of 1911, today gravitational collapse confronts physics with 
its greatest crisis ever. At issue is the fate, not of matter alone, but of the universe 
itself. The dynamics of collapse, or rather of its reverse, expansion, is evidenced not 
by theory alone, but also by observation; and not by one observation, but by 
observations many in number and carried out by astronomers of unsurpassed ability 
and integrity. Collapse, moreover, is not unique to the large-scale dynamics of the 
universe. A white dwarf star or a neutron star of more than critical mass is predicted 
to undergo gravitational collapse to a black hole (Chapters 32 and 33). Sufficiently 
many stars falling sufficiently close together at the center of the nucleus of a galaxy 
are expected to collapse to a black hole many powers of ten more massive than 
the sun. An active search is under way for evidence for a black hole in this Galaxy 
(Box 33.3). The process that makes a black hole is predicted to provide an experi¬ 
mental model for the gravitational collapse of the universe itself, with one difference. 
For collapse to a black hole, the observer has his choice whether (1) to observe from 
a safe distance, in which case his observations will reveal nothing of what goes on 
inside the horizon; or (2) to follow the falling matter on in, in which case he sees 
the final stages of the collapse, not only of the matter itself, but of the geometry 
surrounding the matter, to indefinitely high compaction, but only at the cost of his 
own early demise. For the gravitational collapse of a closed model universe, no such 
choice is available to the observer. His fate is sealed. So too is the fate of matter 
and elementary particles, driven up to indefinitely high densities. The stakes in the 
crisis of collapse are hard to match: the dynamics of the largest object, space, and 
the smallest object, an elementary particle—and how both began. 


§44.2. ASSESSMENT OF THE THEORY THAT 
PREDICTS COLLAPSE 

No one reflecting on the paradox of collapse (“collapse ends physics”; “dollapse 
cannot end physics”) can fail to ask, “What are the limits of validity of Einstein’s 
geometric theory of gravity?” A similar question posed itself in the crisis of 1911. 
The Coulomb law for the force acting between two charges had been tested at 
distances of meters and millimeters, but what warrant was there to believe that it 
holds down to the 10 -8 cm of atomic dimensions? Of course, in the end it proves 
to hold not only at the level of the atom, and at the 10 -13 cm level of the nucleus, 
but even down to 5 X K)- 15 cm [Barber, Gittelman, O’Neill, and Richter, and Bailey 
et al. (1968), as reviewed by Farley (1969) and Brodsky and Drell (1970)], a striking 
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example of what Wigner (1960) calls the “unreasonable effectiveness of mathematics 
in the natural sciences.” 

No theory more resembles Maxwell’s electrodynamics in its simplicity, beauty, 
and scope than Einstein’s geometrodynamics. Few principles in physics are more 
firmly established than those on which it rests: the local validity of special relativity 
(Chapters 2-7), the equivalence principle (Chapter 16), the conservation of momen¬ 
tum and energy (Chapters 5, 15 and 16), and the prevalence of second-order field 
equations throughout physics (Chapter 17). Those principles and the demand for 
no “extraneous fields” (e.g., Dicke’s scalar field) and “no prior geometry” (§17.6) 
lead to the conclusion that the geometry of spacetime must be Riemannian and 
the geometrodynamic law must be Einstein’s. 

To say that the geometry is Riemannian is to say that the interval between any 
two nearby events C and Z), anywhere in spacetime, stated in terms of the interval 
AB between two nearby fiducial events, at quite another point in spacetime, has 
a value CD/AB independent of the route of intercomparison (Chapter 13 and Box 
16.4). There are a thousand routes. By this hydraheaded prediction, Einstein’s theory 
thus exposes itself to destruction in a thousand ways (Box 16.4). 

Geometrodynamics lends itself to being disproven in other ways as well. The 
geometry has no option about the control it exerts on the dynamics of particles and 
fields (Chapter 20). The theory makes predictions about the equilibrium configura¬ 
tions and pulsations of compact stars (Chapters 23-26). It gives formulas (Chapters 
27-29) for the deceleration of the expansion of the universe, for the density of 
mass-energy, and for the magnifying power of the curvature of space, the tests of 
which are not far off. It predicts gravitational collapse, and the existence of black 
holes, and a wealth of physics associated with these objects (Chapters 31-34). It 
predicts gravitational waves (Chapters 35-37). In the appropriate approximation, 
it encompasses all the well-tested predictions of the Newtonian theory of gravity 
for the dynamics of the solar system, and predicts testable post-Newtonian corrections 
besides, including several already verified effects (Chapters 38-40). 

No inconsistency of principle has ever been found in Einstein’s geometric theory 
of gravity. No purported observational evidence against the theory has ever stood 
the test of time. No other acceptable account of physics of comparable simplicity 
and scope has ever been put forward. 

Continue this assessment of general relativity a little further before returning to 
the central issue, the limits of validity of the theory and their bearing on the issue 
of gravitational collapse. What has Einstein’s geometrodynamics contributed to the 
understanding of physics? 

First, it has dethroned spacetime from a post of preordained perfection high above 
the battles of matter and energy, and marked it as a new dynamic entity participating 
actively in this combat. 

Second, by tying energy and momentum to the curvature of spacetime, Einstein’s 
theory has recognized the law of conservation of momentum and energy as an 
automatic consequence of the geometric identity that the boundary of a boundary 
is zero (Chapters 15 and 17). 

Third, it has recognized gravitation as a manifestation of the curvature of the 


Battle-tested theory of 
gravitation 


New view of nature flowing 
from Einstein's 
geometrodynamics 
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geometry of spacetime rather than as something foreign and “physical” imbedded 
in spacetime. 

Fourth, general relativity has reinforced the view that “physics is local”; that the 
analysis of physics becomes simple when it connects quantities at a given event with 
quantities at immediately adjacent events. 

Fifth, obedient to the quantum principle, it recognizes that spacetime and time 
itself are ideas valid only at the classical level of approximation; that the proper 
arena for the Einstein dynamics of geometry is not spacetime, but superspace; and 
that this dynamics is described in accordance with the quantum principle by the 
propagation of a probability amplitude through superspace (Chapter 43). In conse¬ 
quence, the geometry of space is subject to quantum fluctuations in metric coefficients 
of the order 


^ ^ (Planck length, L* - (fiG/c 3 ) 1/2 = 1.6 X 10~ 33 cm) 

5 (linear extension of region under study) 

Sixth, standard Einstein geometrodynamics is partial as little to Euclidean topology 
Electric charge as lines of as to Euclidean geometry. A multiply connected topology provides a natural de- 

o° r space PPed m the topo '° 9y scription for electric charge as electric lines of force trapped in the topology of a 

multiply connected space (Figure 44.1). Any other description of electricity postulates 
a breakdown in Maxwell’s field equations for the vacuum at a site where charge 



Figure 44.1. 

Electric charge viewed as electric lines of force trapped in the topology of a multiply connected space 
[for the history of this concept see reference 36 of Wheeler (1968a)]. The wormhole or handle is envisaged 
as connecting two very different regions in the same space. One of the wormhole mouths, viewed by 
an observer with poor resolving power, appears to be the seat of an electric charge. Out of this region 
of 3-space he finds lines of force emerging over the whole 4?r solid angle. He may construct a boundary 
around this charge, determine the flux through this boundary, incorrectly apply the theorem of Gauss 
and “prove” that there is a charge “inside the boundary.” It isn’t a boundary. Someone caught within 
it—to speak figuratively—can go into that mouth of the wormhole, through the throat, out Ae other 
mouth, and return by way of the surrounding space to look at his “prison” from the outside. Lines 
of force nowhere end. Maxwell’s equations nowhere fail. Nowhere can one place a finger and say, 
“Here there is some charge.” This classical type of electric charge has no direct relation whatsoever 
to quantized electric charge. There is a freedom of choice about the flux through the wormhole, and 
a specificity about the connection between one charge and another, which is quite foreign to the charges 
of elementary particle physics. For ease of visualization the number of space dimensions in the above 
diagram has been reduced from three to two. The third dimension, measured off the surface, has no 
physical meaning—it only provides an extra dimension in which to imbed the surface for more convenient 
diagrammatic representation. [For more detail see Misner and Wheeler (1957), reprinted in Wheeler 
(1962)]. 
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is located, or postulates the existence of some foreign and “physical” electric jelly 
imbedded in space, or both. No one has ever found a way to describe electricity 
free of these unhappy features except to say that the quantum fluctuations in the 
geometry of space are so great at small distances that even the topology fluctuates, 
makes “wormholes,” and traps lines of force. These fluctuations have to be viewed, 
not as tied to particles, and endowed with the scale of distances associated with 
particle physics (~10 -13 cm) but as pervading all space (“foam-like structure of 
geometry”) and characterized by the Planck distance (~10“ 33 cm). Thus a third type 
of gravitational collapse forces itself on one’s attention, a collapse continually being 
done and being undone everywhere in space: surely a guide to the outcome of 
collapse at the level of a star and at the level of the universe (Box 44.2). 


r 

Box 44.2 THREE LEVELS OF GRAVITATIONAL COLLAPSE 


"\ 


1. Universe 

2. Black hole 

3. Fluctuations at the Planck scale of distances 

Recontraction and collapse of the universe is a kind of mirror image of the “big 
bang,” on which one already has so much evidence. 

Collapse of matter to form a black hole is most natural at two distinct levels: 
(a) collapse of the dense white-dwarf core of an individual star (when that core 
exceeds the critical mass, — 1 M 0 or 2Af 0 , at which a neutron star is no longer a 
possible stable end-point for collapse) and (b) coalescence one by one of the stars 
in a galactic nucleus to make a black hole of mass up to 10 6 M o or even 10 9 M Q . 

In either case, no feature of principle about matter 
falling into the black hole is more interesting than the 
option that the observer has (symbolized by the branch¬ 
ing arrow in the inset). He can go along with the infall¬ 
ing matter, in which case he sees the final stages of 
collapse, but only at the cost of his own demise. Or he 
can stay safely outside, in which case even after in¬ 
definitely long time he sees only the first part of the 
collapse, with the infalling matter creeping up more and 
more slowly to the horizon. 

In the final stages of the collapse of a closed model universe, all black holes present 
are caught up and driven together, amalgamating one by one. No one has any way 
to look at the event from safely outside; one is inevitably caught up in it oneself. 

Collapse at the Planck scale of distances is taking place everywhere and all the 
time in quantum fluctuations in the geometry and, one believes, the topology of 
space. In this sense, collapse is continually being done and undone, modeling the 
undoing of the collapse of the universe itself, summarized in the term, “the reproc¬ 
essing of the universe” (see text). 

___ 


Y . * 
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§44.3. VACUUM FLUCTUATIONS: THEIR PREVALENCE 
AND FINAL DOMINANCE 


Is matter built out of 
geometry? 


The richness of the physics 
of the vacuum 


If Einstein s theory thus throws light on the rest of pliysics, the rest of physics also 
throws light on geometrodynamics. No point is more central than this, that empty 
space is not empty. It is the seat of the most violent physics. The electromagnetic 
field fluctuates (Chapter 43). Virtual pairs of positive and negative electrons, in effect, 
are continually being created and annihilated, and likewise pairs of mu mesons, pairs 
of baryons, and pairs of other particles. All these fluctuations coexist with the 
quantum fluctuations in the geometry and topology of space. Are they additional- 
to those geometrodynamic zero-point disturbances, or are they, in some sense not 
now well-understood, mere manifestations of them? 

Put the question in other words. Recall Clifford, inspired by Riemann, speaking 
to the Cambridge Philosophical Society on February 21,1870, “On the Space Theory 
of Matter” [Clifford (1879), pp. 244 and 322; and (1882), p. 21], and saying, “I hold 
in fact (1) That small portions of space are in fact of a nature analogous to little 
hills on a surface which is on the average flat; namely, that the ordinary laws of 
geometry are not valid in them. (2) That this property of being curved or distorted 
is continually being passed on from one portion of space to another after the manner 
of a wave. (3) That this variation of the curvature of space is what really happens 
in that phenomenon which we call the motion of matter, whether ponderable or 
etherial. (4) That in the physical world nothing else takes place but this variation, 
subject (possibly) to the law of continuity.” Ask if there is a sense in which one 
can speak of a particle as constructed out of geometry. Or rephrase the question 
in updated language: “Is a particle a geometrodynamic exciton?” What else is there 
out of which to build a particle except geometry itself? And what else is there to 
give discreteness to such an object except the quantum principle? 

The Clifford-Einstein space theory of matter has not been forgotten in recent years. 
“In conclusion,” one of the authors wrote a decade ago [Wheeler (1962)], “the vision 
of Riemann, Clifford, and Einstein, of a purely geometric basis for physics, today 
has come to a higher state of development, and offers richer prospects—and presents 
deeper problems—than ever before. The quantum of action adds to this geometro¬ 
dynamics new features, of which the most striking is the presence of fluctuations 
of the wormhole type throughout all space. If there is any correspondence at all 
between this virtual foam-like structure and the physical vacuum as it has come 
to be known through quantum electrodynamics, then there seems to be no escape 
from identifying these wormholes with ‘undressed electrons.’ Completely different 
from these ‘undressed electrons,’ according to all available evidence, are the electrons 
and other particles of experimental physics. For these particles the geometrodynamic 
picture suggests the model of collective disturbances in a virtual foam-like vacuum, 
analogous to different kinds of phonons or excitons in a solid. 

“The enormous factor from nuclear densities ~10 14 g/cm 3 to the density of field 
fluctuation energy in the vacuum, ~10 94 g/cm 3 , argues that elementary particles 
represent a percentage-wise almost completely negligible change in the locally violent 
conditions that characterize the vacuum. [‘A particle (10 14 g/cm 3 ) means as little 
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to the physics of the vacuum (lO^g/cm 3 ) as a cloud (10- 6 g/cm 3 ) means to the 
physics of the sky (10 -3 g/cm 3 ).’] In other words, elementary particles do not form 
a really basic starting point for the description of nature. Instead, they represent 
a first-order correction to vacuum physics. That vacuum, that zero-order state of 
affairs, with its enormous densities of virtual photons and virtual positive-negative 
pairs and virtual wormholes, has to be described properly before one has a funda¬ 
mental starting point for a proper perturbation-theoretic analysis.” 

These conclusions about the energy density of the vacuum, its complicated 
topological character, and the richness of the physics which goes on in the vac¬ 
uum, stand in no evident contradiction with what quantum electrodynamics has to 
say about the vacuum. Instead the conclusions from the ‘small distance’ analysis 
(10“ 33 cm)—sketchy as it is—and from ‘larger distance’ analysis (10~ n cm) would 
seem to [be able] to reinforce each other in a most natural way. 

“The most evident shortcoming of the geometrodynamic model as it stands is this, 
that it fails to supply any completely natural place for spin \ in general and for the 
neutrino in particular.” 

Attempts to find a natural place for spin \ in Einstein’s standard geometrodynamics 
(Box 44.3) founder because there is no natural way for a change in connectivity to 
take place within the context of classical differential geometry. 

A uranium nucleus undergoing fission starts with one topology and nevertheless 
ends up with another topology. It makes this transition in a perfectly continuous 
way, classical differential geometry notwithstanding. 

There are reputed to be two kinds of lawyers. One tells the client what not to 
do. The other listens to what the client has to do and tells him how to do it. From 
the first lawyer, classical differential geometry, the client goes away disappointed, 
still searching for a natural way to describe quantum fluctuations in the connectivity 
of space. Only in this way can he hope to describe electric charge as lines of electric 
force trapped in the topology of space. Only in this way does he expect to be able 
to understand and analyze the final stages of gravitational collapse. Pondering his 
problems, he comes to the office of a second lawyer, with the name “Pregeometry” 
on the door. Full of hope, he knocks and enters. What is pregeometry to be and 
say? Bom of a combination of hope and need, of philosophy and physics and 
mathematics and logic, pregeometry will tell a story unfinished at this writing, but 
full of incidents of evolution so far as it goes. 


No place in 
geometrodynamics for 
change of topology; therefore 
turn to ''pregeometry" 


§44.4. NOT GEOMETRY, BUT PREGEOMETRY, AS THE 
MAGIC BUILDING MATERIAL 

An early survey (Box 44.4) asked whether geometry can be constructed with the 
help of the quantum principle out of more basic elements, that do not themselves 
have any specific dimensionality. 

The focus of attention in this 1964 discussion was “dimensionality without dimen¬ 
sionality.” However, the prime pressures to ponder pregeometry were and remain 


(continued on page 1206 ) 
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Box 44.3 THE DIFFICULTIES WITH ATTEMPTS TO FIND A NATURAL PLACE 

FOR SPIN i IN EINSTEIN'S STANDARD GEOMETRODYNAMICS 


It is impossible” [Wheeler (1962)] “to accept any 
description of elementary particles that does not 
have a place for spin What, then, has any purely 
geometric description to offer in explanation of 
spin \ in general? More particularly and more im¬ 
portantly, what place is there in quantum geomet- 
rodynamics for the neutrino—the only entity of 
half-integral spin that is a pure field in its own 
right, in the sense that it has zero rest mass and 
moves with the speed of light? No clear or satisfac¬ 
tory answer is known to this question today. Un¬ 
less and until an answer is forthcoming, pure geo- 
metrodynamics must be judged deficient as a basis 
for elementary particle physics .” 

A later publication [Wheeler (1968a)] takes up 
this issue again, noting that, “A new world opens 
out for analysis in quantum geometrodynamics. 
The central new concept is space resonating be¬ 
tween one foamlike structure and another. For this 
multiple-connectedness of space at submicroscopic 
distances no single feature of nature speaks more 
powerfully than electric charge. Yet at least as 
impressive as charge is the prevalence of spin \ 
throughout the world of elementary particles.” 

Repeating the statement that “It is impossible 
to accept any description of elementary particles 
that does not have a place for spin the article 
adds to the discussion a new note: “Happily, the 
concept of spin manifold has come to light, not 
least through the work of John Milnor [see Lich- 
nerowicz (1961a,b,c) and (1964); Milnor (1962), 
(1963), and (1965a,b); Hsiang and Anderson 
(1965); Anderson, Brown, and Peterson (1966a,b); 
and Penrose (1968a)]. This concept suggests a new 
and interesting interpretation of a spinor field 
within the context of the resonating microtopology 
of quantum geometrodynamics, as the nonclassical 
two-valuedness [Pauli’s standard term for spin; see, 
for example, Pauli (1947)] that attaches to the prob¬ 
ability amplitude for otherwise identical 3-geome¬ 
tries endowed with alternative ‘spin structures 
More specifically: “One does not classify the 
closed orientable 3-manifold of physics completely 



when one gives its topology, its differential struc¬ 
tures, and its metric. One must tell which spin 
structure it has.” [On a 3-geometry with the topol¬ 
ogy of a 3-sphere, one can lay down a continuous 
field of triads (a triad consisting of three orthonor¬ 
mal vectors). Any other continuous field of triads 
can be deformed into the first field by a continuous 
sequence of small readjustments. One says that the 
3-sphere admits only one “spin structure,” a po¬ 
tentially misleading standard word for what could 
just as well have been called a “triad structure.” 
In contrast, a 3-sphere with n handles or worm- 
holes admits 2 n “spin structures” (continuous fields 
of triads) inequivalent to one another under any 
continuous sequence of small readjustments what¬ 
soever, and distinguished from one another in any 
convenient way by n “descriptors” w v w 2 , . . . . , 
w k , .... w n .] It is natural in quantum geometro¬ 
dynamics to expect “separate probability ampli¬ 
tudes for a 3-geometry with descriptor w k = +1 
and for an otherwise identical 3-geometry with 
descriptor w k = — 1. Does this circumstance imply 
that quantum geometrodynamics supplies all the 
machinery one needs to describe fields of spin \ 
in general and the neutrino field in particular? . . . 
That is the only way that has ever turned up within 
the framework of Einstein’s general relativity and 
Planck’s quantum principle. Is this the right path? 
It is difficult to name any question more decisive 
than this in one’s assessment of ‘everything as 
geometry.’” 

Why not spell out these concepts, reduce them 
to practice, and compare them with what one 
knows about the behavior of fields of spin J? There 
is a central difficulty in this enterprise. It assume^ 
and demands on physical grounds that the topol¬ 
ogy of the 3-geometry shall be free to change from 
one connectivity to another. In contrast, classical 
differential geometry says, in effect, “Once one 
topology, always that topology.” Try a question 
like this, “When a new handle develops and the 
number of descriptors rises by one, what boundary 
condition in superspace connects the probability 




amplitude ip for 3-geometries of the original topol¬ 
ogy with the probability amplitudes ip + and ip_ for 
the two spin structures of the new topology?” 
Classical differential geometry not only gives one 
no help in answering this question; it even forbids 
one to ask it. In other words, one cannot even get 
the enterprise “on the road” for want of a natural 

_ .___ 

/ - 


mathematical way to describe the required change 
in topology. The idea is therefore abandoned here 
and now that 3-geometry is “the magic building 
material of the universe.” In contrast, pregeometry 
(see text), far from being endowed with any frozen 
topology, is to be viewed as not even possessing 
any dimensionality. 


j 


Box 44.4 "A BUCKET OF DUST''—AN EARLY ATTEMPT TO FORMULATE 
\ the CONCEPT OF PREGEOMETRY [Wheeler (1964a)] 


“What line of thought could ever be imagined as 
leading to four dimensions—or any dimensionality 
at all—out of more primitive considerations? In 
the case of atoms one derives the yellow color of 
the sodium D-lines by analyzing the quantum 
dynamics of a system, no part of which is ever 
endowed with anything remotely resembling the 
attribute of color. Likewise any derivation of the 
four-dimensionality of spacetime can hardly start 
with the idea of dimensionality.” 

“Recall the notion of a Borel set. Loosely speak¬ 
ing, a Borel set is a collection of points (“bucket 
of dust”) which have not yet been assembled into 
a manifold of any particular dimensionality. . . . 
Recalling the universal sway of the quantum prin¬ 
ciple, one can imagine probability amplitudes for 
the points in a Borel set to be assembled into 
points with this, that, and the other dimensional¬ 
ity. . . . More conditions have to be imposed on 
a given number of points—as to which has which 
for a nearest neighbor—when the points are put 
together in a five-dimensional array than when 
these same points are arranged in a two-dimen¬ 
sional pattern. Thus one can think of each dimen¬ 
sionality as having a much higher statistical weight 
than the next higher dimensionality. On the other 
hand, for manifolds with one, two, and three di¬ 
mensions, the geometry is too rudimentary—one 
can suppose—to give anything interesting. Thus 
Einstein’s field equations, applied to a manifold 
of dimensionality so low, demand flat space; only 
when the dimensionality is as high as four do 
really interesting possibilities arise. Can four. 


therefore, be considered to be the unique dimen¬ 
sionality which is at the same time high enough 
to give any real physics and yet low enough to 
have great statistical weight? 

“It is too much to imagine that one has yet made 
enough mistakes in this domain of thought to 
explore such ideas with any degree of good judg¬ 
ment.” 

Consider a handle on the geometry. Let it thin 
halfway along its length to a point. In other words, 
let the handle dissolve into two bent prongs that 
touch at a point. Let these prongs separate and 
shorten. In this process two points part company 
that were once immediate neighbors. “However 
sudden the change is in classical theory, in quan¬ 
tum theory there is a probability amplitude func¬ 
tion which falls off in the classically forbidden 
domain. In other words, there is some residual 
connection between points which are ostensibly 
very far away (travel from one ‘tip’ down one 
prong, then through the larger space to which 
these prongs are attached, and then up the other 
prong to the other tip). But there is nothing dis¬ 
tinctive in principle about the two points that have 
happened to come into discussion. Thus it would 
seem that there must be a connection . . . between 
every point and every other point. Under these 
conditions the concept of nearest neighbor would 
appear no longer to make sense. Thus the tool 
disappears with the help of which one might oth¬ 
erwise try to speak [unambiguously about dimen¬ 
sionality.” 
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Sakharov: gravitation is the 
"metric elasticity of space" 


The stratification of space 


Comparison with everyday 
elasticity 


two features of nature, spin J and charge, that speak out powerfully from every part 
of elementary particle physics. 

A fresh perspective on pregeometry comes from a fresh assessment of general 
relativity. “Geometrodynamics is neither as important or as simple as it looks. Do 
not make it the point of departure in searching for underlying simplicity. Look 
deeper, at elementary particle physics.” This is the tenor of interesting new consider¬ 
ations put forward by Sakharov [the Sakharov] (1967) and summarized under the 
heading, “Gravitation as the metric elasticity of space,” in Box 17.2. In brief, as 
elasticity is to atomic physics, so—in Sakharov’s view—gravitation is to elementary 
particle physics. The energy of an elastic deformation is nothing but energy put into 
the bonds between atom and atom by the deformation. The energy that it takes 
to curve space is nothing but perturbation in the vacuum energy of fields plus 
particles brought about by that curvature, according to Sakharov. The energy 
required for the deformation is governed in the one case by two elastic constants 
and in the other case by one elastic constant (the Newtonian constant of gravity) 
but in both cases, he reasons, the constants arise by combination of a multitude 
of complicated individual effects, not by a brave clean stroke on an empty slate. 

One gives all the more favorable reception to Sakhorov’s view of gravity because 
one knows today, as one did not in 1915, how opulent in physics the vacuum is. 
In Einstein’s day one had come in a single decade from the ideal God-given Lorentz 
perfection of flat spacetime to curved spacetime. It took courage to assign even one 
physical constant to that world of geometry that had always stood so far above 
physics. The vacuum looked for long as innocent of structure as a sheet of glass 
emerging from a rolling mill. With the discovery of the positive electron [Anderson 
(1933)], one came to recognize a little of the life that heat can unfreeze in “empty” 
space. Each new particle and radiation that was discovered brought a new accretion 
to the recognized richness of the vacuum. Macadam looks smooth, but a bulldozer 
has only to cut a single furrow through the roadway to disclose all the complications 
beneath the surface. 

Think of a particle as built out of the geometry of space; think of a particle as 
a “geometrodynamic exciton”? No model—it would seem to follow from Sakharov’s 
assessment—could be less in harmony with nature, except to think of an atom as 
built out of elasticity! Elasticity did not explain atoms. Atoms explained elasticity. 
If, likewise, particles fix the constant in Einstein’s geometrodynamic law (Sakharov), 
must it not be unreasonable to think of the geometrodynamic law as explaining 
particles? 

Carry the comparison between geometry and elasticity one stage deeper (Fig$44.2). 
In a mixed solid there are hundreds of distinct bonds, all of which contribute to 
the elastic constants; some of them arise from Van der Waal’s forces, some from 
ionic coupling, some from homopolar linkage; they have the greatest variety of 
-strengths; but all have their origin in something so fantastically simple as a system 
of positively and negatively charged masses moving in accordance with the laws 
of quantum mechanics. In no way was it required or right to meet each complication 
of the chemistry and physics of a myriad of bonds with a corresponding complication 
of principle. By going to a level of analysis deeper than bond strengths, one had 
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Figure 44.2. 

Elasticity and geometrodynamics, as viewed at three levels of analysis. A hundred years 
of the study of elasticity did not reveal the existence of molecules, and a hundred years 
of the study of molecular chemistry did not reveal Schrodinger’s equation. Revelation 
moved upward in the diagram, not downward. 


emerged into a world of light, where nothing but simplicity and unity was to be 
seen. 

Compare with geometry. The vacuum is animated with the zero-point activity 
of distinct fields and scores of distinct particles, all of which, according to Sakharov, 
contribute to the Newtonian G, the “elastic constant of the metric.” Some interact 
via weak forces, some by way of electromagnetic forces, and some through strong 
forces. These interactions have the greatest variety of strengths. But must not all 
these particles and interactions have their origin in something fantastically simple? 
And must not this something, this “pregeometry,” be as far removed from geometry 
as the quantum mechanics of electrons is far removed from elasticity? 

If one once thought of general relativity as a guide to the discovery of pregeometry, 
nothing might seem more dismaying than this comparison with an older realm of 
physics. No one would dream of studying the laws of elasticity to uncover the 
principles of quantum mechanics. Neither would anyone investigate the work-hard¬ 
ening of a metal to learn about atomic physics. The order of understanding ran not 

Work-hardening (1 cm) —dislocations (10~ 4 cm) —atoms (10“ 8 cm), 
but the direct opposite, 

Atoms (10 _8 cm) —dislocations (10 _4 cm) —*- work-hardening (1 cm) 
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One had to know about atoms to conceive of dislocations, and had to know about 
dislocations to understand work-hardening. Is it not likewise hopeless to go from 
the “elasticity of geometry” to an understanding of particle physics, and from particle 
physics to the uncovering of pregeometry? Must not the order of progress again 
be the direct opposite? And is not the source of any dismay the apparent loss of 
guidance that one experiences in giving up geometrodynamics—and not only geo- 
metrodynamics, but geometry itself—as a crutch to lean on as one hobbles forward? 
Yet there is so much chance that this view of nature is right that one must take 
it seriously and explore its consequences. Never more than today does one have 
the incentive to explore pregeometry. 


Search for the central 
principle of pregeometry 


§44.5. PREGEOMETRY AS THE CALCULUS OF PROPOSITIONS 

Paper in white the floor of the room, and rule it off in one-foot squares. Down on 
one s hands and knees, write in the first square a set of equations conceived as able 
to govern the physics of the universe. Think more overnight. Next day put a better 
set of equations into square two. Invite one’s most respected colleagues to contribute 
to other squares. At the end of these labors, one has worked oneself out into the 
door way. Stand up, look back on all those equations, some perhaps more hopeful 
than others, raise one’s finger commandingly, and give the order “Fly!” Not one 
of those equations will put on wings, take off, or fly. Yet the universe “flies.” 

Some principle uniquely right and uniquely simple must, when one knows it, be 
also so compelling that it is clear the universe is built, and must be built, in such 
and such a way, and that it could not possibly be otherwise. But how can one discover 
that principle? If it was hopeless to learn atomic physics by studying work-hardening 
and dislocations, it may be equally hopeless to learn the basic operating principle 
of the universe, call it pregeometry or call it what one will, by any amount of work 
in general relativity and particle physics. 

Thomas Mann (1937), in his essay on Freud, utters what Niels Bohr would surely 
have called a great truth (“A great truth is a truth whose opposite is also a great 
truth”) when he says, “Science never makes an advance until philosophy authorizes 
and encourages it to do so.” If the equivalence principle (Chapter 16) and Mach’s 
principle (§21.9) were the philosophical godfathers of general relativity, it is also 
true that what those principles do mean, and ought to mean, only becomes clear 
by study and restudy of Einstein’s theory itself. Therefore it would seem reasonable 
to expect the primary guidance in the search for pregeometry to come from a 
principle both philosophical and powerful, but one also perhaps not destined to be 
wholly clear in its contents or its implications until some later day. 

Among all the principles that one can name out of the world of science, it is difficult 
to think of one more compelling than simplicity; and among all the simplicities of 
dynamics and life and movement, none is starker [Werner (1969)] than the binary 
choice yes-no or true-false. It in no way proves that this choice for a starting principle 
is correct, but it at least gives one some comfort in the choice, that Pauli’s “nonclassi- 
cal two-valuedness” or “spin” so dominates the world of particle physics. 



§44.6. THE REPROCESSING OF THE UNIVERSE 


1209 


It is one thing to have a start, a tentative construction of pregeometry; but how 
does one go on? How not to go on is illustrated by Figure 44.3. The “sewing machine” 
builds objects of one or another definite dimensionality, or of mixed dimensionalities, 
according to the instructions that it receives on the input tape in yes-no binary code. 
Some of the difficulties of building up structure on the binary element according 
to this model, or any one of a dozen other models, stand out at once. (1) Why 
N = 10,000 building units? Why not a different TV? And if one feeds in one such 
arbitrary number at the start, why not fix more features “by hand?” No natural 
stopping point is evident, nor any principle that would fix such a stopping point. 
Such arbitrariness contradicts the principle of simplicity and rules out the model. 

(2) Quantum mechanics is added from outside, not generated from inside (from the 
model itself). On this point too the principle of simplicity speaks against the model. 

(3) The passage from pregeometry to geometry is made in a too-literal-minded way, 
with no appreciation of the need for particles and fields to appear along the way. 
The model, in the words used by Bohr on another occasion, is “crazy, but not crazy 
enough to be right.” 

Noting these difficulties, and fruitlessly trying model after model of pregeometry 
to see if it might be free of them, one suddenly realizes that a machinery for the 
combination of yes-no or true-false elements does not have to be invented. It already 
exists. What else can pregeometry be, one asks oneself, than the calculus of proposi¬ 
tions? (Box 44.5.) 


§44.6. THE BLACK BOX: THE REPROCESSING 
OF THE UNIVERSE 

No amount of searching has ever disclosed a “cheap way” out of gravitational 
collapse, any more than earlier it revealed a cheap way out of the collapse of the 
atom. Physicists in that earlier crisis found themselves in the end confronted with 
a revolutionary pistol, “Understand nothing—or accept the quantum principle.” 
Today’s crisis can hardly force a lesser revolution. One sees no alternative except 
to say that geometry fails and pregeometry has to take its place to ferry physics 
through the final stages of gravitational collapse and on into what happens next. 
No guide is evident on this uncharted way except the principle of simplicity, applied 
to drastic lengths. 

Whether the whole universe is squeezed down to the Planck dimension, or more 
or less, before reexpansion can begin and dynamics can return to normal, may be 
irrelevant for some of the questions one wants to consider. Physics has long used 
the “black box” to symbolize situations where one wishes to concentrate on what 
goes in and what goes out, disregarding what takes place in between. 

At the beginning of the crisis of electric collapse one conceived of the electron 
as headed on a deterministic path toward a point-center of attraction, and unhappily 
destined to arrive at a condition of infinite kinetic energy in a finite time. After the 
advent of quantum mechanics, one learned to summarize the interaction between 

(continued on page 1213) 


A first try at a pregeometry 
built on the principle of 
binary choice 


A more reasonable picture: 
pregeometry is the calculus 
of propositions 


The role of the black box in 
physics 



Figure 44.3. 

Ten thousand rings , or an example of a way to think of the connection between pregeometry and 
geometry, wrong because it is too literal-minded, and for other reasons spelled out in the text. The vizier 
[story by Wheeler, as alluded to by Kilmister (1971)*) speaks: “Take N= 10,000 brass rings. Take an 
automatic fastening device that will cut open a ring, loop it through another ring, and resolder the joint. 
Pour the brass rings into the hopper that feeds this machine. Take a strip of instruction paper that is 
long enough to contain N(N — l)/2 binary digits. Look at the instruction in the (/&)-th location on this 
instruction tape (j\k = 1, 2,..., N; j < k). When the binary digit at that location is 0, it is a signal to 
leave they-th ring disconnected from the *-th ring. When it is 1, it is an instruction to connect that 
particular pair of rings. Thread the tape into the machine and press the start button. The clatter begins. 
Out comes a chain of rings 10,000 links long. It falls on the table and the machine stops. Pour in another 
10,000 rings, feed in a new instruction tape, and push the button again. This time it is not a one-dimen¬ 
sional structure that emerges, but a two-dimensional one: a Crusader’s coat of mail, complete with neck 
opening and sleeves. Take still another tape from the library of tapes and repeat. Onto the table thuds 
a smaller version of the suit of mail, this time filled out internally with a solid network of rings, a 
three-dimensional structure. Now forego the library and make one’s own instruction tape, a random 
string of 0 s and 1 s. Guided by it, the fastener produces a “Christmas tree ornament,” a collection of 
segments of one-dimensional chain, two-dimensional surfaces, and three-, four-, five-, and higher-di¬ 
mensional entities, some joined together, some free-floating. Now turn from a structure deterministically 
fixed by a tape to a probability amplitude, a complex number, 

*(ta P e) = .• **-!,*) K = 0, I), (1) 

defined over the entire range of possibilities for structures built of 10,000 rings. Let these probability 
amplitudes not be assigned randomly. Instead, couple together amplitudes, for structures that differ from 
each other by the breaking of a single ring, by linear formulas that treat all rings on the same Footing. 
The separate ^’s, no longer entirely independent, will still give non-zero probability amplitudes for 
Christmas tree ornaments.” Of greater immediate interest than these “unruly” parts of the structures 
are the following questions about the smoother parts: (I) In what kinds of structures is the bulk of 
the probability concentrated? (2) What is the dominant dimensionality of these structures in an appropri¬ 
ate correspondence principle limit? (3) In this semiclassical limit, what is the form taken by the dynamic 
law of evolution of the geometry?” No principle more clearly rules out this model for pregeometry than 
the principle of simplicity (see text). 


•Wheelers story about the vizier and whai the vizier had to say about superspace was told at the May 18, 1970, 
Gwatt Seminar on lhe Bearings of Topology upon General Relativity. Kilmister’s (1971) published article alludes to 
the unpublished siory, but does not actually contain it. 
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^ Box 44.5 "PREGEOMETRY AS THE CALCULUS OF PROPOSITIONS" 


A 


A sample proposition taken out of a standard text on logic selected almost at random 
reads [Kneebone (1963), p. 40] 


[X^((X^ X)^ Y)] & (X —Z) eq (X V YV Z) & 
(X V Y V Z) & (X V Y V Z) & (X V Y V Z). 


The symbols have the following meaning: 


i 


A, Not A; 

A V B, A or B or both (“A vel £”); 

A & B 9 A and B ; 

A — B , A implies B (“if A, then £”); 

A *—*■ B , B is equivalent to A (“£ if and only if A ”). 


Propositional formula 9 is said to be equivalent (“eq”) to propositional formula 
P if and only if is a tautology. The letters A , B , etc., serve as connectors 

to “wire together” one proposition with another. Proceeding in this way, one can 
construct propositions of indefinitely great length. 

A switching circuit [see, for example, Shannon (1938) or Hohn (1966)] is isomor¬ 
phic to a proposition. 

Compare a short proposition or an elementary switching circuit to a molecular 
collision. No idea seemed more preposterous than that of Daniel Bernoulli (1733), 
that heat is a manifestation of molecular collisions. Moreover, a three-body encounter 
is difficult to treat, a four-body collision is more difficult, and a five- or more molecule 
system is essentially intractable. Nevertheless, mechanics acquires new elements of 
simplicity in the limit in which the number of molecules is very great and in which 
one can use the concept of density in phase space. Out of statistical mechanics in 
this limit come such concepts as temperature and entropy. When the temperature 
is well-defined, the energy of the system is not a well-defined idea; and when the 
energy is well-defined, the temperature is not. This complementarity is built ines¬ 
capably into the principles of the subject. Thrust the finger into the flame of a match 
and experience a sensation like nothing else on heaven or earth; yet what happens 
is all a consequence of molecular collisions, early critics notwithstanding. 

Any individual proposition is difficult for the mind to apprehend when it is long; 
and still more difficult to grasp is the content of a cluster of propositions. Neverthe¬ 
less, make a statistical analysis of the calculus of propositions in the limit where 
the number of propositions is great and most of them are long. Ask if parameters 
force themselves on one’s attention in this analysis (1) analogous in some small 
measure to the temperature and entropy of statistical mechanics but (2) so much 

j 
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Box 44.5 (continued) 

more numerous, and everyday dynamic in character, that they reproduce the contin¬ 
uum of everyday physics. 

Nothing could seem so preposterous at first glance as the thought that nature is 
built on a foundation as ethereal as the calculus of propositions. Yet, beyond the 
push to look in this direction provided by the principle of simplicity, there are two 
pulls. First, bare-bones quantum mechanics lends itself in a marvelously natural 
way to formulation in the language of the calculus of propositions, as witnesses not 
least the book of Jauch (1968). If the quantum principle were not in this way already 
automatically contained in one’s proposed model for pregeometry, and if in contrast 
it had to be introduced from outside, by that very token one would conclude that 
the model violated the principle of simplicity, and would have to reject it. Second, 
the pursuit of reality seems always to take one away from reality. Who would have 
imagined describing something so much a part of the here and now as gravitation 
in terms of curvature of the geometry of spacetime? And when later this geometry 
came to be recognized as dynamic, who would have dreamed that geometrodynamics 
unfolds in an arena so ethereal as superspace? Little astonishment there should be, 
therefore, if the description of nature carries one in the end to logic, the ethereal 
eyrie at the center of mathematics. If, as one believes, all mathematics reduces to 
the mathematics of logic, and all physics reduces to mathematics, what alternative 
is there but for all physics to reduce to the mathematics of logic? Logic is the only 
branch of mathematics that can “think about itself.” 

“An issue of logic having nothing to do with physics” was the assessment by many 
of a controversy of old about the axiom, “parallel lines never meet.” Does it follow 
from the other axioms of Euclidean geometry or is it independent? “Independent,” 
Bolyai and Lobachevsky proved. With this and the work of Gauss as a start, Riemann 
went on to create Riemannian geometry. Study nature, not Euclid, to find out about 
geometry, he advised; and Einstein went on to take that advice and to make geometry 
a part of physics. 

“An issue of logic having nothing to do with physics” is one’s natural first assess¬ 
ment of the startling limitations on logic discovered by Godel (1931), Cohen (1966), 
and others [for a review, see, for example, Kac and Ulam (1968)]. The exact opposite 
must be one’s assessment if the real pregeometry of the real physical world indeed 
turns out to be identical with the calculus of propositions. 

“Physics as manifestation of logic” or “pregeometry as the calculus of proposition^” 
is as yet [Wheeler (1971a)] not an idea, but an idea for an idea. It is put forward 
here only to make it a little clearer what it means to suggest that the order of progress 
may not be 

physics —pregeometry 


but 


pregeometry —*- physics. 
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Figure 44.4. 

The “black-box model” applied (1) to the scattering of an electron by a center of attraction 
and (2) to the collapse of the universe itself. The deterministic electron world line of classical 
theory is replaced in quantum theory by a probability amplitude, the wave crests of which 
are illustrated schematically in the diagram. The catastrophe of classical theory is replaced 
in quantum theory by a probability distribution of outputs. The same diagram illustrates 
the “black-box account” of gravitational collapse mentioned in the text. The arena of the 
diagram is no longer spacetime, but superspace. The incident arrow marks no longer a 
classical world line of an electron through spacetime, but a classical “leaf of history of 
geometry” slicing through superspace (Chapter 43). The wave crests symbolize no longer 
the electron wave function propagating through spacetime, but the geometrodynamic wave 
function propagating through superspace. The cross-hatched region is no longer the region 
where the one-body potential goes to infinity, but the region of gravitational collapse where 
the curvature of space goes to infinity. The outgoing waves describe no longer alternative 
directions for the new course of the scattered electron, but the beginnings of alternative 
new histories for the universe itself after collapse and “reprocessing” end the present cycle. 


center of attraction and electron in a “black box:” fire in a wave-train of electrons 
traveling in one direction, and get electrons coming out in this, that, and the other 
direction with this, that, and the other well-determined probability amplitude (Figure 
44.4). Moreover, to predict these probability amplitudes quantitatively and correctly, 
it was enough to translate the Hamiltonian of classical theory into the language of 
wave mechanics and solve the resulting wave equation, the key to the “black box.” 

A similar “black box” view of gravitational collapse leads one to expect a “proba- Probability distribution of 
bility distribution of outcomes.” Here, however, one outcome is distinguished from outcome s ©f collapse 
another, one must anticipate, not by a single parameter, such as the angle of scatter¬ 
ing of the electron, but by many. They govern, one foresees, such quantities as the 
size of the system at its maximum of expansion, the time from the start of this new 
cycle to the moment it ends in collapse, the number of particles present, and a 
thousand other features. The “probabilities” of these outcomes will be governed by 
a dynamic law, analogous to (1) the Schrodinger wave equation for the electron, 
or, to cite another black box problem, (2) the Maxwell equations that couple together, 
at a wave-guide junction, electromagnetic waves running in otherwise separate wave 
guides. However, it is hardly reasonable to expect the necessary dynamic law to 
spring forth as soon as one translates the Hamilton-Jacobi equation of general 
relativity (Chapter 43) into a Schrodinger equation, simply because geometrody- 
namics, in both its classical and its quantum version, is built on standard differential 
geometry. That standard geometry leaves no room for any of those quantum fluc¬ 
tuations in connectivity that seem inescapable at small distances and therefore also 
inescapable in the final stages of gravitational collapse. Not geometry, but pregeom¬ 
etry, must fill the black box of gravitational collapse. 
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"Reprocessing" the universe 


All conservation laws 
transcended in the collapse 
of the universe 


Little as one knows the internal machinery of the black box, one sees no escape 
from this picture of what goes on: the universe transforms, or transmutes, or transits, 
or is reprocessed probabilistically from one cycle of history to another in the era 
of collapse. 

However straightforwardly and inescapably this picture of the reprocessing of the 
universe would seem to follow from the leading features of general relativity and 
the quantum principle, the two overarching principles of twentieth-century physics, 
it is nevertheless fantastic to contemplate. How can the dynamics of a system so 
incredibly gigantic be switched, and switched at the whim of probability, from one 
cycle that has lasted 10 11 years to one that will last only 10 6 years? At first, only 
the circumstance that the system gets squeezed down in the course of this dynamics 
to incredibly small distances reconciles one to a transformation otherwise so unbe¬ 
lievable. Then one looks at the upended strata of a mountain slope, or a bird not 
seen before, and marvels that the whole universe is incredible: 

mutation of a species, 

metamorphosis of a rock, 

chemical transformation, 

spontaneous transformation of a nucleus, 

radioactive decay of a particle, 

reprocessing of the universe itself. 

If it cast a new light on geology to know that rocks can be raised and lowered 
thousands of meters and hundreds of degrees, what does it mean for physics to think 
of the universe as being from time to time “squeezed through a knothole,” drastically 
“reprocessed,” and started out on a fresh dynamic cycle? Three considerations above 
all press themselves on one’s attention, prefigured in these compressed phrases: 

destruction of all constants of motion in collapse; 
particles, and the physical “constants” themselves, as the 
“frozen-in part of the meteorology of collapse;” 

“the biological selection of physical constants.” 

The gravitational collapse of a star, or a collection of stars, to a black hole 
extinguishes all details of the system (see Chapters 32 and 33) except mass and charge 
and angular momentum. Whether made of matter or antimatter or radiation, whether 
endowed with much entropy or little entropy, whether in smooth motion or^haotic 
turbulence, the collapsing system ends up as seen from outside, according to all 
indications, in the same standard state. The laws of conservation of baryon number 
and lepton number are transcended [Chapter 33; also Wheeler (1971b)]. No known 
means whatsoever will distinguish between black holes of the most different prove¬ 
nance if only they have the same mass, charge, and angular momentum. But for 
a closed universe, even these constants vanish from the scene. Total charge is 
automatically zero because lines of force have nowhere to end except upon charge. 
Total mass and total angular momentum have absolutely no definable meaning 
whatsoever for a closed universe. This conclusion follows not least because there 
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is no asymptotically flat space outside where one can put a test particle into Keplerian 
orbit to determine period and precession. 

Of all principles of physics, the laws of conservation of charge, lepton number, 
baryon number, mass, and angular momentum are among the most firmly estab¬ 
lished. Yet with gravitational collapse the content of these conservation laws also 
collapses. The established is disestablished. No determinant of motion does one see 
left that could continue unchanged in value from cycle to cycle of the universe. 
Moreover, if particles are dynamic in construction, and if the spectrum of particle 
masses is therefore dynamic in origin, no option would seem left except to conclude 
that the mass spectrum is itself reprocessed at the time when “the universe is 
squeezed through a knot hole.” A molecule in this piece of paper is a “fossil” from 
photochemical synthesis in a tree a few years ago. A nucleus of the oxygen in this 
air is a fossil from thermonuclear combustion at a much higher temperature in a 
star a few 10 9 years ago. What else can a particle be but a fossil from the most 
violent event of all, gravitational collapse? 

That one geological stratum has one many-miles long slope, with marvelous 
linearity of structure, and another stratum has another slope, is either an everyday 
triteness, taken as for granted by every passerby, or a miracle, until one understands 
the mechanism. That an electron here has the same mass as an electron there is 
also a triviality or a miracle. It is a triviality in quantum electrodynamics because 
it is assumed rather than derived. However, it is a miracle on any view that regards 
the universe as being from time to time “reprocessed.” How can electrons at different 
times and places in the present cycle of the universe have the same mass if the 
spectrum of particle masses differs between one cycle of the universe and another? 

Inspect the interior of a particle of one type, and magnify it up enormously, and 
in that interior see one view of the whole universe [compare the concept of monad 
of Leibniz (1714), “The monads have no window through which anything can enter 
or depart”]; and do likewise for another particle of the same type. Are particles 
of the same pattern identical in any one cycle of the universe because they give 
identically patterned views of the same universe? No acceptable explanation for the 
miraculous identity of particles of the same type has ever been put forward That 
identity must be regarded, not as a triviality, but as a central mystery of physics. 

Not the spectrum of particle masses alone, but the physical “constants” themselves, 
would seem most reasonably regarded as reprocessed from one cycle to another 
Reprocessed relative to what? Relative, for example, to the Planck system of units’ 

L* = (fiG/c 3 )"* = 1.6 x 10 -33 cm, 

T* = (fiG/c 5 ) 1/z = 5.4 X 10 -44 sec, 

M* = (hc/Gy'z = 2.2 x 10-5 g, 

the only system of units, Planck (1899) pointed out, free, like black-body radiation 
itself, of all complications of solid-state physics, molecular binding, atomic constitu¬ 
tion. and elementary particle structure, and drawing for its background only on the 
simplest and most uni versal principles of physics, the laws of gravitation and black- 
body radiation. Relative to the Planck units, every constant in every other part of 
physics is expressed as a pure number. 


Three hierarchies of fossils 
molecules, nuclei, particles 


Reason for identity in mass 
of particles of the same 
species? 


Reprocessing of physical 
constants 
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Values of physical constants 
as related to the possibilities 
for life 


No pure numbers in physics are more impressive than hc/e 2 = 137.0360 and the 
so-called “big numbers” [Eddington (1931, 1936, 1946); Dirac (1937, 1938); Jordan 
(1955, 1959); Dicke (1959b, 1961, 1964b); Hayakawa (1965a,b); Carter (1968b)]: 


— 10 s0 particles in the universe,* 


-lO 40 - 


10 28 cm 
10~ 12 cm 


( radius of universe at\* 
maximum expansion/ 

( “size” of an elementary\ ’ 
particle / 


- 10 40 


GmM 
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Xparticle 

(Planck length) 
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/number of photonsX 
\in universe / 

/number of baryonsX * 
\in universe / 


Some understanding of the relationships between these numbers has been won 
[Carter (1968b)]. Never has any explanation appeared for their enormous magnitude, 
nor will there ever, if the view is correct that reprocessing the universe reprocesses 
also the physical constants. These constants on that view are not part of the laws 
of physics. They are part of the initial-value data. Such numbers are freshly given 
for each fresh cycle of expansion of the universe. To look for a physical explanation 
for the “big numbers” would thus seem to be looking for the right answer to the 
wrong question. 

In the week between one storm and the next, most features of the weather are 
ever-changing, but some special patterns of the wind last the week. If the term 
“frozen features of the meteorology” is appropriate for them, much more so would 
it seem appropriate for the big numbers, the physical constants and the spectrum 
of particle masses in the cycle between one reprocessing of the universe and another. 

A per cent or so change one way in one of the “constants,” hc/e 2 , will cause all 
stars to be red stars; and a comparable change the other way will make fell stars 
be blue stars, according to Carter (1968b). In neither case will any star like the sun 
be possible. He raises the question whether life could have developed if the deter¬ 
minants of the physical constants had differed substantially from those that charac¬ 
terize this cycle of the universe. 

Dicke (1961) has pointed out that the right order of ideas may not be, here is 
the universe, so what must man be; but here is man, so what must the universe 

* Values based on the “typical cosmological model” of Box 27.4; subject to much uncertainty, in the 
present state of astrophysical distance determinations, not least because the latitude in these numbers 
is even enough to be compatible with an open universe. 



§44.6. THE REPROCESSING OF THE UNIVERSE 


1217 


be? In other words: (1) What good is a universe without awareness of that universe? 
But: (2) Awareness demands life. (3) Life demands the presence of elements heavier 
than hydrogen. (4) The production of heavy elements demands thermonuclear 
combustion. (5) Thermonuclear combustion normally requires several 10 9 years of 
cooking time in a star. (6) Several 10 9 years of time will not and cannot be available 
in a closed universe, according to general relativity, unless the radius-at-maximum- 
expansion of that universe is several 10 9 light years or more. So why on this view 
is the universe as big as it is? Because only so can man be here! 

In brief, the considerations of Carter and Dicke would seem to raise the idea of 
the “biological selection of physical constants.” However, to “select” is impossible 
unless there are options to select between. Exactly such options would seem for the 
first time to be held out by the only over-all picture of the gravitational collapse 
of the universe that one sees how to put forward today, the pregeometry black-box 
model of the reprocessing of the universe . 

Proceeding with all caution into uncharted territory, one must nevertheless be 
aware that the conclusions one is reaching and the questions one is asking at a given 
stage of the analysis may be only stepping stones on the way to still more penetrating 
questions and an even more remarkable picture. To speak of “reprocessing and 
selection ’ may only be a halfway point on the road toward thinking of the universe 
as Leibniz did, as a world of relationships, not a world of machinery. Far from being 
brought into its present condition by “reprocessing” from earlier cycles, may the 
universe in some strange sense be “brought into being” by the participation of those 
who participate? On this view the concept of “cycles” would even seem to be 
altogether wrong. Instead the vital act is the act of participation. “Participator” is 
the incontrovertible new concept given by quantum mechanics; it strikes down the 
term “observer” of classical theory, the man who stands safely behind the thick glass 
wall and watches what goes on without taking part. It can’t be done, quantum 
mechanics says. Even with the lowly electron one must participate before one can 
give any meaning whatsoever to its position or its momentum. Is this firmly estab¬ 
lished result the tiny tip of a giant iceberg? Does the universe also derive its meaning 
from “participation”? Are we destined to return to the great concept of Leibniz, 
of “preestablished harmony” (“Leibniz logic loop”), before we can make the next 
great advance? 

Rich prospects stand open for investigation in gravitation physics, from neutron 
stars to cosmology and from post-Newtonian celestial mechanics to gravitational 
waves. Einstein’s geometrodynamics exposes itself to destruction on a dozen fronts 
and by a thousand predictions. No predictions subject to early test are more en¬ 
trancing than those on the formation and properties of a black hole, “laboratory 
model for some of what is predicted for the universe itself. No field is more pregnant 
with the future than gravitational collapse. No more revolutionary views of man 
and the universe has one ever been driven to consider seriously than those that come 
out of pondering the paradox of collapse, the greatest crisis of physics of all time. 

AH of these endeavors are based on the belief that existence should have a 
completely harmonious structure. Today we have less ground than ever before for 
allowing ourselves to be forced away from this wonderful belief. 


Black hole as "laboratory 
model for collapse of 
universe 


EINSTEIN (1 934) 



(Sur I'air de "J'ai rendez-vous avec vous," 
chanson de Georges Brasseus) 



We will first understand 
How simple the universe is 
When we realize 
How strange it is. 

ANON. 


To some one who could grasp the 
universe from a unified standpoint 
the entire creation would appear 
as a unique truth and necessity. 

J. D'ALEMBERT 


Le Rayonnement dipolaire 

On sait qu'il n'est pas pour nous 

C'est pour Maxwell ' oui mais Maxwell on s'en fout 

Tout est rlatif apres tout 

Un argument qu'on revere 
Celui de Synge pour dire le tout 
Nous promet le quadrupolaire 
Tout est rlatif apres tout 

Les sources quasi stellaires 
Disparaissent comme dans un trou 

Dans le Schwarzschild, oui mais Schwarzschild on s'en fout 
Tout est rlatif apres tout 


Aux solutions singulieres 
On prefere et de beaucoup 
Une metrique partout reguliere 
Tout est rlatif apres tout 


Yo ho, it's hot ... the sun is not 
A place where we could live 
But here on earth there'd be no life 
Without the fight it gives 

H. ZARET 


Probable-Possible, my black hen. 

She fays eggs in the Relative When. 

She doesn't lay eggs in the Positive Now 
Because she's unable to postulate How. 

F. WINSOR 

From A Space Child's Mother Goose. 

(?) 1956, 1957, 1958 by Frederick Winsor and Marian Parry, 
by permission of Simon and Schuster. 


Les physiciens nuc lea ires 
Comme Us nous aiment pas beaucoup 
Y gardent tout Uric, oui mais le fric on sen fout 
Tout est rlatif apres tout 

Les experiences de Weber 
Le gyroscope, ga coutent des sous 
Cedes de pensees sont moins cheres 
Tout est rlatif apres tout 



M. A. TONNELAT 

Reprinted with the kind permission of M. A. Tonnelat. 















"Omnibus ex nihil ducendis sufficit unum!" 
(One suffices to create Everything of nothing!) 

GOTTFRIED WILHELM VON LEIBNIZ 


(Sur I'air de Aupres de ma blonde) 

Dans les jar dins d'Asnieres 
La science a refleuri 
Tous les savants du monde 
Apportent leurs ecrits 

Refrain: 

Aupres de nos ondes 
Qu il fait bon , fait bon f fait bon 
Aupres de nos ondes 
Qu'il fait bon rever 

Tous les savants du monde 
Apportent leurs ecrits 
Loi gravitationnelle 
Sans tenseur d'energie 


Pregeometry 


Loi gravitationnelle 
Sans tenseur d'energie 
De ravissants mo deles 
Pour la cosmologie 

De ravissants modeles 
Pour la cosmologie 
Pour moi ne m'en faut guere 
Car fen ai un joli 

Pour moi ne m'en faut guere 
Car fen ai un joli 
11 est dans ma cervelle 
Void mon manuscrit 

Le champ laisse des plumes 
Aux bosses de I'espace-temps 
En prendrons quelques unes 
Pour deer ire le mouvement 

C. CATTANEO, J. GEHENIAU 
M. MAVRIDES, and M. A. TONNELAT 

Reprinted with the kind permission of the authors. 


When Arthur Evans began this excavation 
neither he nor anyone knew that he 
would uncover an unknown world. 

PAT (Mrs. Hypatia Vourloumis 
at Knossos (1971)) 


And as imagination bodies forth 
The form of things unknown , the poet's pen 
Turns them to shapes , and gives to airy nothing 
A local habitation and a name. 

SHAKESPEARE 


Geometrodynamics ^y^ 

y^ Bianchi 
/TV identity and 
^y^ j Ny boundary \ 
^y^ / \ of a / \ 

^ boundary / \ 

Gravitational / c . A . t \ 

n / Einstein s X / B| ack 

/geometrodynamic\ / holes 

/ law / 


, Cosmology 


Neutron 

stars 


Geodesic 

law 


Gravitational 


/ Post- 
Newtonian 
celestial 
mechanics . 


Appreciation and farewell to our patient reader. 
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Aberration 

formulas for, 68 

in light-deflection experiments, 1101 
Absolute space of Newtonian theory, 19, 
40, 29If 

Absolute time of Newtonian theory, 29If 
Abundances of elements, 765 
Accelerated observer in curved spacetime, 
327-332. See also Proper reference 
frame 

Accelerated observer in flat spacetime, 
163-175 

measuring equipment of, 164-165 
problems of principle in defining 
coordinate system of, 168-169 
constraints on size of frame, 168-169 
tetrad Fermi-Walker transported with, 
169-172 

local coordinate system of, 172-176 
with rotating tetrad, 174f 
Acceleration 

gravity mocked up by, 163fT 
equivalent to gravitational field. See 
Equivalence principle 
special relativity adequate to analyze, 
163 ff 

of neutron in nucleus, 163 
constant in comoving frame, for 
hyperbolic motion. 166-167 
4-acceleration always orthogonal to 
4-velocity, 166 

See also Fermi-Walker transport 
Acceleration, “absolute.” and the 
equivalence principle, 17 
Acceleration, relative. See Geodesic 
deviation 

Accretion of gas onto a black hole, 885 
Action. See Dynamical path length 
Action principle. See Variational principle 


Action at a distance, gravitational, 4 
Newton’s stricture against, 41 
derived from local law, 120 
Active vs. passive transformations, 1140 
Adiabatic index defined, 692 
Advanced fields, and radiation reaction, 474 
Advanced potential, 121 
After, undefined term in quantum 
geometrodynamics, 1183 
Affine connection. See Connection, 
Covariant derivative 
Affine geometry’ 
characterized, 191, 242 
in extenso, Chap. 10 
See also specific concepts , such as 
Covariant derivative, Connection 
coefficients 

Affine parameter, defined, 211, 244ff 
of geodesic, 244-246 

effect of changing, on geodesic deviation, 
269 

variational principle adapted to, 322-323 
in geometric optics, 575 
Alternating symbol, in spinor analysis, 

defined, 1152. See also Levi-Civita 
tensor. Permutation tensor 
Alternating tensor. See Permutation tensor 
Ampere’s law. from electromagnetic 
4-potential, 122 

Angle-effective distance vs. redshift, 795f 
Angular integrals, useful formulas, 1001 
Angular momentum in curved spacetime, 
for an isolated source 
defined by way metric approaches 
flatness 

in extenso , chapter 19 
in linearized theory, 448-451 
in general. 453ff 
as geometric object residing in 
asymptotically flat region, 453 


no meaning of, for closed universe, 457ff 
contribution of interbody matter and 
fields to. 468 

total unambiguous, despite contribution 
of pseudotensor to, 470 
Gaussian flux integral for, 460-464 
volume integral for, 460-466 
measured by satellite-orbit precession, 
451,454,457 

measured by gyroscope precession, 451, 
454, 457 

measured by frame dragging, 451, 457 
conservation laws for, 455. 468-471 
for Kerr-Newman black hole, 891 
Angular momentum in flat spacetime 
density of. 151, !56f 
total, 156-159 

decomposition of total into intrinsic and 
orbital, 158f 
conservation of, 156f 
intrinsic, sets lower limit to size, 162 
parallel transport of, and Thomas 
precession. 175-176 

Angular momentum in Newtonian theory’, 
flux integral for, 470 
Angular momentum operators, 240 
Angular momentum, orbital, for test 
particles 

in Schv arzschild geometry, 656ff 
in Kerr-Newman geometry. 898f 
Angular vclocitv 

extended to four dimensions, 170f 
rotating tetrad, vs. Fermi-Walker tetrad, 
1741’ 

m context of spinor analysis, 1139, 1142 
Angular velocity' ol orbital motion in 
Kerr-Newman geometry, 893ff 
Anholonomic basis,"204. 210, 239 
Anisotropy energy, 802, 807 
Amisymmetrization, of tensor, 83 
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Asymptotically flat spacetime geometry. 453 
form of, in linearized theory. 448ff " 
form of far from station ary - fully 
relativistic source, 456f 
key to defining mass and angular 
momentum. 457ff 

in evaluation of Gaussian flux integral, 
462f 

“1 weigh all that's here,” 475 
conformal treatment of infinity, 917-921 
Automatic conservation of source, 404, 

408f, 417. See also Bianchi identities 


B 

Background geometry 
defined by limiting procedure, 479-480. 
See also Gravitational waves, shortwave 
formalism of 

Backscatter of waves off curvature. 864f 
869ff, 957 
Bar operation 
in linearized theory, 436ff 
in shortwave formalism, 967 
Baryons 

number density of, 558 
mass density of. 1069, 1074 
conservation law for. See under 
Conservation laws 

Base metric, in time-symmetric initial-value 
problem, 535 
Basis forms 

3- and 4-forms for volume integrals, 150 
2-forms and dual labeling thereof, 151 
Basis 1-forms 

dual to basis vectors, 60f, 202f, 232, 234 
as coordinate gradients, 60ff 
transformation laws for, 68, 203 
connection coefficients for, 209. 215, 258f 
Basis vectors, 50 
in extenso, 201-207 
as differential operators, 229f 
dual to basis 1-forms. 60ff, 232 
transformation laws for, 68, 201, 203, 

230f 

commutation coefficients for, 204 
connection coefficients for, 209, 258f 
coordinate vs. general basis, 201-203 
coordinate basis, 230f 
See also Proper reference frame, tetrad 
Bell bongs, 55 f, 60, 99. 202, 231 
Bertotti-Robinson electromagnetic universe 
845 

Betti numbers, characterize connectivity, 

221 

Bianchi identities 
stated, 22If, 224, 325f 
proved, 287 

model for, in geodesic identity, 318 
expressed in terms of curvature 2-form, 
362 

in terms of boundary' of a boundary', 
Chap. 15 


as automatically fulfilled conservation 
law, 405 

required because geometrodynamic law 
must not predict coordinates. 409 
applied to equations of motion, 473 
from coordinate-neutrality of Hilbert- 
Palatini variational principle, 503 
Big Dipper, shape unaffected by velocity of 
observer, 1160-1164 
Binary star 

black holes as members of, 886f 
generation of gravitational waves by, 

986, 988ff7 995 

Binding energy of orbits around black 
holes. 885, 911 
Birkhoffs theorem 
for Schwarzschild geometry, 843f 
for Reissner-Nordstrom geometry, 844ff 
Bivector 
defined, 83 

in surface of Whitaker’s calumoid, 125 
Black body. See under Radiation 
Black hole, 884-887 
in extenso, Chap. 33 
brief summary of properties, 620 
history of knowledge of, 620, 623 
why deserve their name, 872-875 
Kerr-Newman geometry' as unique 

external field, 863, 875-877, esp. 876 
“hair on,” 43, 863, 876 
baryon number transcended by, 876 
lepton number transcended by, 640, 876 
astrophysical aspects of, 883-887 
mechanisms of formation, 883-884 
gravitational waves from collapse that 
forms, 1041 

dynamical processes, 884ff 
can never bifurcate, 933 
collision and coalescence of, 886, 924, 
939 

gravitational waves from hole-hole 
collisions, 886, 939, 982 
interactions with matter, 885f 
Cygnus X-l as an examplar of, ix 
gravitational waves from matter falling 
into, 885, 904, 982f, 986 
change of parameters of hole due to 
infall of particles, 904-910, 913 
extraction of energy from, 906, 908 
experimental tests of general relativity 
using, 1047 

See also Black-hole dynamics, laws of; 
Collapse, gravitational; 

Kerr-Newman geometry’; 
Schwarzschild geometry 
Black-hole dynamics, laws of, 887f. See also 
Second law of black-hole dynamics 
Boost, 67ff 
Boundary 

of a boundary’, route to Bianchi 
identities, Chap. 15 
of a boundary is zero, 364-370 
automatically conserve’s Cartan’s 
moment of rotation, 377-378 


of the boundary of a 4-simplex, 380-381 
Boundary operator, 96 
Boyer-Lindquist coordinates. 877-880 
Brackets, round and square, define 
symmetry, 126 

Bragg reflection, related to 1-forms, 232 
Brans-Dicke theory of gravity. See 
Gravitation, theories of: 
Dicke-Brans-Jordan theory 
Brill-Hartle averaging process, 970 
Brownian forces, 1038 
Bubble-time derivative, 497 
Buffer zone, in analysis of departures from 
geodesic motion, 476-480 
Buoyant force, 606 


C 

Calumoid, Whitaker’s, related to flux 
integrals, 125 

Canonical structure, metric and symplectic 
structure, 126 

Canonical variables, in Hamiltonian 
mechanics, 125 

Cartan structure equations, 359 
Carter’s fourth constant, 899 
Causal relationships in flat spacetime, 

48, 51 

Causal structure of curved spacetime, 922ff 
future horizons, 923-924 
global structure of horizons, theorems 
about, 924-925 

global structure of horizons, analysis of, 
926-931 

See also Global techniques, Horizons 
Causality, principle of, and the mechanism 
of radiation, 110 
Caustics, of a horizon, 925 
Cavendish experiment, 112If 
Cavendish gravitational constant, 1121 ff 
dependence on velocity relative to 
“preferred universal rest frame,” 
1123-1124 

dependence on chemical composition of 
gravitating bodies, 1125 
variations in, cause deviations from 
geodesic motion, 1127-1128 
Center of mass, 161 
Centrifugal forces, 294 
Centrifuge, in idealized redshift experiment 
63f 

Centroid, 161 ^ 

Cepheid variable stars 
pulsation of, 632 

period-luminosity relation discovered, 758 
as distance indicators, 786 
confused with HII regions in Hubble’s 
work, 709 

confusion resolved by Baade, 710, 760 
Chain rule 

abstract, 314-315 

for covariant derivative, 252, 257f, 260f 
Chandrasekhar limit, 619 
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Charge 

evaluated from flux integral, 98 
of closed universe, meaningless integral 
for, 457-458 

as measured by tubes of force, in 2-form 
representation, 107 

as lines of force trapped in the topology 
of space, 221, 368, 1200f 
Charge conservation. See Conservation 
laws, charge 
Charge density-current 
4-vector, Lorentz transformation of, 68 
dual representations 88, 97f 
3-form, 113f, 151 

Dirac’s representation, for particle in 
arbitrary motion, 120f 
Chemical potential. See under 
Thermodynamics 

Chines^ historical records of Crab 
supernova, ii. 

Classical mechanics, correspondence with 
quantum mechanics, 413 
Classical theory, conceives of geometry and 
fields as measureable, 13 
Clock “paradox,” 167 
Clocks 

bad vs. good, 26-27 
stability of, 28, 1048 
ideal 

defined, 393 

in Newton-Cartan theory, 301 
built on geodesics, 396-399 
specific types of, 28, 393-396 
influence of acceleration on, 164f, 327, 
396 

influence of tidal forces on, 396 
as tools in parametrization of geodesics, 
246 

infinite sequence of, needed as one 
approaches a singularity, 813f 
Closed form, 114. See also Forms, 
differential 

Closure of universe. See Cosmological 
models 

Clusters of galaxies 
origin of, 766, 769f 

Virgo as source of gravitational waves, 
1042 

Cold, catalyzed matter, 624-626 
Collapse, gravitational 

in one and two dimensions, 867f 
of a spherical shell of dust, 555-556 
of a spherical star 

analyzed by examining exterior 
geometry, 846-850, 857 
redshift of radiation from, 847, 849f. 
872 

decay of luminosity of, 847, 850. 872 
surface of last influence. 873f 
Eddington-Finklestein diagram for. 

849, 864, 873 

Kruskal diagram for. 848, 855 
embedding diagrams for. 855f 
comoving coordinates for. 857 


equations governing adiabatic collapse, 
858f 

models with zero pressure, 859 
models with zero pressure and uniform 
density, 851-856, 859 
realistic, 862f, 883f 
triggering of, in late stages of stellar 
evolution, 627, 862 

collapse, pursuit, and plunge scenario, 
629 

evolution of small perturbations from 
spherical symmetry, 864-866 
Price’s theorem, 866 
gravitational waves emitted during, 

1041 

inevitability of, for massive stars, 819 
in a dense star cluster, 884 
creation of Kerr-Newman black hole 
by, 882-883 

at three levels: universe, black hole, 
quantum fluctuations, 1201 
issue of the final state, 940, 1196f 
black box model of, 1209, 1213-1217 
importance of and philosophical 
implications of, 437, 1196f 
Collapsed star. See Neutron star, Black 
hole 

Collisions of particles in flat spacetime, 19, 
69f 

Comma-goes-to-semicolon rule, 387-392. 

See also Equivalence principle 
Commutation, of observables on spacelike 
hypersurface, 554 

Commutation coefficients of basis vectors, 
204,243,314 

calculated by exterior derivative of basis 
1-forms, 358f 
for rotation group, 243 
Commutator 

of tangent vectors, 204, 206f, 235-240 
Jacobi identity for. 240 
as closer of quadrilaterals, 236, 278 
pictorial representation of, 236-237 
for rotation group, 332 
for normal and tangent to spacelike slice, 
517 

of covariant derivatives, 276, 389ff 
Compatibility of metric and covariant 
derivative, 313ff, 353f 
Complexion, of electromagnetic field, 108, 
482 

Component manipulations. See Index 
manipulations 

Component notation, to remove ambiguity 
of slots, 84 
Components 

of vectors, introduced, 8-10, 50-51 
of 1-forms, introduced. 61 
of tensors, introduced. 75 
of curvature tensor, introduced, 34. 37. 

40. 42 

See also Index manipulations 
Concepts of physics, defined bv theory 
itself. 71 f 


Conduction of heat, 567 
Conformal curvature tensor. See under 
Curvature, formalism of 
Conformal part of 3-geometry, in York’s 
formulation of initial-value problem, 
540-541 

Conformal transformation of infinity. See 
under Infinity 

Connection, measured by light signals and 
free particles, 324 
See also Covariant derivative 
Connection coefficients 
summarized, 223 

as components of covariant derivative, 
208f, 256, 261-262 

calculated from metric and commutators, 
210, 216, 314 
symmetries of, 213-214 
transformation law for, 262 
specialized to a coordinate basis 
called “Christoflel symbols,” 210 
contraction of, in terms of metric, 222 
formula for, from Palatini variational 
principle, 502 

unique, to make geodesics agree with 
straight lines of local Lorentz 
geometry, 3l4f 

illustrated by great-circle navigation, 212 
specific cases of 

for plane, in polar coordinates, 213, 

263 

for flat 3-seometry, polar coordinates. 
213 

for 2-sphere, 341, 345 
for rotation group, 264 
for Riemann normal coordinates, 286f 
for Newton-Cartan spacetime, 29If, 
294, 298 

for proper reference frame of 
accelerated observer, 330f 
Connectivity 
at small distances, 221 
of spacetime, in classical differential 
geometry, 1204-1205 

charge as trapped lines of force, 221, 368 
1200f 

See also Topology 
Conservation laws 

equivalence of differential and integral 
formulations of, 146 
baryon number. 558f. 563fT 
applied to pulsating stars. 69If 
applied to collapsing stars, 858 
in PPN formalism, To88 
electric charge, 369f 
differential formulation of. 88, 568, 570 
integral formulation from differential 
via Stokes theorem, 98, 156 
as consequence of dd — 0, 118 
energy-momentum(V • T = 0) 

tested in elementarv particle physics. 19 
in flat spacetime. 132, 146. 152-155 
integral formulation in fiat spacetime 
142-146 
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Conservation laws ( continued) 

transition to curved spacetime, 386f. 

390 

to be interpreted as automatic, via 
“wirina up” to geometry, 364, 

367f, 371, 404-407 
in terms of generalized exterior 
derivative, 362f 

various mathematical representations 
for, 379 

total mass-energy and 4-momentum of a 
gravitating source, 455. 468-471 
for test-particle motion 
related to Killing vector field, 651 
related to Hamilton’s principle, 654 
in Schwarzschild geometry, 655-658 
in any spherical, static geometry, 681 
in Kerr metric and electromagnetic 
field. 898f 

Constants, fundamental 
listed, endpapers 

limits on deviations from constancy, 
1061-1063 

Constraint, as signaling reduced number of 
degrees of freedom, 528f 
Constraints, first and second class, in 
Dirac’s formulation of 
geometrodynamics, 486 
Constructive interference 
as shortest leap from quantum to 
classical, 1185 

in particle mechanics and in 

geometrodynamics, compared, 1186f 
behind Hamilton-Jacobi formulation of 
mechanics and geometrodvnamics, 
423f 

Continuity, equation of, 152IT, 565 
“Continuous'creation,” 745, 750, 770 
Contraction of tensor, 82 
Contravariant components, 76, 201-207, 312 
Controlled ignorance, philosophy of, 452 f, 
996 

Convection, in supermassive stars, 600 
Coordinate patch, concept introduced, 

10-12 

Coordinate systems 

nature of, deducible from metric, 595f 
of accelerated observers, 172-176 
asymptotically Minkowskiian, 463 
Boyer-Lindquist, 877-880 
comoving, for collapsing star, 857 
comoving, for universe, 715ff 
curvilinear, in linearized theory, 441 
Eddington-Finklestein, 828-831, 849 
Euclidean, 22f 
Fermi normal, 332 
Galilean, 289, 291-298, 414 
Gaussian normal, 516, 518, 715ff 
isotropic, for Schwarzschild geometry’, 

840 

isotropic, for static, spherical system, 595 
Kerr, 879f 
Kerr-Schild, 903 
Kruskal-Szekeres, 827, 831-836 


local Lorentz, 207 
Lorentz. 22 f 

Minkowski, same as Lorentz 
Novikov, 826f 

of post-Newtonian formalism, 1073f, 
1082-1087, 1089. 1091, 1097 
Regge-W'heeler, same as Tortoise 
Riemann normal. 285ff, 329-332 
Schwarzschild, for Schwarzschild 
geometry. 607 

Schwarzschild, for static, spherical 
systems, 597 

Schwarzschild, for pulsating star, 689 
for any spherical system, 616f 
T Tortoise, for Schwarzschild geometry, 

663. 665-666 
Coordinates, 5-10 

canonical, in context of differential forms 
and symplectic structure, 125f 
must not be predicted by 

geo metro dynamic law, 409 
rotation and translation of, in 
Newton-Cartan theory’, 294f 
preferred, in Newton, Minkowskii, and 
Einstein spacetime, 296 
Coordinate singularities. See Singularities, 
coordinate 

Coplanarity, test for, 281 
Coriolis forces, 165, 175, 294, 327, 332 
Correspondence, between 1-forms and 
vectors, 310. See also Vectors; Forms, 
differential 

Correspondence principles, 412f 
Cosmic censorship. 937 
Cosmic gravitational-wave background, 712, 
736f, 764f 

Cosmic microwave radiation, 712f, 764ff 
prediction of by Gamow’ et a!., 760 
isotropy of, 703 

existence of, refutes steady-state 
cosmological model, 770 
incompatible with “turnaround universe,” 
751 

Cosmic neutrino background, 712, 736f, 
764f 

Cosmic rays, 757 
evolution of mean density of, 798 
observations refute Klein-Alfven 
cosmological models, 770 
Cosmological constant, 4l0ff 
Einstein’s invention and retraction of, 

41 Of, 707, 758 

influence on evolution of universe, 747, 
771, 774 

Cosmological models 
anisotropic, Chap. 30 
Brans-Dicke, 770 
closure of universe 
related to Mach’s principle, 543, 549 
as boundary condition, 1181 
de Sitter, 745, 758 

Einstein static universe, 746f, 750, 758f 
flat, closed, static 3-torus model, 284 
Friedmann 


discovery of, by Friedmann and 
Lemaitre. 751. 758 
assumption of homogeneity and 
isotropy. 703, 713 
assumption of perfect-fluid 
stress-energy tensor, 71 If 
assumed equation of state. 713, 726 
implications of homogeneity and 
isotropy, 714f, 720ff 
isotropy implies homogeneity. 715, 723 
coordinate system constructed, 715ff 
expansion factor introduced, 718 
arbitrariness in expansion factor, 720ff 
expansion factor renormalized, 72If 
possible 3-geometries for homogeneous 
hypersurfaces, 720-725 
curvature parameter K = k/a 2 . 721 
line element, various forms for. 72Iff, 
731. 759 

embedding diagrams, 723, 725 
topology not unique, 725 
first law of thermodynamics for. 726ff 
assumption that matter and radiation 
exchange negligible energy, 726ff, 
765 

assumption that pressure of matter can 
always be neglected, 726, 728 
density and pressure expressed in terms 
of expansion factor, 727 
Einstein tensor for, 728 
orthonormal frames attached to matter, 
728 

initial-value equation (for a, t 2 ), 744 
dynamic equation (for a, tt ), 729 
dynamic equation derivable from 

initial-value equation plus first law 
of thermodynamics, 729 
time parameters; t, a, rj, 730-732 
observer’s parameters vs. relativity 
parameters, 77 Iff 

implications of parameter values for 
future of universe, 747, 771, 773f 
dynamics of early stage independent of 
k (unaffected by closure), 742f 
critical density for closure of universe 
if A = 0, 782 

small perturbations of, 800f 
See also Hubble constant, Density 

parameter, Deceleration parameter 
Friedmann, closed {k = +1, A = 0) 
in extenso, 733-742 
track-1 overview, 704-711 
Einstein’s arguments favoring closure, 
704 5 

critical density for closure, 710, 782 
geometry of 3-sphere hypersurfaces, 

704, 721, 723 f 
radius of, defined, 704 

radius of maximum expansion, 705 

embedding diagram, 723f 

volume of, 724 

topology not unique, 725 

first law of thermodynamics applied to, 

705, 726ff 
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initial-value equation for, 537, 705f, 

729, 733 

effective potential for evolution of, 706 
inevitability of recollapse, 707 
solutions of field equations for, 734f 
radiation-dominated era, 733-737, 740ff 
matter-dominated era, 733ff, 738-742 
coordinate diagram for, 741 
concrete numbers for a typical model, 
738 

propagation of signals around universe, 
741, 750 

causal isolation of various regions from 
each other, 740ff 

mocked up by Schwarzschild-lattice 
universe, 739f 
compared with Newtonian 
cosmological models, 707f 
Friedmann, flat and open (k = 0, 

'k = -1; A = 0) 
geometry of homogeneous 
hypersurfaces, 721, 724f 
embedding diagram, 724f 
topology not unique, 725 / 

Solutions of field equations for, 742 
radiation-dominated era, 742f 
matter-dominated era, 743f 
Friedmann, plus cosmological constant 
( k = 0, ±1; A f 0) 
initial-value equation (for c, f 2 ), 744 
effective potential for evolution of, 744, 
746, 748f 

dynamical evolution of, 744-747 
special cases of, 745ff, 750f 
hesitation universe, 750 
hierarchic (island) universe, 748f, 770 

inhomogeneous, Chap. 30-- 

inhomogeneous Gowdy models, 804 
inhomogeneous but spherical models, 

804 

Kasner model, 801, 805ff 
Klein-Alfven model, 748, 770 
mixmaster, 805-814 
Newtonian, 707f, 759 
primordial chaos in big-bang models, 

769, 802ff 

in extenso, Chap. 30 

primordial black holes produced by, 

884 


See also Isotropy and homogeneity of 
universe, possible explanations of 
Schwarzschild lattice universe, 739f 
steady-state universe, 745, 750, 770 
turnaround universe, 750f 
See also Cosmology: history’ of universe 
according to “standard big-bang 
model” 

Cosmology: 
expansion of universe 
prediction of by Friedmann, de Sitter, 
and Weyl, 758, 776 
discovery of by Hubble, 759, 792-794 
removed motive for cosmological term, 
410-411 


was greatest prediction of Einstein’s 
theory, 411 

what expands and what does not, 719, 
739 

“Where is the new space added?” 719, 
739 

will Universe recontract? 747, 771, 774 
See also Hubble expansion rate 
history of man’s ideas and knowledge of 
the universe, 752-762 
history of the universe according to the 
“standard big-bang model” 
in extenso, Chap. 28 
initial singularity, 769f 
what “preceded” initial singularity? 

769 

possible roles of primordial chaos, 769, 
803f, 816. See also Cosmological 
models: primordial chaos 
complete thermal equilibrium at 
t < 1 second, 736, 763f 
decoupling of gravitational waves and 
neutrinos, 736, 764 
recombination of pairs, 736f, 764 
thermal interaction of matter and 
radiation during expansion, 765f 
transition from matter dominance to 
radiation dominance, 74If, 765f 
condensation of stars, galaxies, and 
clusters of galaxies, 766, 769, 800 
past history' not much affected by k 
(by geometry' of hypersurfaces), 
742f, 763 

expansion forever vs. recontraction, 

747, 771, 774 

observational probes of standard model, 
780-798 

summary of, 797f 

distance-redshift relation, derivation of, 
780f 

distance-redshift relation, observational 
data, 781, 785-788, 792fT. See also 
Hubble expansion rate 
magnitude-redshift relation, derivations 
of, 782-785, 794 
magnitude-redshift relation, 
observational data, 788-791 
angle-effective distance vs. redshift 
(“lens effect of universe”), 795f 
source counts (number-flux relation), 
798 

mean mass density of universe, 710ff, 
796f 

comparison of temperature, redshift, 
and emission times for cosmic 
background radiations, 737 
abundances of elements, 765 
comparison of ages deduced by various 
methods, 797f 

evolution of quasar population, 767f, 

770 

experimental tests of general relativity 
using cosmological observations 
1047 . 
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observed properties of universe 
homogeneity on large scales, 703, 815 
isotropy on large scale, 703, 801, 815 
rotation, observational limits on, 939 
cosmological expansion, 772, 775f, 
785-788, 793f 

age deduced from expansion rate, 709f, 

797 

ages of oldest stars, 709, 797f 
ages of rocks and meteorites, 759, 761, 

798 

deceleration parameter, 785, 788-791 
density parameter, 796f 
mean density of luminous matter, 710f, 
761 

mean density of cosmic rays, 712, 757, 
798 

mean density of intergalactic matter, 
712, 761f, 797 

mean density in electromagnetic 
radiation, 712 

energy and pressure in kinetic motions 
of galaxies and stars, 711 
abundances of elements, 765 
entropy per baryon, 766 
quasar population, evolution of, 767f, 
770 

“fine-scale” structure, 703 
See also Cosmic microwave radiation, 
Hubble expansion rate 
speculations about initial and final states 
of universe, 707, 1209, 1213-1217 
Coulomb field, “pancaking” of, for fast 
charged particle, 124 
Coulomb force, from electromagnetic 
4-potential, 122 

Coupling of fields to matter, direct vs. 
indirect, 1063f 

Covariance, general. See General 
covariance 

Covariant components of a tensor, 76, 
201-207, 312 
Covariant derivative 
fundamental equations summarized, 
223-224 

defined by parallel transport, 208, 249 
pictorialized, 209, 212 
algebra of, 250-261 

chain rule for, 214, 250, 252, 257f, 260f 
symmetry' of (“no torsion”), 250, 252, 

353f 

additivity of, 252 

commutes with contraction, 214 

compatibility with metric, 215f, 313ff, 

353f 

noncommutation of two covariant 
derivatives, 389ff 
as a machine with slots, 253ff 
is not a tensor, 253, 255f 
connection coefficients as its components, 
210, 256, 26If 

rotation 1-forms constructed from, 349ff, 
359f 

semicolon notation for, introduced, 210 
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Covariant derivative ( continued ) 
component calculations of, 215 
of tensor densities, 50If 
in a hypersurface, 510 
regarded as a gravitational field, 387 
See also Connection coefficients; Parallel 
transport; Rotation coefficients 
Crab nebula, ii, 619f. 760 
Cross section 
collisional, 69 

Lorentz transformation of, 70 
Crystallography, related to 1-forms, 232 
Current 4-vector. See Charge 
density-current 

Curvature, constant, 3-geometries of, 
720-725 

Curvature, formalism of 

fundamental equations, summarized, 
223-224 

Bel-Robinson (tidal) tensor, 38If 
conformal (Weyl) tensor 
introduced, 325, 327 
principal null congruences of, 902 
Petrov-Pirani algebraic classification of, 
1165 

spinor representation of, ll54f 
in Nordstrom-Einstein-Fokker theory 
of gravity, 429, 431 
vanishes in 3 dimensions. 550 
Einstein tensor 
introduced, 222, 325f 
track-1 equations summarized, 224 
as trace of double dual of Riemann, 
325f, 376 

formula for mixed components in 
terms of Riemann components, 
343f 

in terms of intrinsic and extrinsic 
curvature, 515 

interpreted as moment of rotation, 
373-377 

contracted Bianchi identity 

(“conservation of Einstein”), 325, 
377 ff 

conservation of, from bound ary of a 
boundary, 377ff 
uniqueness of, 405, 407f 
curvature 2-form, 348-363 
picture of, for 2-sphere, 337 
picture of, for pith helmet, 338 
curvature operator 
introduced, 271 

regarded as bivector-valued 2-form, 
376-380 

as twice-applied exterior derivative, 351 
as machine-with-slots, 35If 
in context of Newton-Cartan theory’, 

299 

extrinsic curvature of a hypersurface, 
511-516 

contrasted with intrinsic curvature, 336, 
421 

operator for, 511 
tensor for, 512 


from Lie derivative of metric, 520 
Gauss-Codazzi relations, 514ff 
Gaussian curvature of a 2-surface, 30. 44 
336f 

intrinsic curvature of a hypersurface. 509f 
invariants of Riemann , 491 
. Jacobi curvature tensor, 286f 
Jacobi curvature operator, 286 
in context of Newton-Cartan theory, 
299. 301 

principal radii of curvature for a 
2-surface, 44, 335f 
Riemann tensor 

component formulas for, summarized, 
224, 266 

component formula for in 
non-coordinate basis, 277 
Riemann, matrix display of 
components of, 360f 
elementary introduction to, 31, 34-37, 

39 

in extenso track-1 treatment (metric 
present), 218-224 
in extenso, in absence of metric, 

270-288 

in extenso, properties induced by 
introduction of metric, 324-327 
defined by parallel transport around 
closed curve, 277-282 
proof of tensor character, 276 
defined by geodesic deviation, 29-37, 

218f, 270-277, 287 
relation to curvature operator, 274ff 
relation to noncommuting covariant 
derivatives, 389ff 
relation to curvature 2-form, 352 
as machine with slots, 271, 274f 
symmetries of, 35, 220ff, 286, 324f 
number of independent components, 

326 

invariants of, 491 

in 2 and 3 dimensions: deducible from 
Ricci tensor, 334, 343, 550 
Bianchi identities, 22If, 224, 325f. See 
also Bianchi identities 
only tensor from, and linear in, second 
derivatives of metric, 408 
wave equation for, 382 
dynamic components of, 517f 
spinor representation of, ll54f 
in Newton-Cartan spacetime, 290, 302 
in linearized theory, 438 
Riemann tensor, double dual of, 325f, 

343, 371, 376 
Ricci tensor, 222, 325f 
in Newton-Cartan theory’, 290, 300 
scalar curvature 
introduced, 222, 325 
in terms of area deficit, 516 
for a 3-surface, 422f 
Gauss-Bonnet integral of, 309, 381 
in Hilbert action principle, 418, 491 
Weyl tensor. See Conformal tensor 
York’s curvature, 541, 550 


Curvature, methods of calculating 
in extenso, Chap. 14 
analytical, on a computer, 342 
straightforward method, from connection 
and its derivative, 340f 
mixed components of Einstein expressed 
explicitly in terms of Riemann 
components, 343f 

geodesic Lagrangian method, 344-348 
via 2-forms, theory’. 348-354 
via 2-forms, method, 354-362 
ways to display results, 334, 360f 
“Curvature coupling” in equivalence 
principle, 389-392 
Curvature of spacetime 
modeled by surface of apple. 4f 
implied by gravitational red shift, 187ff 
generation of, by mass-energy, 37-44, 
Chap. 17 

measured by geodesic deviation, 29-37, 
195f, 270-275 

procedure-in-principle to measure, 72 
measured by gravity gradiometer, 

400-403 

coupling to physics in equivalence 
principle, 389-392 

coupling to moments of a macroscopic 
object, 39 If, 476-480, 1120f 
can be great locally even if average is 
near zero, 220 

See also Geodesic deviation, Tidal forces, 
Spacetime geometry 
“Curvature parameter” of Friedmann 
cosmologies, 721 

Curvature tensors for specific manifolds 
gravitational wave, exact, plane, 346 f, 444 
gravitational wave, linearized, 948 
linearized theory, any metric, 438 
Friedmann cosmology, 345, 348, 355ff, 
537, 728 

Newton-Cartan spacetime, 290 
Newtonian sp'here of uniform density, 

39f 

Newtonian spherical vacuum field, 37 
Schwarzschild metric, 82Iff 
spherical, dynamic line element, 36If 
spherical, static line element in 

Schwarzschild coordinates, 360f 
3-hyperboloid, 343, 721 
3-sphere, 343, 721 

3-surface of “constant curvature,” 721 
2-hyperboloid, 334 
2-sphere, 30, 341 ^ 

2-surface of revolution, 339f 
world tube of a collapsing star’s surface, 
853 

Curvature. See also Bianchi identities; 
Gauss-Weingarten equations; 
Gauss-Codazzi equations 
Curve, in context of differential topology, 
226 

Curves, congruence of, 240 
Cutoff, related to Planck length, 428 
Cycloidal motion 
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for radial geodesics in Schwarzschild 
geometry, 664 

for test particle in field of a Newtonian 
point mass, 708 

for radius of closed Friedmann 
cosmology, 708 

for surface of a pressure-free collapsing 
star, 852 

D 

“d,” three usages of this differential 
symbol, 95-96 

d’Alembertian operator. See Wave 
operators 

Day, length of, 23-26, 1124f 
de Rham operator. See Wave operators 
de Broglie wave, 53, 55-59 
de Sitter universe, 745, 758 
Decel&ation parameter of universe 
defined, 772 

relationship to other cosmological 
parameters, 771-773 
determinant of whether universe will 
recontract, 774 

magnitude-redshift relation for 
measuring, 782-785, 794 
observational data on, 785, 788-791 
Deficit angles, 309, 1167fF 
Deflection of light, gravitational, 
pictorial explanation of, 32 
early Einstein words on, 431 
calculated in linearized theory, 184f, 446 
calculated in Schwarzschild coordinates, 
679 

calculated in PPN formalism, llOlff 
post-post-Newtonian corrections 4o, 1069 
magnitude of, compared with current 
technology, 1048, 1101 
experimental results on, 1104f 
in flat-space theories of gravity, 179, 184f 
Deflection of particles by a central field, 
671, 1099f 

Degenerate electron gas. See White-dwarf 
matter 

Degenerate neutron gas. See Neutron-star 
matter 

Degrees of freedom, counting of, for 
geometrodynamics and 
electrodynamics, 529-533 
Delta, Kronecker, 22 
Delta function, Dirac, 121 
Democracy of histories, 418-419 
Density of universe. See under Cosmology: 

observed properties of the universe 
“Density parameter” of universe, 772, 796f 
Derivative, covariant. See Covariant 
derivative 

Derivative, directional. See Directional 
derivative 

Derivative, following fluid, 153, 1078 
Detailed balance, principle of, 1028ff, 1033, 
1035f 

Determinant 


derivative of, 160-161 
and Jacobian, 160-161 
Deviation, geodesic. See Geodesic deviation 
DeWitt equation, 1189. See also 
Einstein-SchrOdinger equation 
Dicke-Brans-Jordan theory of gravity. See 
under Gravitation, theories of 
Dicke-EotvOs experiment. See EOtvos-Dicke 
experiment 

Dicke’s framework for analyzing 
experiments, 1049, 1064 
Differentiable manifold. See Manifold, 
differentiable 

Differentiable structure, 242 
Differential conservation law, equivalence 
to integral conservation law, 146 
Differential forms. See Forms, differential 
Differential geometry 
overview of, 194-198 
track-1 treatment of, Chap. 8 
track-2 treatment of, Chaps. 9-11, 13-15 
texts on, 196 

three levels of: pictorial, abstract, 
components, 198-200 
Cartan’s contributions to, 198 
applications of, listed, 198 
See also Differential topology, Affine 
geometry, Riemannian geometry, 
and specific concepts, such as 
Metric, Connection, Forms 
“Differential,” of differential calculus, 
rigorous version of, 62 
interpreted as a 1-form, 63 
interpreted as p-form, 160-161 
Differential topology, 197f, Chap. 9, esp. 
240-243. See also specific concepts, e.g., 
Manifold, Lie derivative 
Dimensionality, 10, 12 
Dirac brackets, 486, 520 
Dirac delta function, 121 
Dirac equation, in Schwarzschild geometry, 
1165 

Directional derivative 
of a function along a vector, 59-60 
operator for, 61 
as a tangent vector, 227-230 
Disks, rapidly rotating, in general relativity, 
621 

Dispersion relations obtained from 
Hamiltonians, 486f, 494, 498 
Distance, proper. See Interval, Lorentz 
Distance-redshift relation. See under 
Cosmology: observational probes of 
standard model 

Distances, as raw material of metric, 306-309 
Distant action. See Action at a distance 
Distant stars, inertial influence of. See 
Mach’s principle 

Distribution. See Dirac delta function 
Distribution function, 583f, 590 
Divergence of a vector or tensor, 82, 213, 

222, 261 

Divergences, in theory of particles and 
fields, 426-428 
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Double star. See Binary star 
Dragging of inertial frames 
in PPN formalism, 1117-1120 
by Earth’s rotation, 1119f 
by a slowly rotating star, 699 
in Kerr-Newman geometry, 879ff, 893-896 
prospects to measure, 1120 
See also Mach’s principle 
Dual bases, 60f, 119, 202, 232 
Duality operation on forms, vectors, and 
tensors 

on forms, 88, 97f, 108, 119, 151 
on simple forms, expressed in terms of 
perpendicularity, 98 

application to electromagnetism, 88, 97f, 
114 

double dual of Riemann, 371, 376 
not to be confused with duality of bases, 
119 

special star operation that does not act 
on forms, 376-380 

Duality rotation of electromagnetic field, 

108, 482f 

Dynamical path length 
in elementary mechanics, 486-487 
as proportional to phase of wave 
function, 486 

in superspace formulation of 
geometrodynamics, 419, 1186 
See also Variational principle 


Earth 

atmosphere and gravity, 388 
crust, as detector of gravitational waves, 
1013, 1015 

general precession (precession of rotation 
axis), 391, 392, 1112, 1113 
gravitational multipole moments, 401 
mass, radius, density. See endpapers 
motion relative to cosmic microwave 
radiation, 713 

particles oscillating in hole bored 
through, 39 

rotation of, drags inertial frames, 1119f. 
See also Day 

satellite orbits used to deduce mass, 638 
shape as described by collection of 
distances, 306-309 
subsurface mass variations, 401 
tides, as experimental test of general 
relativity, 1123f 

vibrations of, as detector for gravitational 
waves, 1013, 1015, 1035f 
Eccentricity of an elliptical orbit, 647 
Eclipses, 24-26, 1104 

Eddington-Finklestein coordinates, 828-831 
849 

Eddington-Finklestein diagrams, 829, 830, 
849, 864, 873 
Effective potentials 
for test particles in Schwarzschild 
geometry, 639, 656, 659-662 
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Effective potentials (continued) 
for charged test particles in equatorial 
plane of Kerr-Newman hole, 911 
for waves in Schwarzschild geometry, 

868, 870 

for scalar waves in Kerr geometry. 915 
for radius of Friedmann universe. 706, 

744, 746, 748f 

for oscillations of mixmaster universe, 

809 ff 

Einstein. See under Curvature, formalism of 
EIH equations of motion, 1091, 1094-1095 
Eikonal method, 1102 
Einstein A coefficients, 1029 
Einstein’s elevator, 298. See also 
Equivalence principle 
Einstein field equation, 431-434 
elementary introduction to, 4Iff 
integral equation equivalent to, 995-996 
variational principles for. See under 
Variational principles 
derivations of 

in extenso. Chap. 17, esp, 406, 416-482 
from automatic conservation of source, 
379f, 417 

from Hilbert’s action principle, 418 
from physics on a spacelike slice, 
419-423 

from spin-2 field theory, 424f, 437 
from superspace analysis, 423f 
from “metric elasticity of space,” 426ff 
modified by cosmological term, 410-412 
correspondence with Newtonian theory, 
412-416 

and collapse, 1198-1199 
See also Geometro dynamics 
Einstein-Infeld-Hoffman equations of 
motion, 1091, 1094-1095 
Einstein-Rosen bridge, 837ff 
Einstein-Schrodinger equation, 1189f 
Einstein static universe, 746, 747, 750, 758f 
Einstein summation convention, 9 
Einstein tensor. See under Curvature, 
formalism of 

Einstein’s theory of gravity. See General 
relativity 

Elasticity, 426-428 
Electrodynamics 

in flat spacetime, in extenso, Chap. 3 
in curved spacetime, in extenso, 385-391, 
568-570 

in language of forms, in extenso, Chap. 4 
in language of spinors, 1154, 1165 
in terms of boundary of a boundary, 
365-370 

in geometric optics limit. See Geometric 
optics 

canonical formulation of, as a guide to 
geometrodynamics, 496f, 522ff 
analog of Palatini variational method in, 
495-498 

three-plus-one view versus geometric 
view, 78-79 

deduced from vector potential, 122 


deduced from electrostatics plus 
covariance, 8] 

lines of force never end, as core principle 
of. 420 

analogies and comparisons with 

geometrodynamics, 35, 348. 364. 
367-370 

See also Initial-value problem. Integrating 
forward in time 
Electromagnetic field 
descriptions of and equations governing 
electric and magnetic fields, 73f 
Lorentz transformation of, 78f 
dual of electromagnetic field tensor. 
Maxwell 
introduced, 88, 105 
egg-crate picture of, 107, 109 
divergence vanishes, 88 
exterior derivative gives charge density 
and current, 113f 
vector potential, 88f, 120, 569 
wave equation for, 89, 120, 388-391, 
569 

electromagnetic field tensor (or 2-form), 

Faraday 

as machinery to produce force from 
4-velocity, 73, 101, 104 
components of, 73-74 
expressed in terms of exterior products, 
99 

egg-crate pictures of, 99f, 104, 106, 107, 
111 

“canonical representation” of, 122 
special cases of pure electric, pure 
magnetic, and null, 122 
generic case reduced to simplest form, 
122, 483 

Maxwell’s equations for, in component 
notation, 80f, 568 

divergence gives charge density and 
current, 81, 88 

exterior derivative vanishes, 112f, 117 
invariants, 110, 480-483 
field momentum, 496f, 522ff 
stress-energy tensor, 140f 
divergence vanishes, 89 
complexion, 108, 482 
calumoid, 125 

Lorentz force, 7Iff, 101, 104, 568 
Maxwell’s equations, 80f, 568. See also 
Maxwell’s equations 

Lorentz transformations, 78f, 108ff, 482f 
Electromagnetic field produced by specific 
sources 

oscillating dipole, 111-112 
point charge, 107-111, 12If 
Electron 

quasibound in field of small black hole, 
1164 

spinning, Thomas precession of, 175-176 
Electron capture, in white-dwarf matter, 

619 

Elementary-particle experiments as tests of 
relativity theory, 1054f, 1060. See also 


under Conservation laws, energy- 
momentum 

Elements, abundances of. 765 

Elevator, 431. See also Uniqueness of free 
fall. Tide-producing acceleration 

Embedding diagrams 
general discussion, 613 
for a static, spherical star. 613-615, 617 
for Schwarzschild geometry. 837. 839, 528 
for a spherical, collapsing star, 855-856 
for Friedmann cosmological models, 723, 
725 

Energy-at-infinity 

in Schwarzschild geometry, 656ff 
in Kerr-Newman geometry, 898f, 910 

Energy in mechanics, as time rate of 
change of action, 486-487 

Energy of a particle, expressed as — p • i/. 

65 

Energy-momentum 
4-vector, 51, 53f. 68 
density of 

revealed by stress-energy tensor, 131 
3-form for, 151 
of gravitational field 

nonlocalizable in generic case, 466ff 
precisely localizable only for spherical 
systems, 603-604, 858f 
localizable only to within a wavelength 
for gravitational waves, 955f, 
964-966, 969f 

total, of a gravitating source 
in terms of asymptotic gravitational 
field, Chap. 19 

expressed as a flux integral, 461-464 
expressed as a volume integral, 

464-466 

conservation of. See under Conservation 
laws 

Entropy. See under Second law of 
thermodynamics 

Eotvos-Dicke experiments, 14-17, 1050-1055 
early Einstein w f ords on, 431 
implications for constancy of 

fundamental constants, 1061-1063 
for massive (self-gravitating) bodies, 

1127^1131 

Ephemeris for solar system (J.P.L.), 1095, 
1097 

Ephemeris second, 28 

Equation of structure, Cartan’s, 378 

Equations of motion 
derived from Einstein field equation, 
42-43, 471-480 

for bodies separated by distances large 
compared to their sizes 
“EIH” (post-Newtonian) for spherical 
bodies, 1091, 1094-1095 
deviations from geodesic motion, 

1120-1121, fl28 

Equations of state 

for nuclear and white-dw^arf matter, 

. 624-626 

for “cosmological fluid,” 713, 726 
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Equinoxes, precession of, 39If, 1112f 
Equivalence principle 
enunciated, 386f, 1060 
Einstein’s 1911 formulation of, 17 
bridge from special relativity to general 
relativity, 164, Chap. 16, 207 
out of spin-2 field theory, 425 
in Newton-Cartan theory, 297 
basis for affine parameter, 211, 250 
factor-ordering problems in, 388-391 
role in metric theories, 1067f 
tests of, 187-190, 1054-1063 
weak equivalence principle. See 
Uniqueness of free fall 
Ergosphere, 880 
Ether, 1051, 1064f 
Euclidean geometry, 19-22 
contrasted with Lorentz geometry, 51 
Euler Angles, 243 

Euler relation, on vertices, edges, faces, 

1175 

Euler equation of hydrodynamics 
in flat spacetime, 152f 
in curved spacetime, 564 
in PPN formalism, 1088 
applied to a pulsating star, 693-694 
applied to a collapsing star, 858 
Eulerian perturbations, 690-691 
Events, 6, 9f 
identifiability as key, 225 
as classical, not valid quantum concept, 
1184 

Expansion of universe. See under 
Cosmology 

“Expansion,” of a bundle of null rays, 582, 
1165 

“Expansion,” of a congruencVof world 
lines, 565f 

Experimental tests of general relativity 
in extenso, Chaps. 38, 39, 40 
Beall test of uniqueness of free fall, 17 
black holes, 1047 
catalogued, 1129 

constancy of fundamental constants, 
1061-1063 

cosmological observations used for, 707, 
1047, 1061, 1067 

deflection of electromagnetic waves by 
sun, 1048, 1069, esp. 1101-1105 
“de Sitter effects” in Earth-moon orbit, 
1116, 1119 

Earth’s failure to collapse, 398f 
Earth’s rotation rate, periodicities in, 
1124-1125 

Earth tides due to galaxy and to motion 
relative to preferred frame, 

1123-1124 

E0tv5s-Dicke experiment. See 
EotvOs-Dicke experiments 
ether-drift experiments, 1064-1065 
expansion of universe, 707 
geophysical observations, 1061, 1123-1125 
gyroscope precession, 1117-1120 
gravitational (Cavendish) constant. 


variations of. See under Cavendish 
gravitational constant 
gravitational waves, 1047, 1072 
Hughes-Drever experiment, 1064 
isotropy of space, 1064 
Kreuzer experiment, 1125 
laser ranging to moon, 1048, 1130-1131 
lunar orbit, 1048, 1116, 1119, 1127, 
1128-1131 

Newtonian experiments, 1067 
Nordtvedt effect, 1128-1131 
null experiments, 1050, 1064 
perihelion shift, esp . 1110-1116 
planetary orbits, deviations from geodesic 
motion, 1111, 1126-1131 
planetary orbits, periodic effects in, 1069, 
1111 

Pound-Rebka-Snider experiment, 
1056-1058 

preferred-frame effects, 1098, 1113-1114 
pulsars used for, 1047 
quasars used for, 1047, 1048, 1061, 1101, 
1103, 1104-1105 

radar time delay, 1048, 1103, esp. 
1106-1109 

redshift, gravitational See Redshift, 
gravitational 

redshift, due to “ether drift,” 1064-1065 
singularities in spacetime, existence of, 939 
Turner-Hill experiment, 1064-1065 
See also Parametrized post-Newtonian 
formalism, Dicke’s framework for 
analyzing experiments, Experimental 
tests of special relativity 
Experimental tests of special relativity, 
1054-1055 
Exterior calculus 

introduction to and detailed summary of, 
91-98 

application to electromagnetism, Chap. 4 
largely unaffected by presence or absence 
of metric, 233 

extended to vector- and tensor-valued 
forms, 348-363 

See also specific concepts , e.g., Forms, 
differential; Exterior derivative; 

Stokes theorem 
Exterior derivative 
introduced, for scalar fields, 93f 
as operation to augment the order of a 
form, 114-120 

applied twice in succession, automatically 
gives zero, 116, 118 
results of, 119 

extended to vector- and tensor-valued 
forms, 348-363, Chap. 15 
Exterior product. See Wedge product 
External field of a gravitating source. See 
Asymptotically flat spacetime geometry 
Extrema, number of, 318 
Extreme Kerr-Newman geometry, as 
limiting case of Kerr-Newman, 878 
Extremization, of integral for proper time, 
316-324 
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Extrinsic curvature. See under Curvature, 
formalism of 

Extrinsic time, of Kuchar and York, 487, 

490 

F 

Factor-ordering problems, 388-391 
Faraday. See under Electromagnetic field 
Faraday stresses, 140f, 481 
Fast-motion approximation, 1072-1073 
Fermat’s principle in a static gravitational 
field, 1106, 1108 

Fermi energy, in neutron stars and white 
dwarfs, 599-600 
Fermi gas, ideal, 565, 599 
Fermi normal coordinates, 332 
Fermi-Walker transport, 165, 170f, 1117 
Feynman’s sum over histories, 320, 419, 499f 
Field equations. See Einstein field 
equations 

Fields, long range (i.e., zero rest mass) 
spin of, deduced from transformation 
laws for polarization of waves, 954 
radiation fields must have / > S, 866, 977 
role in slightly nonspherical collapse of a 
star, 866 

direct coupling vs. indirect coupling, 
1063-1064 

direct coupling, experimental searches 
for, 1063-1065 

indirect coupling, 1068, 1069 
Final state of stellar evolution, 624. See 
also White dwarfs, Neutron stars, 

Black holes 

Fine-structure constant, electromagnetic, 
constancy of, 399, 1061 
First law of thermodynamics 
general formulation for a simple fluid, 
559-560 

for a fluid in adiabatic flow, 563 
in PPN formalism, 1088 
role in laws of hydrodynamics, 564 
application to pulsating stars, 692 
application-to collapsing stars, 858 
application to closed Friedmann universe 
705, 726AF 

Fixed-point theorem, 978 
Flatness 
test for, 30 

equivalent to zero Riemann curvature, 
283-284 

does not imply Euclidean topology, 284 
local, accompanied by global curvature 
190-191 

of space slices in Newton-Cartan 
spacetime, 291-295 

Flatness, asymptotic. See Asymptotically 
flat spacetime 

Flat spacetime. See Special relativity; 

Lorentz geometry 

“Foamlike” character of space, 419, 480, 
1190-1194, 1202 

Fluctuations, See Quantum fluctuations 
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Fluid. See Hydrodynamics 

Flux of particles. See Number-flux vector 

Flux of energy, defined, 782 

Focusing 

of null rays, 582f, 932, 1165 
See also Lens effect 
4-Force, Lorentz, 73 
Forms, differential 

list of all definitions and formulas, 91-98. 
Note: this list is not indexed here, 
since it itself is organized like an 
index! 

machinery for working with, illustrated in 
context of electromagnetism, Chap. 4 
as intersecting stacks of surfaces, 99-120 
ordered progression of (1-form, 

2-form, . . .), 114-120 
closed forms distinguished from general 
forms, 114-119 

operations on. See Duality, Exterior 
derivative, Integration 

1- forms 

motivated, 53, 55f 
defined, 56f 
illustrated, 55-58 

“corresponding” tangent vector, 58f, 

62, 310 

pictorial addition of, 57 

basis. See Basis 1-forms 

algebra of, for general basis, 202-203 

in metric-free context, 226, 231-233 

closed, 123 

curl-free, 123 

rotation-free, 123-124 

with rotation, 123 

2- forms 

as machines to construct “number of 
tubes” from oriented surface, 
105-107 
simple, 103 

general, expressible as sum of two 
simple 2-forms, 103, 122f 
basis 2-forms, in direct and dual 
labeling, 151 

used in description and calculation of 
curvature, 337-340, 348-363 
vector-valued and tensor-valued forms, 
348-363 and chapter 15 
Four-momentum. See under 
Energy-momentum 
Four-vector. See Vector 
Four-velocity. See Velocity 4-vector 
Friedmann cosmologies. See under 
Cosmological models 
Frobenius theorem, on rotation-free 
1-forms, 124 

Frozen star. See Black hole 
“Future of.” See Causal relationships 


Galaxies 

classification of, 786f, 789, 793, 795 
origin of, 766, 769f 


evolution of, 791 
distribution of, homogeneity vs. 
hierarchy, 703 

fraction of skv covered by, 799 
nuclei of 
explosions in. 634 
black holes in, 887 
relativistic star clusters in, 634, 687 
Galaxy, The (Milky Way), 756-761 
metric correction at, 459 
oscillations of star through disc, analyzed, 
318-319 

Galilean coordinates, 289. 291-298. 414 
Gamma-ray observations, as tests of 
cosmological models, 770 
Gauge transformations and invariance 
in electromagnetism. 89 
in linearized gravitation theory (flat-space 
spin-2 theory), 180, 182f, 440f, 463 
in perturbations of curved spacetime, 
967ff 

See also Lorentz gauge 
Gauss-Bonnet theorem, for 2-sphere 
topology, 309 

Gauss-Codazzi relations, 514ff 
Gauss-Weingarten equations, for 
4-transport 

in terms of extrinsic curvature, 512 
Gaussian flux integrals 

for energy-momentum and angular 
momentum, 460-464 
for charge, 461 

Gaussian normal coordinate system, 552, 

717 

Gauss’s theorem, 148-151 
as special case of generalized Stokes 
theorem, 97 

applied to conservation of 

energy-momentum, 146, 152 
General covariance, principle of, 80, 43If 
General relativity 

epitomized briefly, 130 (line 1). 164, 190f, 
266, 289 

foundations developed in detail, Chaps. 

16, 17 

See also Einstein field equation, 

Equivalence principle, Experimental 
tests of general relativity, 
Geometrodynamics 

Generating function, for transformation 
from one canonical representation of a 
2-form to another, 122-123 
Geodesics 

track-1 introduction to, 211 
track-2 treatment, in absence of metric, 
244-247 

affine parametrization of, 244-246 
as straight-on parallel transport, 245 
as straight lines of local Lorentz 
geometry, 312-315, 321-324 
as curves of extremal proper length, 
314-321, 324 

“dynamic” variational principle for, 322f 
one-parameter family of, 265-267 


can’t change from timelike to null or 
spacelike en route , 321 
simple examples 

great circle on sphere. 21 If 
straight line on plane, in polar 
coordinates. 213 

of specific manifolds. See under the 
manifold of interest 

as world lines of freely falling particles. 

4, 196 

as tools for building ideal rods and 
clocks, 396-399 
Geodesic deviation 
elementary introduction to, 29-37 
double role: defines curvature, predicts 
motion, 72 

equation of, presented in track-1 
language, 218ff 
equation of, derived, 265-275 
in spacetime of Newton-Cartan, 272f, 293 
in gravitational-wave detector. 444-445, 
950-955, 101 If 
Geodesic equation 211, 262ff 
Geodesic motion 
experimental tests of, 1055-1060 
departures from. See under Equations 
of motion 

Geodesic separation vector, 265-270 
Geometric objects, 48 
absolute vs. dynamic, and “no prior 
geometry,” 431 

spinor representation of, I154f 
See also specific objects , e.g ., Vectors, 
Forms, Connection 
Geometric units 
introduced, 27ff, 36 
factors of conversion to and from, 36, 

638, end papers 
Geometric optics 

as limiting case of physical optics, 412 
in extenso , for electromagnetic waves, 

570- 583 

basic references on, 570n. 
conditions for validity of, 571 
two-length-scale expansion underlying, 

571- 572 

basic concepts of, 571-582 
summarized, 578-580 
affine parameter of ray, 575 
angular frequency, 575 
bundle of rays, 581-582 
electric field, 579 

magnetic field, 579 5 

phase, 571, 572, 574-575 
photons, 580, 581 

polarization vector, 573, 574-575, 577, 
578, 581 

scalar amplitude, 573 
rays, 573, 574-575 
stress-energy tensor, 579 
wave vector, 573, 574-575 
laws of 

described qualitatively, 571 
summarized in detail, 578-580 
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photon interpretation of, 580 
derived from wave equation and 

Lorentz gauge condition for vector 
potential, 573, 576-577 
post-geo metric optics corrections, 572f, 
803 f 

in spinor language, 1165 
examples of applications of, 570 
geometry of a bundle of rays, 581-582 
focusing equation, 582f 
breakdown of, related to pair creation, 
803-804 

Geometrodynamics (dynamics of geometry) 
ideas of, in brief, 4f 
built-in plan: initial data plus time 
evolution, 408f, 484f 
some history of, 486-488 
analogies with electrodynamics, 364, 

$ 367-370 

causal propagation of effects in, 554 
Amowitt, Deser, Misner formulation of 
in brief, 486-490 
action principle in, 521 
geometrodynamic field momenta for, 
521 

3-geometry fixed at surfaces in, 522 
split of variables made by, 525-526 
electrodynamic analog, 522-524 
Dirac formulation of, 520 
subject to standard quantum 
indeterminism, 1182 
illustrated in action, for Schwarzschild 
geometry, 528 

See also Einstein field equation, General 
relativity, Initial value, Integrating 
forward in time 

Geometry. See specific types: "Spacetime, 
Euclidean, Lorentz, Differential, Affine, 
Riemannian, Prior. See also Curvature 
Geon, 886 

Global techniques of analyzing spacetime 
structure 

in extenso, Chap. 34 
basic references on, 916-917 
examples of, 926-931 
attempt to combine with local methods, 
806 

See also Infinity, regions of, in 

asymptotically flat spacetime; Causal 
structure of spacetime; Singularities 
in spacetime 
Globular clusters, 757 
black holes in, 887 
“Glory,” in particle scattering, 670 
Gowdy metrics, 804 
Gradient 

of a scalar, in flat spacetime, 59f 
of a tensor, in fiat spacetime, 8If 
in a curved manifold, 208-212, 259-261 
See also Exterior, derivative 
Gradiometer, gravity, 400-403 
Gravitation, 13, 163-164 
local description in terms of 

tide-producing acceleration, 29-37 


Gravitation-matter “coupling loop,” in brief, 
5, 37 

Gravitation, theories of 
catalogs of, 429 

criteria for viability of, 1066-1067 
Bergmann’s scalar-tensor theories, 1049 
BirkhofTs, 1067 

Cartan’s (general relativity plus torsion) 
1049, 1068 

Cartan-Newton. See Newton-Cartan 
theory of gravity 
Coleman’s, 1114 
completeness of, 1067, 1068 
conservative, 1093 

Dicke-Brans-Jordan, 1048f, 1068f, esp. 
1070, 1093, 1098, 1122, 1127, 1129 
cosmological models in, 770 
general relativity, foundations of, Chaps. 
16, 17 

Kustaanheimo’s, 1067 
linearized. See Linearized theory of 
gravity 

metric. See Metric theories of gravity 
metric, not encompassed by the 

10-parameter PPN formalism, 1069 
Newtonian. See Newton-Cartan theory of 
gravity 

Ni’s, 1068f, esp. 1070f, 1083, 1098, 1123 
1129 

Nordstrom’s, 429flf, 1049 
Papapetrou’s, 1124 
post-Newtonian. See Post-Newtonian 
approximation; Post-Newtonian 
formalism, parametrized 
preferred-frame, 1083, 1093, 1098, 
1123-1125 

prior-geometric, 429-431, 1068, 1070-1071 
self-consistency of, 1066-1067 
spin-0 field, in flat spacetime, I78f 
spin-1 field, in flat spacetime, 179 
spin-2 field, in flat spacetime. See 
Linearized theory of gravity 
Whitehead’s, 430, 1049, 1067, 1069, 1124 
Gravitational collapse. See Collapse, 
gravitational 
Gravitational constant 
value of, 29, endpapers 
measurement of, 1121, 1123 
as measure of “metric elasticity of 
space,” 426-428 

See also Cavendish gravitational constant 
“Gravitational field” in general relativity 
theory 

as term with many meanings and none, 
399f 

spacetime geometry as, 399-400 
metric as, 399f 

covariant derivative and connection 
coefficients as, 387, 399-400 
Riemann curvature as, 399-403 
contribution of, to standard stress-energy 
tensor, specifically excluded, 131 
Gravitational lens effect, 589, 887 
Gravitational mass, 431 


Gravitational potential. See under 

Newton-Cartan theory, Post-Newtonian 
formalism 

Gravitational radiation reaction. See 

Gravitational waves; radiation reaction 
Gravitational radius, 820-826. See also 
Horizon, Black hole, Schwarzschild 
geometry, Kerr-Newman geometry 
Gravitational-wave detectors 
conceivable types of 
Earth-moon separation, 1013, 1014, 

1018 

normal-mode vibrations of Earth and 
moon, 1013, 1015 

oscillations of Earth’s crust, 1013, 1015 
normal-mode vibrations of an elastic 
bar, 1013, 1016, 1025, 1035, 1038 
normal-mode vibrations of general 
elastic bodies, 1013, 1016, 1025, 

1028- 1035, 1041-1042 
angular accelerations of rotating bars 

(“heterodyne detector”), 1013, 
1016-1017 

angular accelerations of driven 
oscillators, 1013, 1017 
pumping of fluid in a rotating pipe, 
1013, 1018 

idealized vibrator (2 masses on a 
spring), 1022-1028 
beads on stick, 444f 
nonmechanical detectors, 1040 
electromagnetic waves in a toroidal 
wave guide, 1043-1044 
methods of analyzing (for mechanical 
detectors small compared to 
wavelength) 

proper reference frame of detector, 
1005-1006, 1010, 1012 
dynamic analysis: Newtonian equation 
of motion plus wave driving 
forces, 1006-1009 

driving forces of waves, 1006, 1009, 

1010 

line-of-force diagram, 1011-1012 
method of detailed balance, 1028, 

1029- 1030, 1033 

for noisy detector, 1019, 1036-1040 
detailed analysis of 

two freely falling bodies, 1018 
idealized vibrator (two masses on a 
spring), 1022-1028 
any resonant vibrator, analyzed by 
detailed balance, 1030, 1033 
any resonant vibrator, analyzed by 
dynamic method, 1031-1034 
noisy resonant vibrator (extraction of 
signal from noise), 1036-1040 
Earth vibrating in quadrupole mode, 
1035-1036 

electromagnetic waves in a toroidal 
wave guide, 1043-1044 
cross sections 

limits on usefulness of concept of cross 
section, 1019, 1022 
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Gravitational-wave detectors (continued) 
summary of ways to use. for 
wave-dominated detectors. 
1020-1021 

used to calculate total energy deposited 
in detector, 1027, 1028" 
use of, for noisy detectors. 1038-1039 
related to emission patterns, 1032-1033 
1035 

for idealized vibrator, 1024. 1025 
for any resonant, mechanical detector, 
1025, 1029, 1032 
for a Weber bar, 1025 
for a Weber bar in multimode 
operation, 1035 

for Earth in fundamental quadrupole 
mode, 1036 

thermally noisy detectors 

extraction of small signal from noise, 
1036-1040 

sensitivity of, to hammer-blow waves, 
1039 

ways to improve sensitivity, 1040 
sensors for monitoring displacements, 

1041, 1042 

prospects for the future, 1040ff 
Gravitational waves 
exact solutions 
cylindrical wave, 950 
plane waves with one state of 
polarization, 957-963. See also 
Plane gravitational waves 
plane waves with two polarization 
states, 964 

experimental tests of general relativity 
using, 1047, 1072 

generation by slow-motion, weak-field 
sources • 

nonexistence of monopole and dipole 
waves, 974-978 

waves are predominantly quadrupolar 
975-978 

assumptions underlying formulas, 989 
991 

formula for metric perturbation, 991 
formulas for emitted flux of energy 
and angular momentum, 992 
formulas for total output of energy and 
angular momentum, 975, 992 
formulas for radiation reaction in 
source, 993-994 

formulas for spectrum in various 
polarization states, 1033, 1035 
formulas specialized to impulse events 
987 

order-of-magnitude formulas for 
978-979, 980-981 
derivation of formulas, 995-1003 
role of “gravitational stresses” in 
generation, 996-998 
generation by strong-fleld sources, 
techniques for calculating 
particle falling into black hole, by 


perturbations of Schwarzschild 
metric. 982. 983 

vibrations of a relativistic star, by 

perturbations of equilibrium stellar 
structure, 984-985 

rotation of a deformed relativistic star, 
by perturbations of spherical 
stellar structure. 986 
initial-value solutions for, 536 
intensity and spectrum of waves that 
bathe Earth, estimate of. 986 
linearized theory of 
in extenso, 944-955 
Lorentz gauge condition. 944-945 
propagation equation. 945 
gauge transformations that maintain 
Lorentz gauge, 945 
plane-wave solutions, 945-946, 949, 
1004-1005 

transverse-traceless gauge 946-950 
methods to calculate transverse-traceless 
part. 948-949 
Riemann tensor, 948 
geodesic deviation, 950-955, 1011-1012 
relative accelerations are purely 
transverse, 951 
polarization, 952-955 
Fourier analysis of, 1026, 1027 
specific flux of, 1027 
monopole and dipole waves absolutely 
forbidden, 977, 978 
nonlinear interaction of waves with 
themselves 

nonexistence of precisely periodic 
waves, 956 

self-gravitational attraction, 957, 968 
wave-wave scattering, 968 
propagation through curved spacetime 
analogy with water waves on ocean 
993-994 

refraction of wave fronts (deflection of 
rays) by background curvature 
956, 968, 972 

gravitational redshift of frequency, 
956-957, 968 

backscatter off curvature, 957, 864-865, 
869-871 

tails due to interaction with 
background curvature, 957, 

864-865, 869-871 
shock fronts, 554 

shortwave formalism for, 964-973. See 
also Gravitational waves: 
shortwave formalism 
propagation equation, 967-968 
stress-energy tensor, 969-970. See also 
Gravitational waves: stress-energy 
tensor for 

geometric optics formalism, 971-972 
propagation of polarization tensor, 968, 
971 

spinor formalism for, 1165 

See also Gravitational waves: nonlinear 


interaction of waves with 
themselves 
radiation reaction 

order-of-maanitude formulas for. 979, 
981 

formalism for calculating, in weak- 
field, slow-motion sources, 993ff. 
lOOlff 

linked to outgoing-wave condition, 993. 
1001-1002 

forbids existence of exactly periodic 
waves, 956 

damping of neutron-star vibrations by r , 
620, 628, 984f 

evolution of binary-star orbits due to, 
988 

shortwave formalism 
“steady” coordinates, 964 
expansion parameters of, 964 
assumptions underlying, 964 
expansion of Ricci tensor, 964-965 
coarse-grain viewpoint vs. fine-grain 
viewpoint, 965 

propagation equation, 967-968 
gauge freedom, 967-969 
Lorentz gauge, 968 
transverse-traceless gauge, 969 
stress-energy tensor, 969-970. See also 
Gravitational waves: stress-energy 
tensor 

Brill-Hartle averaging process, 970 
geometric optics specialization, 971-972 
variational principle used to derive, 
972-973 
sources of 

astrophysical sources, 

order-of-magnitude formulas for, 
980-981 

big-bang origin of universe, 712, 
736-737, 764-765 

gravitational collapse of a star, 628, 

629, 1041 

supernova explosions, 982, 1040, 1042 
explosion of a star, 987 
collapses and explosions in Virgo 
cluster of galaxies, 1042 
vibrations of neutron star, 982-986 
rotation of a deformed neutron star 
(young pulsar), 628f, 983, 986, 1040 
binary stars, 986, 988-990, 995 
fall of matter into a black hole, 885, 

982, 983, 986 

collision and coalescence of black r 
holes, 886, 939, 982 * 

vibrations of a black hole, 886 
rotating steel beam, 979-980 
fission of an atomic nucleus, 987 
atomic bomb, 987 
meteorite striking earth, 987 
stress-energy tensor for 
elementary summary of, 955-956 
expressed in terms of metric 
perturbations, 969 
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expression for in traceless Lorentz 
gauge, 970 

gauge invariance of, 972 
expressed as an average of stress-energy 
pseudotensor, 972 
divergence vanishes, 970 
as source for background curvature of 
spacetime, 966, 973 
for geometric-op tics waves, 972 
for waves in nearly flat spacetime, 
955-956 

for exact plane wave, 963 
Gravitons, 972 

Gravity gradiometer, 400-403 
Group. See Rotation group; Lorentz group 
Group of motions, 652-653. See also 
Killing vector fields 
Gyroscopes 

employed in definition of Fermi-Walker 
transport, 165 

employed in constructing proper 
reference frame, 327, 330f 
precession of, as experimental test of 
general relativity, 1117-1120 
See also Dragging of inertial frames 
Gyromagnetic ratio, of Kerr-Newman black 
hole, 883, 892 

H 

HII regions in galaxies, 710, 761, 786f 
Hair on a billiard ball, 978 
Hair on a hole. See Kerr-Newman 
geometry, uniqueness of 
“Hammer-blow waves” defined, 1019 
Hamilton-Jacobi theory, 486ff, 641-649 
relation to quantum theory, 641-643 
for harmonic oscillator, 1194 
for free particle, 1194 
for test-particle motion 

in Newtonian M/r potential, 644-649 
in Schwarzschild gravitational field, 649 
in Kerr-Newman gravitational and 
electromagnetic fields, 900-901 
deflection of light by sun, in PPN 
formalism, 1102f 

perihelion shift in PPN formalism, 

1114f 

for electrodynamics, 1195 

for geometrodynamics in superspace, 

423f, 1180-1190 
Hamiltonian 

contrasted with super-Hamiltonian, for 
charged particle in field, 488-489 
electromagnetic, 497 
for test-particle in Newtonian 1/r 
potential, 644 

See also Super-Hamiltonian 
Hamiltonian, ADM, applied to mixmaster 
cosmology, 809 
Hamiltonian dynamics, 
in the language of forms, 125-126 
symplectic structure of, 126 


Hamilton’s principle for geodesic motion, 
654 

Harrison-Wheeler equation of state, 625 
Harrison-Wakano-Wheeler stellar models, 
625 ff, 696 

Hat product. See Wedge product 
Heat flow in general relativity 
references on, 559 
heat-flux 4-vector, 567 
law of thermal conductivity, 567 
in a stationary gravitational field, 568 
Hilbert’s variational principle. See 
Variational principle, Hilbert’s 
Histories, 
space of, 318-319 
democracy of, 320 
sum over, 320, 419, 499f 
“History of geometry,” defined, 418-419 
Holonomic basis, 204, 210, 239 
Homologous pulsations of a star, 697, 1079 
Honeycomb structure. See Forms 
Horizons, in black-hole physics 
definition of, 923-924 
global structure of (theorem), 924-925 
global structure analyzed, 926-931 
caustics of, 925 

generators of, 903-904, 925, 929-931, 932 
created by gravitational collapse, 862, 

863, 867, 924 

for Kerr-Newman geometry, 879ff 
angular velocity of, 914 
area of, 889, 914 
generators of, 903f 
Horizons, in cosmology, 815f 

in Friedmann cosmologies, 740ff, 815 
Hubble expansion rate, 709f 
history of knowledge of, 709-710, 

758-761 

expressed in terms of expansion factor 
a (/), 732 

distance-redshift relation used in 
measuring, 780-781 
relationship to other cosmological 
parameters, 771-773 
See also under Cosmological models, 
Cosmology 

Hughes-Drever experiment, 1064 
Hydrodynamics 

Newtonian, in absence of gravity, 152ff 
Newtonian, in presence of gravity, 387f, 
1077-1080 

post-Newtonian. See under PPN 
formalism 
general relativistic 
basic references, 562n, 568 
for a simple fluid with no heat flow or 
viscosity, 562-563 

for a fluid with viscosity and heat flow, 
567-568 

volume changes related to divergence 
of flow lines. 565 

gradient of 4-velocity resolved into 

4-acceleration, expansion, rotation. 


and shear, 566 
Euler equation, 564 
equilibrium in a stationary 
gravitational field, 566, 568 
interaction of charged matter with an 
electromagnetic field, 570 
See also Thermodynamics, laws of 
Hydrostatic equilibrium 
in any stationary gravitational field, 566 
in static, spherical star, 601-602, 605 
Oppenheimer-VolkoflF equation of, 605 
buoyant force, 606 
Hyperbolic motion of an accelerated 
observer, 166flF, I73f 

Hypersurface, spacelike. See Spacelike slice 


Imaginary time coordinate not used. 51 
Ideal gas, 139f 
Impact parameter 

for hyperbolic, Newtonian orbit, 647 
for hyperbolic orbit in Schwarzschild 
field, 670 

for photon in Schwarzschild field, 672 
for photon in PPN formalism, IlOlf 
Identity, as automatically fulfilled 
conservation law, 405 
Index, contravariant and covariant, 76 
Index manipulations 
in global Lorentz frames, 85 
in curved, Riemannian manifolds. 

201-204, 223f 
in affine manifolds, 225f 
in linearized theory, 436 
Induction, from electromagnetic 4-potential, 
122 

Inertia, 460 

Inertial forces, 165, 332 
in Newton-Cartan theory. 294 
Inertial frames, dragging of. See Dragging 
of inertial frames. 

Inertial guidance, 247 
Inertial mass, I59f, 431, 1051 
Inertial reference frame, local (= local 
Lorentz frame if orthonormal 
coordinates are used), I8f 
defined by uniform velocity of free test 
particles, 23 

used in analysis of tide-producing 
acceleration, 29-37 
mathematical representation of. See 
Ricmann normal coordinates 
See also Lorentz frame. local 
Infinitesimal Lorentz. transformation. See 
Lorentz transformation, infinitesimal 
Infinity, regions of. tn asymptotically flat 
spacetime 

l\ / . V\ 4 \ * defined. 917 918 
conformal transformations of, 919-9^1 
936 

conformally transformed coordinate 
diagrams. 9W 921 
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Initial-value data 

as uniquely determining future, Hilbert 
on, 434 

mystery' of what fixes them, 555 
formulation of, on characteristic 
hypersurface. 554 

Initial-value data for geometrodynamics 
in extenso, Chap. 21 
required for dynamics, 484-485 
compatible on spacelike slice, 489-490 
on characteristic surface, 490 
thin-sandwich conjecture, 534 
count of, 529-533 
time and dynamic data mixed in 
3-geometry, 533 

improperly posed data, 534-535 
separation of time and dynamic data. 533 
York’s formulation of 
sketched, 490 

on hypersurface of zero or constant 
extrinsic time, 539-540 
gives conformal 3-geometry, 540-541 
gives York’s curvature, 541 
gives conjugate York momenta, 542 

Initial-value equations for 

geometrodynamics, 515-516, 519, 525 
531-535 

Initial-value problem for geometrodynamics 
York’s formulation of 
wave equation for conformal factor, 

542 

existence and uniqueness of solutions, 

543 

thin-sandwich formulation of 
as option in specifying data, 529 
electrodynamic analog, 529 
as guide in counting degrees of 
freedom, 529-533 

as guide to geometrodynamics, 529-531 
time-symmetric case, 490 
formulated, 535 
role of base metric in, 535 
gravitational wave amplitude in, 536 
wave equation for conformal 
correction factor, 535 
time-anti symmetric case, 490 
formulated, 536 
wave equation for conformal 
correction factor, 536 
mass of wave is positive, 536 
other symmetric cases 
Friedmann universe, 537, 705 f, 727f 
744 

mixmaster universe, 537 , 806-811 
waves with cylindrical symmetry, 537 
waves with spherical symmetry, 537 
pulsating star, 691-694 
as route to cosmology, 537 
See also Geometrodynamics, Integrating 
forward in time 

Initial-value theory for electrodynamics, 

523 f, 526, 529flf. See also 
Electrodynamics, Integrating forward 
in time 


Injection energy, 561, 562 
Integral conservation law, 146 
Integrating forward in time 
geometrodynamic equation 
statement of initial data in, 526-527 
options in choice of lapse and shift, 
527-528 

compared to electrodynamics, 527-528 
Maxwell’s equations 
statement of initial data in, 527 
options in choice of potential, 527 
as guide to geometrodynamics, 527 
See also Electrodynamics, 

Geometrodynamics, Initial value 
Integration 

of differential forms, 94-97, 150f 
of tensors, in track -1 language, 147ff 
See also Stokes’ theorem, Gauss’s 
theorem, Volume 

Interference, constructive and destructive, 
419, 423f, 1185-1187 

Interferometry, used to measure deflection 
of radio waves by sun, 1104-1105 
Intergalactic matter, mean density of, 712, 

761 f 

Interval, Lorentz, 19-23 
Intrinsic curvature. See under Curvature, 
formalism of 

Intrinsic time of Sharp, Baierlein, and 
Wheeler, 487, 490 
Invariants 

of electromagnetic field, 110, 480-483 
of Riemann tensor, 491 
Irreducible mass of a black hole, 889f, 913 
Isolated system, 454 

Isometry, 652-653. See also Killing vector 
fields 

Isostasy, 402 

Isothermal star clusters, 685ff 
Isotropic coordinates 
for a star, 595 

for Schwarzschild geometry, 840 
in post-Newtonian approximation, 1097 
Isotropy and homogeneity of universe 
in extenso, Chap. 30 
man could not exist in an anisotropic 
universe, 939 

adiabatic cooling of anisotropy, 802 
viscous dissipation of anisotropv, 769, 

802- 803 

pair creation by anisotropy energy, 769, 

803- 804 

See also Cosmological models; Cosmology 
Isotropy implies homogeneity, 715, 723 


J 

Jacobi identity, for commutators, 240 
Jacobian, 93, 148, 160f 
Jacob’s ladder. See Schild’s ladder 
Jeans instability, 757 
Junction conditions, 490 
from electrodynamics as guide, 551 


relevant components of Einstein field 
equation, 552 

surface stress-energy tensor, 552-553 
intrinsic geometry continuous, 553 
extrinsic curvature may jump, 554 
across null surface, 554 
and motion of surface layer, 555 
applied to collapsing shell of dust, 

555-556 

applied to surface of a collapsing star, 
852-853 

Jupiter, motion of satellites, 637 

K 

K’ai-feng observatory, ii 
Kasner cosmological model, 801, 805ff 
Keplerian orbits in Newtonian field of a 
point mass, 647-649 
analyzed using Hamilton-Jacobi theory, 
644-649 

effective potential for, 661 
“Kepler density” from satellite period, 44 
Kepler’s laws, 
discovery by Kepler, 755 
“1-2-3” law, 39, 450, 457 
Kernel, of wave operator, 121 
Kerr coordinates, 879f 
Kerr diagram, 881 
Kerr geometry, as limiting case of 
Kerr-Newman, 878 
Kerr-New'man geometry and 
electromagnetic field 
history of, 877n 

parameters of (M, Q, S , or a), 878 
limiting cases (Schwarzschild, 

Reissrier-Nordstrom, Kerr, extreme 
Kerr-Newman), 878 
uniqueness as external field of a black 
hole 

heuristic explanation of uniqueness, 

875, 877 

theorems implying uniqueness, 876, 

938, 939 

implications for realistic gravitational 
collapse, 863 

Boyer-Lindquist coordinates 
metric, 877, 878 

electromagnetic field tensor, 877, 878 
vector potential, 898 
pathology of, at horizon, 880 
Kerr coordinates 
electromagnetic field tensor, 879 
metric, 879 5 

transformation between Kerr and Boyer- 
Lindquist coordinates, 879f 
Kerr-Schild coordinates, 903 
stationary observers, 893-894 
locally nonrotating observers, 895-896 
Kerr diagram for, 881 
maximal analytic extension of, 882 
Killing vectors, 879, 892ff 
Killing tensor, 893 
principal null congruences, 901-904 
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light cones, 891, 896-897 
electromagnetic-field structure, 877flf, 883, 
892 

magnetic dipole moment, 883, 892 
multipole moments of, 883, 892 
horizon, 879ff 
null generators of, 903-904 
area of, 889, 914 
angular velocity of, 914 
rotational properties 
intrinsic angular momentum vector, 

891 

gyromagnetic ratio, 883, 892 
static limit, 879ff, 894 
ergosphere, 880 

dragging of inertial frames, 879ff, 
893-896 

dynamic properties 

role as endpoint of gravitational 
collapse, 882-883 

stability against small perturbations, 
884-885 

change of M, Q, S when particles fall 
into horizon, 904-910, 913 
reversible and irreversible 

transformations of, 889-890 
rotational energy of, 890 
electromagnetic energy of, 890 
irreducible mass, 889-890, 913 
test-particle motion in, 
super-Hamiltonian for, 897 
energy-at-inflnity, 898-899, 910 
axial component of angular 
momentum, 898-899 
rest-mass of particle, 899 
Carter’s fourth constant of the motion, 
£ or 3T, 899 

equations of motion in separated form, 
899-900, 901 

Hamilton-Jacobi derivation of 
equations of motion, 900-901 
orbits in equatorial plane, 911-912 
effective potential for equatorial 
motion, 911 

binding energy of last stable circular 
orbit, 885, 911 

wave propagation in, 914-915 
Kerr-Schild coordinates, 903 
Killing vector fields, 650-653 
associated conservation laws for 
test-particle motion, 651 
commutator of is Killing vector, 654 
eigenvalue problem for finding, 654 
for flat spacetime, 654 
for spherically symmetric manifolds, 658 
for Kerr-Newman geometry, 879, 892ff 
Killing’s equation, 650 
Killing tensor field, 893n 
Kinetic theory in curved spacetime 
in extenso, 583-590 
basic laws 

Liouville’s theorem for noninteracting 
particles in curved spacetime, 584, 
586-587, 590 


collisionless Boltzmann equation 
(kinetic equation), 587, 590. 
specialized to photons, 587-589 
basic concepts 
mass hyperboloid, 585 
momentum space, 583ff, 590 
phase space, 584f, 590 
volume in phase space, 584-587, 590 
distribution function (number density 
in phase space) defined, 583f, 590 
applications, 583 

elementary expression for pressure, 
139-140 

stress-energy tensor as integral over 
momentum space, 589f 
photons, 587ff 

relativistic star clusters, 679-687 
computation of optical appearance of a 
collapsing star, 850 
Klein-Alfven cosmology, 748, 770 
Kronecker delta, 22 

Kruskal diagrams, 528, 834f, 839, 848, 855 
Kruskal-Szekeres coordinates for 
Schwarzschild geometry, 828-832 
metric in, 827 

relationship to Schwarzschild coordinates, 
833-835 


L 

Lagrangian perturbations, 690-691 
Lamb-Retherford shift, principal 
mechanism, 1190 

Landau-Lifshitz pseudotensor. See 
Pseudotensor 

Laplace operator, vs. d’Alembertian, 177 
Lapse function 
as Lagrange multiplier, 487 
metric interval as fixed by, 507 
covariant and contravariant forms of, 
507-508 

award of arbitrariness in, reversed, 532 
variational principle for, 538 
Laser ranging to moon, 1048, 1130f 
Lattice. See Clocks; Rods 
Laws of physics in curved spacetime, 
384-393. See also specific lam , e.g ., 
Kinetic theory, Hydrodynamics, 
Conservation laws 
Leap second, 28 
Least action, principle of 
applied in elementary Hamiltonian 
mechanics, 125-126 
related to extremal time, 315-324 
Lens effect, 589, 795f, 887 
Levi-Civita tensor 
in flat spacetime, 87f 
orientation of, 87f 
in general basis, 202, 207 
in spherical coordinates, 206 
Lie derivative 
of a vector, 240 
of a tensor, 517 


independent of any affine connection, 

517 

Lie groups, 198 

Lie transport law, 240 

Light, bending of. See Deflection of light. 

Light cone 

characterization of advanced and 
retarded potentials, 122 
Newton-Cartan vs. Einstein difference, 
297 

See also Causal relationships 
Line element. See Metric 
Lines of force 

relation to honeycomb structure, 102 
never end, as core of Maxwell's 
equations, 420 

diagram for gravitational waves, 101 If 
Linearized theory’ of gravity {same as 
Spin-2 theory in flat spacetime) 
equivalence of the two theories spelled 
out, 435 

presentation from spin-2 viewpoint. 
179-186 

presentation as linearized limit of 

general relativity. Chap. 18. 448-451. 
461-464, 944-955 
sketched, 435 
bar operation, 436-438 
field equations, 437-438. 46It 
formula for metric, 438 
gauge transformations. 43K 441 
gauge invariance of Riemunn 
curvature. 438 
Lorentz gauge, 438. 441 
global Lorentz transformations. 439 
curvilinear coordinates. 441 
effect of gravity on matter and 
photons, 442-444 

self-inconsistency of. 180. 1K6. 443f 
complete repair of, leads to general 
relativity, 186, 4241 

partial repair for alow-motion s\vtcim 
leads to Newtonian and 
post-Ne\Uoman formalism*. 
1073-1078, I089f 
applications 

external field of static spherical h*nk. 
438 

external field of any source, 44 8 4M. 
461-464 

bending of light, rtddufl, pcnhchon 
advance, I83ff, 446 
gravitational wasev l8Sf, 442. 444i 
944 955. See m+Jer 
Gravitational ***** 

Liouvillc’s theorem, 5M 
Local physics *s simple ph>**v 4 W 
29f * 

Local inertia! frame. See Inefual tune, 
local 

Local Lorenu frame See lotent: fume, 
local 

Locally nonrouung ot*enerv 
Lorenu contract***. 4* 
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Lorentz force law 
compared equation of geodesic 
deviation, 35 

formulated, in flat spacetime, 73 
energy change associated with, 73 
double role: defines fields and predicts 
motions, 71-74 

in language of forms, 101-104 
in language of energy-momentum 
conservation, 155 
in curved spacetime, 201, 568 
for a continuous medium, 570 
derived from Einstein’s field equations, 
473-475 

in three languages, 474 
Lorentz frame, local 
closest to global Lorentz frame, 207 
mathematical representations of, 217f, 

285 ff, 314f 

straight lines are geodesics of curved 
spacetime, 312-324 

evidences for acceleration relative to, 327 
used to analyze redshift experiments, 
1056-1060 

See also Inertial frame, local 
Lorentz gauge. See Gauge transformations 
and invariance 

Lorentz geometry, global, 19-23 
contrasted with Euclidean geometry, 51 
spacetime possesses, if and only if 
Riemann vanishes, 284 
Lorentz group, 242 

Lorentz invariance, experimental tests of, 
1054f 

Lorentz transformations, 66-69 
key points, 67f 
matrix description of, 66 
way to remember index positions, 66 
velocity parameter in. 67 
boost, 67, 69 

rotation in a coordinate plane, 67 
infinitesimal 

antisymmetric matrix for, 171 
generator of, 329 

special case: boost along coordinate 
axis, 80 

in spin-matrix language, 1142-1145 
velocity parameter, 1145 
post-Newtonian limit of, 1086 
used to annul Poynting flux, 122 
See also Rotations 

Lowering indices. See Index manipulations 
Lunar orbit, experimental tests of general 
relativity using, 1048, 1116, 1119, 
1127-1131 

M 

Machine with slots. See under Covariant 
derivative, Metric, Tensor 
Mach’s principle, 490, 543-545 
acceleration relative to distant stars, 543 
and York’s formulation of initial-value 
problem, 546 


gives inertia here in terms of mass there, 
546 

and Foucault pendulum, 547 
and dragging of inertial frames, 547. 

See also Dragging of inertial frames 
drasgine analogous to magnetic effect, 
54ff 

inertial influence of distant stars, 548 
sum-for-inertia in, 549 
“flat” space as part of closed space in. 
549 

Magnetic flux, from integration of 
Faraday, 99-101 
Magnetic poles, absence of. 80 
Magnetostatics, plus covariance, gives 
magnetodynamics. 80, 106 
“Magnitude, absolute,” defined, 786 
“Magnitude, apparent.” defined, 782 
Magnitude-redshift relation. See under 
Cosmology, observational probes of 
standard model 

Manifold, differentiable, 10, 13, 24Iff 
Many-fingered time, and arbitrariness in 
slice through spacetime, 7l3f, 1184 
Mass 

active vs. passive. See Cavendish 
gravitational constant 
center of, 161 

experimental, finite, as difference 
between two infinities, 474-475 
inertial, density of, 159f 
inertial vs. gravitational, 431, 1051 See 
also Uniqueness of free fall 
“Mass-energy inside radius f,” 602ff, 858f 
Mass-energy, density of See Stress-energy 
tensor 

Mass-energy, total, of an isolated, 
gravitating system (= “active 
gravitational mass”) 
defined by rate metric approaches 
flatness 

in ex ten so. Chap. 19 
in linearized theory, 448-450 
in general, 453, 455 
no meaning of. for closed universe, 
457-459 

as geometric object residing in 

asymptotically flat spacetime, 453 
measured via Kepler’s 1-2-3 law, 450, 

457, 636ff 

contribution of gravitational field to, 467 
not localizable in generic case, 466ff 
precisely localizable only for spherical 
systems, 603f, 803f 

localizable to within a wavelength for 
gravitational waves, 955f, 964ff, 
969f 

conservation law for, 455, 468-471 
See also under Energy-momentum 
Mass hyperboloid, 585 
Matter in universe, luminous, mean density 
of, 71 Of, 761 

Matrix, inverse, explicit expression for, 161. 
See also Jacobian, Determinant 


Maxima, number of, 318 
Maximal analytic extension of a geometry. 
882 

Maxwell, dual 2-form representation of 
electromagnetic field, introduced, 105. 
See under Electromagnetic field 
Maxwell energy density, 140-141 
Maxwell's equations 

component version in flat spacetime, 80f 
geometric version, 88-89 
in language of forms, 112-114 
solution for particle in an arbitrary state 
of motion, 121-122 
in curved spacetime, 391, 568 
for vector potential, 569 
deduced from “lines of force end only 
on charge,” 79-81 

derived from physics on a spacelike slice, 
419-420 

derived from stress-energy and Einstein 
field equation, 471-473 
and conservation of energy-momentum, 
483 

nowhere failing, 1200 
See also Electrodynamics, Lorentz force 
law 

Mean eccentric anomaly, 648 
Measurability of geometry and fields in 
classical theory, 13 

Measurement, possibilities defined by 
theory, 1184 

Measuring rods. See Rods 
Mercury, perihelion precession of. See 
Perihelion shift 

Meshing of local Lorentz frames, 190-191 
Metric 

distilled from distances, 306-309 
descriptions 

summarized, 77, 305, 3 lOf 
as machine with slots, 22, 51-53, 77, 
305, 3lOf 

in component language, 77, 310f 
in terms of basis 1-forms, 77, 310 
as line element, 77, 305, 310 
introduced and defined, 22 
components of 

in arbitrary basis, 201, 3 lOf 
in Euclidean coordinates, 22 
in Lorentz coordinates, 22, 53 
determinant of components 
defined, 202 

differentiated, gives contraction of 
connection coefficients, 222 5 
variation of, 503 

computation of connection coefficients 
from, 210 , 216 

compatibility with covariant derivative, 
313ff, 353f 

structure, and symplectic structure, 126 
enters electromagnetism only in concept 
of duality, 105, 114 
role in spacetime of general relativity 
measured by light signals and free 
particles, 324 
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as “gravitational field,” 399-400 
test for local Lorentz character, 

311-312 

components not all predicted by 
geometrodynamic law, 409 
role in Newton-Cartan spacetime, 300, 
302 

coefficients in specific manifolds and 
frames. See specific manifolds , e.g.. 
Sphere, 3-dimensional; 

Schwarzschild geometry, or 
coordinate systems, e.g., Kerr-Schild 
coordinates 

Metric elasticity of space, 426-428 
Metric theories of gravity, 1067ff 
experiments to test whether the correct 
theory is metric, Chap. 38, 1067 
PPN formalism as approximation to, 

1069 

Microwave radiation. See Cosmic 
microwave radiation. 

Minima, number of, 318 
Minkowski geometry. See Lorentz geometry 
Missing matter, “mystery of,” 710. See also 
under Cosmology 
Mixmaster universe, 805-814. 

Mixmaster oscillations damp chaos, 769 
Mdbius strip, 96 

Moment of inertia tensor defined, 977 
Moment of rotation 
as meaning of Einstein curvature, 

373-377 

conservation of, 378ff, 473 
“Moment of time” means “spacelike 
hypersurface,” 713-714, 1184 
Momentum, in mechanics, as space rate of 
change of action, 486-487 
Momentum field, electromagnetic, 497f, 524 
Momentum space, 583ff, 590 
Momentum vector. See Energy-momentum 
4-vector 
Moon 

effect on tides, 44 
shadow on Earth, 24-26 
laser ranging to, 1048, 1130f 
orbit of, tests of general relativity using, 
1048, 1116, 1119, 1127-1131 
separation from Earth as 

gravitational-wave detector, 1013f, 
1018 

Morse theory, 318 
Mdssbauer effect, 63, 1056, 1057 
Motion. See Equations of motion 
Moving frame. See Tetrad 
Multicomponent fluid, 558 
Multipole expansion of Newtonian 
potential, 991 

Multipole moments of Kerr-Newman black 
hole, 883, 892 

N 

Near zone for radiation theory’, 997, 
999-1000 


Neutral stability, 697 
“Neutral relationship to.” See Causal 
relationships 
Neutrinos 

emitted in stellar collapse, 599 
transport energy in collapsing star, 628 
redshift when emitted by a collapsing 
star, 850 

from big-bang, 712, 736-737, 764-765 
damp anisotropy of expansion, 803 
formalism for analyzing in curved 
spacetime, 1164 
Neutron-star matter, 599 
idealized as simple fluid, 558 
equations of state for, 624-626 
Neutron stars, 619f, 622 
models for, 625-627 
mass limits, 627 
rotation of, 628 

as source of gravitational waves, 983, 
986, 1040 
pulsation of, 628 
as source of gravitational waves, 

982-986 

stability of 626-627, 696 
creation by stellar collapse, 627-629 
Newman-Penrose constants, 870f 
Newton-Cartan theory of gravity 
contrasted with Einstein’s theory, 3ff, 197, 
245, 297f, 302f 

incompatibility with special relativity, 

177, 304 

standard Newtonian formulation of 
in brief, 177, 301 

as approximation to general relativity, 
412-416 

as approximation to metric theories of 
gravity, 1077f, 1097 
useful formulas and computational 
techniques, 1078f 
virial theorems, 1079 
stress tensor for Newtonian 
gravitational field, 1078f 
Cartan’s curved-spacetime formulation of 
in extenso. Chap. 12 
in brief, in language of Galilean 
coordinates, 289f 

in brief, in coordinate-free language, 
300f 

transition between languages, 298f 
transition between Newton formulation 
and Cartan formulation, 289f, 299 
Noise, extraction of signals from, 10361T. See 
also Gravitational-wave detectors 
Nonlocalizability of gravitational energy. 

See under Mass-energy, total; 
Energyrmomentum; Pseudotensor 
Nonorientable surface, 96 
Nordstrom’s theories of gravity. See under 
Gravitation, theories of 
Nordtvedt effect, 1128-1131 
Norm of a /?-form, 97 
Normal, unit normal in terms of lapse and 
shift, 508 


Normal coordinate system, 1055 
Nothing, as foundation of everything 
Leibniz on, 1219 

geometrodynamics as early model for 
1202-1203 

calculus of propositions as a later model 
for, 1209, 1211-1212 
Novikov coordinates, 826f 
Nuclear burning in stars 

drives pulsational instability, 632 
HCNO cycle, 632 
catalyzed to endpoint, 624-626 
formulation of thermodynamics in 
presence of, 558 
Number-flux vector 
introduced, 138-139 
for photons in geometric optics limit, 

580 

Number-flux relation, in cosmology, 798 
Number space, 241 

Nuclear matter. See Neutron-star matter 
Nucleosynthesis in big bang, 760, 762 
Null experiments, 1050, 1064 

O 

Observational cosmology, Chap. 29. See 
under Cosmology 

One-form. See under Forms, differential 
Optics. See Geometric optics 
Olber’s paradox, 756 
Oppenheimer-Volkoff equation of 
hydrostatic equilibrium, 605 
Oppenheimer-Snyder model for collapsing 
star, 851-856 

Orbit, See Keplerian orbits. Planetary orbits. 
See also geometry in which the 
orbits occur, e.g., Schwarzschild 
geometry 
Orientation 

of space, embodied in duality operation, 
97 

of forms 

1- form, illustrated, 55 

2- form, illustrated, 100, 104, 107, 109. 

116 

3- form, 117 

of volumes, 133, 135f, 147-150 
relative, of domain and its boundary. % 

P 

Pair production 
by photon, 70 
at high temperatures, 558 
by tidal gravitational forces. 750. 769. 

803f, 816 

damps anisotropies of geometry. '69, 
803f 

Pairs 

free-fall of, experimental lest, 1051 
in early stages of standard «miicI»£hmI 
model. 736f, 764 
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Parallax, 757 
Parallel transport 
in brief, 208f 
in extenso , 245-263 
equation of, summarized, 224 
illustrated, 209, 212 
See also Fermi-Walker transport 
Parallelepiped, trivector and 1-form 
representation of, 133. 135-136 
Parametrization, of geodesic, 244-246 
Parametrized post-Newtonian formalism 
history of, 1049, 1073 
described qualitatively, 1049, 1068f, I072f 
summary of technical details, 1092 
notation, 1073f, 1092f 
accuracy and realm of validity, 1069, 
1072f, 1075 

metric theories encompassed by and not 
encompassed by, 1069 
parameters 

described qualitatively. 1069, 1072 
defined precisely, 1080f 
translated from one convention to 
another, 1093 

values for several theories, 1072 
parameters, experimental limits on 
y, 1103, 1105, 1108f 
ft 1111, 1113 

« 3 -4A-2y- 2-J, 1114 
«2 - 4 + ? - 1, 1124 
a 3 = 4ft - 2y - 2 - f, 1125 
foundations of 

coordinates of, 1073f, 1082-1087, 1089, 
1091, 1097 

expansion parameters and their 
magnitudes in the solar system, 
1068, 1075 

radiation, zone excluded from, 1075 
time derivatives small compared to 
space derivatives. 1075 
shear stresses typically negligible, 1074f 
expansion procedure, 1075ff 
metric coefficients, pattern of, 1076f, 
1080, 1100 

description of matter 
thermodynamic functions in, 1074f 
velocity of matter, 1073f, 1086 
transformation between coordinate 
frame and rest frame of matter, 
1087 

stress-energy tensor, 1086f 
matter generates gravity 
gravitational potentials (functions 
appearing in metric), 1080f, 1085 
nonlinear superposition of gravitational 
fields, 1096 

identities relating potentials to each 
other, 1082, 1089 

metric coefficients, precise form of, 
1084f 

Christoffel symbols, 1089 
equations of motion for matter, 1087ff 
baryon (rest mass) conservation, 1088 
energy conservation law, 1088 


Euler equation. 1088 
post-Galilean transformations, and 
invariance, 1085 

velocity of coordinate frame relative to 
universal preferred frame, 1083f, 
1098, 1114 
applications of 

total mass-energy of a body calculated, 
1091, 1094^1099, 1125f 
gravitational field of isolated, spherical 
sun, 1097 ff 

gravitational field of sun with 
quadrupole moment, 1115 
gravitational field of rotating Earth. 
1119 

why high-speed particle morion probes 
only the parameter y, 1099ff 
propagation of light and radio waves, 
1099-1109 

deflection of electromagnetic waves by 
sun, llOlff 

radar time-delay in sun’s gravitational 
field, 1103, 1106-1109° 
many-body (“EIH”) equations of 
motion, 1091, 1094f 
equation of motion for a spinning 
body, 1120f 

perihelion shift, 1110-1116 
three-body effects in lunar orbit, 1116 
precession of a gyroscope, 1117-1120 
Cavendish gravitational constant 
derived, 1125f 

Partial differential equations, 
applications of differential geometry to, 
198 

rationale of analyzing, 485 
Particle-physics experiments as tests of 
special relativity, 1054f, 1060. See also 
under Conservation laws. 

Particles. See Pairs; Conservation laws 
Passive vs. active transformations, 1140 
“Past of.” See Causal relationships 
Path integral. See Feynman’s sum over 
histories 

Pauli principle, as test of Riemannian 
geometry, 398-399 

“Peeling theorem,” in radiation theory, 

1165 

Perfect cosmological principle, 745 
Perfect fluid 
defined, 132, 140 
stress-energy tensor for, 132, 140 
See also Hydrodynamics 
Perihelion shift, 39If 
for nearly circular orbits in exact 
Schwarzschild geometry, 670 
in post-Newtonian limit of general 
relativity, 1110-1116 
in PPN formalism, 1110-1116 
in linearized (spin-2) theory, 183f, 446 
in spin-0 and spin-1 theories of gravity, 
179 

observational data on, 1112f 
. Einstein on, 433 


Permutation tensor (same as alternating 
tensor), 126, 128f, 207, 343 
Perturbation theory for spacetime geometry 
general formalism 

connection coefficients in terms of 
metric perturbation, 966-967 
curvature tensors in terms of metric 
perturbations, 966-967 
action principle for metric 

perturbations, in vacuum, 972f 
gauge transformations, 967ff 
stress-energy of metric perturbations in 
shortwave limit, 969 
applications 

shortwave approximation for 
gravitational waves, 964-973 
pulsation of relativistic stars, 688-699 
slow rotation of a star, 699 
to Friedmann cosmology, 800f 
to collapsing star, 844ff 
stability analyses of Schwarzschild and 
Kerr holes, 884f 

Petrov-Pirani classification of spacetimes, 

902 

Phase, of de Broglie wave, 53-55 
Phase, in geometric optics, 57If, 574f 
Phase space, 126, 584f, 590 
Photons 

splitting, forbidden for plane wave, 70 
world lines of, 388 

kinetic-theory description of, 587-589 
in geometric optics, 580 
Physical optics, correspondence with 
geometric optics, 412 

Piercing of surfaces, of a form. 55f, 60, 99, 
202, 231 

Piezoelectric "strain transducer, 401 
Pit in the potential, 636-637 
Planck length 
defined, 10 

relevance to fluctuations in geometry, 10, 
1180, 1192ff 

Plane electromagnetic waves in curved 
spacetime, 961-962 
Plane gravitational waves, exact 
form of metric, 957 

field equations and solution fbr a pulse 
of waves, 958-959 
linearized limit of, 958 
Riemann curvature of, 959 
global structure of spacetime, 958-960 
effect on test particles, 960-961 
comparison with exact electromagnetic 
plane wave, 961-963 $ 

stress-energy of, 963 

in language of shortwave approximation, 
962-963 

Plane gravitational waves in linearized 
theory, 945f, 949, 1004ff 
Planetary orbits 

periodic relativistic effects in, 1009, 1011 
deviations from geodesic motion, 1111, 
1126-1131 

See also Keplerian orbits, Perihelion shift 
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Plateau, problem of, 877 
Poincare transformation, 68 
Positive sense. See Orientation 
Poisson bracket, 654 
generalized, 486 

Poisson’s equation for Newtonian 

gravitational potential, 290, 299, 301 
Polarization of a gravitational wave 
tensors defined 953f, 971 
plane (linear), 952f 
circular, 953f 
elliptical, 955 

compared with that of an electromagnetic 
wave, 952-954 

rotational transformation of states, 954 
parallel transport of, in geometric-optics 
limit, 971 

line-of-force diagram, 1011-1012 
Polarization of a neutrino, 954 
Polarization of radiation fields with 
arbitrary spin, 954-955 
Polarization vector for electromagnetic 
waves, 573fT, 577f, 581 
Post-Galilean transformations, 1085 
Post-Newtonian approximation to general 
relativity, 1069 

obtained from PPN formalism, 1073 
derived by post-Newtonian expansion of 
field equations, 1089f 
stellar structure and stability analyzed 
using, 1073 

many-body (“EIH”) equations of motion, 
1091, 1094f 

See also Parametrized post-Newtonian 
formalism 

Post-Newtonian expansion procedure, 

1075ff. See also Parametrized 
post-Newtonian formalism." 
Post-post-Newtonian approximation, 1069, 
1077 

Post 5/2 -Newtonian approximation and its 
relationship to radiation damping, 1077 
Potentials, effective. See Effective potentials 
Pound-Rebka-Snider experiment, 1056ff 
Poynting flux, 122, 140f, 481, 550 
Precession 

of perihelion. See Perihelion 
of spin axis. See under Spinning body 
Precession component, of Einstein field, 547 
Preferred-frame theories of gravity, 1083, 
1093, 1098, 1123 ff 

Preferred-frame effects, experiments to 
search for, 1098, 1113f 
Pregeo metry, 1203-1212 passim 
as calculus of propositions, 1208-1209, 
1211-1212 

Pressure, in stress-energy tensor for a 
perfect fluid, 132 
Price’s theorem, 863, 866 
Primordial fireball. See Cosmic microwave 
radiation 

Principal null congruences of Weyl tensor 
defined, 902 

for Kerr-Newman geometry, 901-904 


Prior geometry, 429ff, 1068, 1070f 
Probability amplitude 
for a history, 419 

phase of, given by action, 486, 491 
Projection operator for transverse-traceless 
part of a tensor, 948 
Projection tensors, 565f 
Propagator, mentioned, 120 
Proper distance. See Interval, Lorentz 
Proper reference frame of an accelerated 
(or unaccelerated) observer 
constructed, 327-332 
metric, 33If 

connection coefficients, 330f 
inertial and Coriolis forces, 332 
applied to definition of thermodynamic 
potentials, 557f 

applied to analysis of gravitational-wave 
detector, 1005-1010, 1012 
Proper time. See Interval, Lorentz 
Pseudotensors of stress-energy for 
gravitational field, 465f 
do not localize gravitational energy, 466f 
order of magnitude of, 996, 999f 
used in analyzing generation of 
gravitational waves, 996-999 
for waves, averaging gives stress-energy 
tensor, 972 

Pulsars 

discovery of, 620, 762 
theory of, 628, 630 

timing data as a probe of neutron-star 
structure, 628, 630 

experimental tests of general relativity 
using, 1047 

in idealized experiment on “prior 
geometry,” 430 
See also Neutron stars 


Q 

Q of an oscillator, 1025 
Quadrupole-moment parameter for sun, J 2 , 
1112f, 1115 

Quadrupole moment, 977 
coupling to curvature produces 
departures from geodesic motion, 
476-480 

precession of spin axis, 391f 
reduced, 977 

as integral over mass distribution, 975, 
977 

as trace-free part of second moment of 
mass distribution, 977 
as coefficient in 1/r expansion of 
Newtonian potential, 991 
and generation of gravitational waves, 
975, 991-994 
Quantum fluctuations 
in electromagnetic field, 427, 1190f 
in geometry of spacetime, 419, 480, 
1190-1194, 1202 

and zero-point energy of particles and 


fields, as responsible for gravity, 
426ff 

Quantum geometrodynamics 
commutation of observables in, on 
spacelike hypersurface, 554 
ideas of Penrose and Hawking on, 936, 
938, 940 

See also Pair production, Quantum 
fluctuations 
Quantum theory 

angular momentum commutators, 236 
general operators, 236 
correspondence principle, 413 
particle self-energies, 474f 
Quantum propagator, 1194 
Quasars, 76If 

distances to, controversy over, 767 
evolution of population, 767f, 770 
models for energy source, 634-635, 687 
use in experimental tests of general 
relativity, 1047f, 1061, 1101, 1103ff 
Quaternions. See Spin matrices 

R 

Radiation, description of spectrum, 588 
specific intensity l v defined, 587, 589 
specific flux F v defined, 1025 
flux F defined, 782 
conservation of IJv 3 (Liouville’s 
theorem), 587-588 

redshift of temperature of black-body 
radiation, 588 
Radiation, electromagnetic 
pictorial explanation of 1/r behavior, 
llOf • 

and causality, 110 
of oscillating dipole, 111-112 
Radiation, gravitational. See Gravitational 
waves 

Radiation reaction, 474, 993f 
Radiation zone, 997 
Radar time delay in Sun’s gravitational 
field, 1048, 1103, esp, H06-1109 
Radio sources, cosmic, 759-762 
isotropy on sky, 703 
See also Quasars 

Radius of closed Friedmann universe, 704f 
Raising indices. See Index manipulations 
Rays, in geometric optics, 573ff, 58If 
Redshift, cosmological 
independent of wavelength, 775 
“tired light” does not explain, 775 
derivations 

from standing waves, 776 
from wave-crest emission, propagation, 
and reception, 777f 
using sypimetry-induced constant of 
geodesic motion, 777, 780 
used to characterize distances and times 
in universe, 779 

contrasted with Doppler shift, 794 
of particle energies and de Broglie waves 
780 " 
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Redshift ( continued ) 

of cosmic microwave radiation, 764-765, 

779 

in anisotropic cosmology, 801 
See also under Cosmology 
Redshift 

Doppler, 63f, 794 

due to “ether drift,” 1064f 

of radiation from a collapsing star, 847, 

849f. 872 

Redshift, gravitational, for gravitational 
waves, 956f. 968 

Redshift, gravitational, for photons 
compared with 1970 clock technology, 

1048 

experimental results, 1058, 1060 
Pound-Rebka-Snider experiment, 

1056-1058 

in solar spectum, 1058-1060 
equivalence principle tested by, 189f, 

1056 

geodesic motion tested by, 1055-1060 
implies spacetime is curved, 187-189 
derivation 

from energy conservation, 187 
from geodesic equation in generic 
static metric, 657, 659 
in linearized theory, 446f 
Redshift, gravitational, for temperature, 

568, 685 

Redshift parameter, z, defined, 187 
Regge-Wheeler radial coordinate, See 
Tortoise coordinate 

Reference system. See Coordinate system, 
Inertial frame, Lorentz frame. Proper 
reference frame 
Regge calculus, 
surveyed, Chap. 42 
suitable for low-symmetry 
geometrodynamics, 1166 
geometry determined by lengths. 1167 
simplexes and deficit angles, 1167-1169 
skeleton geometry, 1169 
hinges, 1169 

continuum limit of, 1169 

blocks associated with one hinge, 1170 

variational principle for 

geometrodynamics, 1170 
flow diagram for, 1171-1172 
initial-value data in, 1172 
Einstein’s geometrodynamic law, 
expressed in, 1173 

choice of lattice structure, 1173-1177 
supplementary’ vertices in, 1176 
facing, packing, and right-through blocks, 
1176 

count of faces, 1177 
choice of edge lengths, 1177-1178 
applications and future of, 1178-1179 
Reissner-Nordstrom geometry 
derivation of metric, 840-841 
Kruskal-like coordinates for, 841 
coordinates with infinity conformally 
transformed, 920 


global structure of, 920-921 
throat for Q = M identical to 

Bertotti-Robinson universe. 845 
uniqueness of (Birkhoff-type theorem), 

844 ff 

as limiting case of Kerr-Newman. 878 
Reversible and irreversible transformations 
in black-hole physics. 889f 
Relative acceleration. See Geodesic 
deviation 

Relativity. See Special relativity; General 
relativity 

Renormalization of zero-point energy of 
particles and fields, 426fT 
“Reprocessing” of universe, 1209, 

1213-1217 

Retarded fields and radiation reaction. 474 
Retarded potential, 121 
Ricci curvature. See under Curvature, 
formalism of 

Ricci rotation coefficients. See Connection 
coefficients 

Ricci rotation 1-forms. See Rotation 
1-forms. 

Riemann. See under Curvature, 
formalism of 

Riemann normal coordinates, 285fT. 

480-486 

Riemannian geometry 
characterized, 242, 304f 
track-1 treatment of. Chap. 8 
track-2 treatment of, Chap. 13 
Riemann’s founding of, 220 
of apple, is locally Euclidean, 19-21 
of spacetime, is locally Lorentzian, 19-23 
See also specific concepts, such as Metric, 
Connection 

Robertson-Walker line element, 722, 759 
Rods, 301, 393, 396-399 
Roll-Krotkov-Dicke experiment. See 
Eotvos-Dicke experiments 
Rotation 

as stabilizer of stars, 633f 
rigid-body, 123f 
of universe, limits on, 939 
“Rotation” 

of a field of 1-forms, 123f 
of a field of 4-velocities, 566 
of rays, in spinor language, 1165 
Rotation group, 50(3), manifold of 
generators, 242-243, 264 
structure constants, 243, 332 
geodesics and connection, 264, 332 
Riemann curvature, 288 
metric, 332 

isometric to 3-sphere, 725 
used in constructing mixmaster 
cosmological model, 807 
Rotation matrices. See Spin matrices 
Rotation 1-forms 350-354, 360 
matrix notation for, 359 
See also Covariant derivative, Connection 
coefficients 

Rotation operators. See Spin matrices 


Rotations 

in coordinate plane, 67 
composition of. 1135-1138 
Rodrigues formula, 1137 
represented as two reflections. 1137AT 
half-angles arise from reflections, 1137 
infinitesimal, 170f, 1140ff 
Rutherford scattering, 647, 669 
relativistic corrections to, 669f 

S 

Saddle points, number of, 318 
Sakharov view of gravitation, 426-428 
Scalar field 

stress-energy tensor, 483 
equation of motion, from Einstein's field 
equation, 483 

propagation in Schwarzschild geometry, 

863, 868ff 

Scalar product of vectors, 22, 52f, 62 
Scalar-tensor theories of gravity. See under 
Gravitation, theories of 
Schild’s argument for curvature, 187-189 
Schild’s ladder, 
described, 249 

applications, 251-253, 258, 263, 268, 278 
Schwarzschild coordinates 

for any static, spherical system, 597 
for Schwarzschild geometry, 607 
pathology at gravitational radius, 11, 
823-826 

for a pulsating star, 689 
Schwarzschild geometry, 822 
in extensOy Chaps. 25, 31 
as limiting case of Kerr-Newman, 878 
Birkhoffs theorem for, 843-844 
derivation from 

full field equations, 607 
initial-value equation, 538 
coordinate systems and reference frames 
Schwarzschild coordinates, 607, 
823-826 

isotropic coordinates, 840 
Novikov coordinates, 826-827 
ingoing Eddington-Finklestein 
coordinates, 312, 828f, 849 
outgoing Eddington-Finklestein 
coordinates, 829ff 
Kruskal-Szekeres coordinates, 827, 
831-836. See also 
Kruskal-Szekeres coordinates 
tortoise coordinate, 663, 665£ 
coordinates with infinity conformally 
transformed, 919f 
orthonormal frames, 821 
Riemann curvature, 82Iff 
structure and evolution 

Einstein-Rosen bridge (wormhole), 
837ff, 842f 
topology, 838ff 

not static inside gravitational radius, 
838 

evolution, 838ff, 842 
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embedding diagrams, 528, 837, 839 
diagram of causal structure, 920 
singularities. See Singularities of 
Schwarzschild geometry 
singularities at r = 0. See under 
Singularities. 

explored by radially infalling observer, 
820-823 

destruction of all particles that fall inside 
gravitational radius, 836, 839, 860-862 
test-particle motion in 
in extenso , Chap. 25 
analyzed using Hamilton-Jacobi theory, 
649 

analyzed using symmetry-induced 
constants of the motion, 656-672 
orbit lies in a “plane,” 645f, 655 
conserved quantities for, 656 
angular momentum, 656ff 
eit£rgy-at-infinity, 656ff 
effective potential for radial part of 
motion, 639, 656, 659-662 
qualitative description of orbits, 662 
radial orbits, details of, 663-668, 
820-823, 824-826, 835 
nonradial orbits, details of, 668 
circular orbits, stability of, 662 
binding energy of last stable circular 
orbit, 885, 911 

periastron shift for nearly circular 
orbits, 670 

scattering cross section, 669f 
deflection angle, 671 
photon motion in 
shape of orbit, 673, 677 
effective potential for radial part of 
motion, 673f, 676 
qualitative description of, 674f 
impact parameter, 672 
critical impact parameter for capture 
673 

escape versus capture as a function of 
propagation direction, 675 
scattering cross section, 676-679 
capture cross section, 679 
wave propagation in 
effective potentials for, 868, 870 
scalar field, analyzed in detail, 863, 

868ff 

electromagnetic field, Newman-Penrose 
constants, 870f 

fields of zero rest mass, integer spin, 

866 

Dirac equation in, 1164 
perturbations of 

high-frequency, analyzed by geometric 
optics, 640 

wave equations for, related to 

Hamilton-Jacobi equation, 640 
stability against small, 884 
applications 

as external field of a static star. 607 
as exterior of a collapsing star, 846-850 
matched to Friedmann geometry' to 


produce model for collapsing star, 
85 Iff 

many Schwarzschild solutions joined in 
lattice to form closed universe, 
739f 

Schwarzschild lattice universe, 739f 
Schwarzschild radius. See Gravitational 
radius 

Schwarzschild surface. See Gravitational 
radius 

Schwarzschild’s uniform-density stellar 
model, 609-612 

Second, changing definitions of, 23-29 
Second law of black-hole dynamics, 93Iff 
formulated with assumptions ignored, 

889, 891 

reversible and irreversible 

transformations, 889f, 907-910, 913 
used to place limits on gravitational 
waves from hole-hole collisions, 886 
Second law of thermodynamics, 563, 567f 
Second moment of mass distribution, 
defined, 977 
Selector parameter 
defined, 265-266 

used in analysis of geodesic deviation, 
Chap. 11 

“Self-energy,” infinite, 474 
“Self-force,” 474 

Semicolon notation for covariant derivative, 
210 

Semimajor axis of an elliptic orbit, 647 
“Sense.” See Orientation 
Separation vector, 29ff, 218f, 265-270 
“Shear” 

of a congruence of world lines, 566 
of a bundle of null rays, 582 
in spinor language, 1165 
Shear stress 

idealized away for perfect fluid, 140 
produced by viscosity, 567 
in PPN formalism, 1074, 1075n 
Shell crossing, 859 
Shift function 

as Lagrange multiplier, 487 
metric interval as fixed by, 507 
covariant and contravariant forms of, 

507f 

award of arbitrariness in, reversed, 532 
two variational principles for, 538 
Shock waves 

hydrodynamic, 559, 564, 628 
in spacetime curvature, 554 
Signature, of metric, 311 
Simple fluid, defined, 558 
Simplex, 307, 380f, 1167ff 
Simultaneity 

in Newton, Minkowskii, and Einstein 
spacetime, 296 

as term for spacelike slice. See Spacelike 
slice 

Singularities, coordinate, 10-12 
illustrated by Schwarzschild coordinates, 
11, 823ff 


Singularities in geometry of spacetime 
definitions of, 934 

theorems on creation of, 934ff, 936, 938, 
762 

structures of, 935, 940, 804ff 
cosmic censorship vs. naked singularities, 
937 

in Schwarzschild geometry, 

and evolution of the geometry, 838f 
remote possibility that infalling objects 
might destroy, 840 

and spherical gravitational collapse of 
a star, 846, 860ff 
Mixmaster, 805-813 
is generic, 806, 940 

changing standards of time near, 813f 
initial, of the universe, 769f 
what “preceded” it? 769f 
prospects for understanding, 707 
should one worry about singularities? 
Misner’s viewpoint, 813f 
Thorne’s neutrality 
Wheeler’s viewpoint, 1196ff 
unphysical, due to overidealization 
surface layers, 552-556 
shell crossings, 859 

Signals, extraction of from noise, 1036-1038 
Size 

related to angular momentum, 162 
of accelerated frame, 168f 
Skeleton geometry, 309, 1169 
Skeleton history, 499 
Slicing of spacetime, 506. See also 
Spacelike slice 

“Slot” in machine concept of tensor. See 
Tensor 

Solar system, 752-756 
ephemeris for (J.P.L.), 1095, 1097 
relativistic effects in, magnitude of, 1048, 
1068 

Nordtvedt effect in, 1128 
See also Earth, Moon, Planetary orbits. 
Sun, Experimental tests of general 
relativity 

Space 

Newtonian absolute, 19, 40, 29If 
foamlike structure and quantum 
fluctuations, 1204 

not spacetime, as the dynamic object, 1181 
See also Manifold, Differential geometry, 
Differential topology, Affine 
geometry, Riemannian geometry 
Space theory of matter, 1202-1205 
Source counts in cosmology, 798 
Spacecraft, used to test general relativity 
1108f, 1114 

“Spacelike relationship to.” See Causal 
relationships 
Spacelike slice 

as “moment of time” in spacetime, 713f 
as the dynamic object in superspace, 

423 f, 1181 

geometrodynamics and electrodynamics 
derived from physics on, 419-423 
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Spacelike slice {continued) 

See also Embedding diagrams, Initial 
value. Three-geometries 
Spacetime geometry 

Newtonian. See Newton-Cartan theory' of 
gravity. 

Minkowskiian (Lorentz). See Lorentz 
geometry’ 

Einsteinian 
modeled by apple, 4 
Riemannian character tested by 
stability of Earth, 398f 
curvature of, implied by gravitational 
red shift, 187ff 

stratification denied by locally Lorentz 
character of physics, 304f 
viewed as a “gravitational field.” 399f 
as dynamic participant in physics, 337 
response to matter, as heart of general 
relativity, 404 

as classical approximation, 1181 f 
as classical leaf slicing through 
superspace, 1184 
See also General Relativity, 

Geometrodynamics, Curvature of 
spacetime - • 

Newtonian, Minkowskiian, and 
Einsteinian, compared and 
contrasted, 296, 437 
Special relativity 
briefly outlined, 47-48 
spelled out, Chaps. 2-6 
does not take in gravitation, Chap. 7 
local validity as central feature of curved 
spacetime, 304f 
See also specific concepts, e.g., 

Electromagnetic field, Lorentz 
transformations 

Specific intensity, defined, 587, 589 
Specific flux, defined, 1025 
Sphere, 2-dimensional (S 2 ) 
two coordinate patches to cover, 12 
topology of, 24If 
metric on, 340 
Riemann tensor of, 341 
Sphere, 3-dimensional (S 3 ) 
volume of, 724 

hyperspherical coordinates and metric 
for, 723 f 

Riemann curvature tensor, 721 
embedding diagram, 723 
compared with spheres of lower 
dimensionality, 704 

isometric to manifold of rotation group, 
725 

Spherical symmetry, Killing vectors for, 658 
Spherical systems, static 
Schwarzschild coordinates for, 594-597 
isotropic coordinates for, 595 
orthonormal frames for, 598 
rigorous derivation of line element, 616f 
curvature tensors for, 360f 
Spherical systems, dynamic 

Schwarzschild coordinates for, 616f, 689 


curvature tensors for, 36If 
Birkhoffs theorem for, 883f 
Spin, as nonclassical two-valuedness. 1204 
Spin matrices 

in law of combination of rotations, 1136 
as quaternions or rotation operators, 1136 
Hermitian conjugate of. 1138 
algebraic properties, 1137-1142 
and 3-vectors. 1140f 
and 4-vectors, 1142f 
associated spin matrices. 1152f 
multiplication law for, 1153 
Spinning body 

equation of motion for, 1120f 
transport law for spin 

Fermi-Walker, in absence of curvature 
coupling, 165, 176f, 1117 
modified by curvature coupling, 39If 
spin precessions 
“general,” 391 f 
Thomas, 175f, 1118, 1145ff 
frame-dragging, 1119f 
due to space curvature (“geodetic”), 

1119f 

Spindown of black holes, 886 
Spinors 

general account, Chap. 41 

and orientation-entanglement relation, 

1148ff 

defined by their law of transformation, 

1148ff 

conjugate complex, 1150 
with dotted indices, 1150 
correspondence with vectors, 11501T 
of higher rank, 1151 
algebra of, 1151-1155 
spinor equivalent of tensors, 1153f 
spin space and basis spinors, 1156 
flagpole plus flag plus 

orientation-entanglement relation, 
1157-1160 

in curved spacetime, 1164 
analyze appearance of night sky, 
1160-1164 

as tool in gravitation theory, 1164f 
Standard candle, 789 

Standard hot big-bang cosmological model. 

See under Cosmology 
Star clusters, relativistic, 621, 635 
creation by evolution of a galactic 
nucleus, 634 

analysis of structure, 679-683 
equations of structure summarized, 683f 
relativistic instability in, 621, 686f 
collapse of, 884 

possible roles in quasars and galactic 
nuclei, 634, 687 
specific models 

with purely circular orbits, 683 
self-similar, 683 
isotropic, 683f 
isothermal, spherical, 685ff 
Star operations. See Duality 
Starquake, 628 
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Stars, evolution into final state, 621, 624, 
627-629 

Stars, Newtonian 

equations of structure, 601-602, 605AT 
gravitational energy, 606-607 
uniform-density model, 609 
pulsation theory 

dynamical analysis, 697f 
virial-theorem analysis, 1079f 
volume-averaged analysis, 630f 
Stars, relativistic, nonrotating 
structure 

in extenso, Chaps. 23, 24 
equations of structure summarized, 

608- 609, 689 
must be spherical, 593 
Schwarzschild coordinate system, 597 
isotropic coordinate system, 595 
curvature tensors for, 360f 
Newtonian limit of gravitational 

potentials, 595 

parameters describing matter, 597-600 
proper reference frame of fluid, 598 
equations of structure derived, 600-606 
Newtonian limit of equations of 
structure, 601-602, 605ff 
mass-energy inside radius r, 602ff 
must have 2 m/r <i 1, 605, 612f, 615 
total number of baryons, 606 
external gravitational field. See 
Schwarzschild geometry, 
monotonicity of r, 612-613, 615 
embedding diagrams for, 613-615, 617 
specific models 
how to construct, 608f 
SchwarzschiId’s uniform-density model, 

609- 612 

Fermi-gas model with p c = oo, 615ff 
numerical models for white dwarfs and 
neutron stars, 625 ff, 696 
radial pulsation of 
dynamic analysis, 688-699 
boundary conditions for, 694 
eigenvalue problem for normal modes, 
695f 

Newtonian limit, 697f 
post-Newtonian limit, 698f, 1080 
stability of 

critical adiabatic index for radial 
pulsations, 697ff 

pulsational instability in massive stars, 
632 

relativistic instability, 605, 697ff 
See also under White-dwarf stirs, 

Neutron stars, Supermassive stars 
collapse of. See Collapse, Gravitational 
nonradial pulsation of, 984f 
Stars, relativistic, rotating 

slowly rotating, spherical stars, 699 
rapidly rotating disks, 621 
stabilizing effects of rotation, 633f 
Stars. See also Binary stars, Cepheid 
variable stars, Neutron stars, 
Supermassive stars, White dwarfs 
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Static limit, 879ff, 894 
Stationary gravitational field 
hydrostatic equilibrium in, 566 
thermal equilibrium in, 568 
Steady coordinates, 964 
“Steady flux of waves,” defined, 1019 
Steady-state cosmology, 745, 750, 770 
Stokes theorem, generalized, 96f, 127 
content in pictures, 117 
Gauss’s theorem as special case, 97, 150f 
applications, 96f, 125, 378 
Stress. See Stress-energy tensor 
Stress-energy pseudotensor. See 
Pseudotensor 
Stress-energy tensor 
summarized, 131-132 
in extenso, Chap. 5 
as machine to reveal energy density, 

momentum density, and stress, 13 If 
physical interpretation of components, 
137f 

symmetry of, 141-142 
vanishing divergence, 152 
as functional derivative of Lagrangian, 
485, 503ff 
specific form for 
nearly Newtonian fluid, 152, 154 
perfect fluid, 132, 140 
viscous fluid with heat flowing through 
it, 567 

stressed medium with no heat flow, 
1086f 

swarm of particles (kinetic theory), 
138f, 589f, 680, 682 
spherical star cluster, 680, 682 
photons in geometric optics limit, 579f 
geometric-optics waves, 579 
electromagnetic field, 139-140, 480ff 
scalar field, 504f 

gravitational field in spin-2 theory, 425 
gravitational field in general relativity. 
Does not exist; see Pseudo tensor 

Structure 

Cartan’s equation of, 378 
constants, of rotation group, 243 
deformation of, 530 
differentiable, 242 

symplectic, of Hamiltonian mechanics, 
125-126 

See also Global techniques, Horizons 
Sum-for-inertia. See Mach’s principle 
Sum over histories. See Feynman’s sum 
over histories 

Summation convention, Einstein’s, 9 
Sun 

mass deduced from planetary orbits, 638, 
endpapers 

quadrupole moment, 1112f, 1115f, 1053f 
gravitational field in PPN formalism. 

1097 ff 

velocity relative to Galaxy, local group, 
and universe, 1114 
effect on tides, 44 

observed redshift of light from, 1058ff 


radiation flux negligible compared to 
pressure, 1075 

See also under Experimental tests of 
general relativity 
Super-Hamiltonian 
contrasted with Hamiltonian, 488f 
for test-particle motion 
in electromagnetic field, 488f 
in gravitational field, 654 
in combined electromagnetic and 
gravitational fields, 897f 
for mixmaster cosmology, 809-813 
Supermassive stars, 620f, 634 
convection in, 600 
entropy constant in, 600 
adiabatic index in, 633 
pulsational instability in, 632-633 
relativistic instability in, 605, 620, 633f 
rotation as a stabilizer, 633-634 
possible roles in quasars and galactic 
nuclei, 634 

See also Neutron stars, White dwarfs 
Supernova, 619, 622 
Crab nebula created by, ii, 619f 
theoretical scenario for, 628 
as source of gravitational waves, 982, 987, 
1040, 1042 

Superspace, 1180-1183 
as starting point for Einstein’s 
geometrodynamic law, 423 
quantum fluctuations, 1180 
spread-out wave versus wave packet, 

1185 

conserved probability current in, 1189 
truncated, skeleton version, 1181 
mixmaster version {“minisuperspace”), 

806 

of 2-geometries, 221 
Surface integral. See Integration 
Surface of last influence, 873f 
Symmetry of tensor, indicated by round or 
square bracket, 126 
Symmetry operations as tensors, 126, 

128-129 

Symmetries, more subtle than symmetry 
and antisymmetry, 86 

Symmetries of spacetime. See Killing vector 
fields 

Symmetrization, of tensor, 83 
Synchronous coordinate system, 717. See 
also Gaussian normal coordinate 
system 

Syst£me International (SI) second. 28 


T 

Tails of waves in curved spacetime, 957, 
864f, 869ff 

Tangent space, 205, 227-231 
at neighboring points, linked. 246f, 252 
Tangent vector. See Vector, tangent. 
Taub-NUT space. 940 
TCP, experimental tests of. 1054 
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Teichmfiller space, 221 
Temperature. See Thermodynamics 
Temperature, redshift of, 568, 588, 685 
Tensors 

summaries of formalism for manipulating 
in global Lorentz frame, 85 
m manifold without metric, 233f 
in curved manifold with metric, 203f, 
223f 

machine-with-slots definition, 22, 74ff, 

131, 133f, 233f, 31 Of 

ambiguity of slots removed by component 
notation, 84 
rank, defined, 75f, 234 
components of 
in Lorentz frame, 75f 
in general frame, 201-204, 312 
operations on, introduced in global 
Lorentz frame 

indices, raising and lowering, 75-76 
addition, 76 

multiplication by scalar, 76 
tensor product, 76 
basis tensors, 76 
gradient, 81-82, 84 
contraction, 82, 84 
divergence, 82 
transpose, 83 

symmetrization, antisymmetrization. 

83, 85f, 126 
wedge product, 83 
duality, 85, 87, 88 
integration, 147ff 

algebraic operations extended to general 
frames, 201-207, 233f 
covariant derivative introduced, 208ff, 
257-261 

Lie derivative introduced, 517 
spinor representation of, 1153-1155 
Tensors, first rank. See Vectors, Forms, 
1-forms 

Tensors, second-rank symmetric, 
decomposition of, 947 
Tensors, completely antisymmetric. See 
Forms, differential 
Tensor density, 50If 
“Test body,” defined, 1050n 
Tests of general relativity. See Experimental 
^ tests of general relativity 
Test particle, freely falling, defines geodesic 
196 ' ~ 

Test particles 

three needed to explore Lorentz force, 72 
more needed to explore Riemann 
curvature, 72 

Tetrad 

carried by a uniformly accelerated 
observer. 169-170 

in Fermi-Walker transport, 170-171 
carried by accelerated observer, 328-332 
Tetrahedron, 307, 309 
Theories of gravitation. See Gravitation, 
theories of 

Thermal conductivity. See Heat conduction 
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Thermal equilibrium in a stationary 
gravitational field. 568 
Thermodynamics 
in exienso , 557-562 
basic references, 557n. 568 
basic concepts defined 
simple fluid, 558 
multicomponent fluid, 558 
baryon number density, 558 
density of total mass-energy, 558 
chemical potential. 558, 561, 562 
entropy per baryon. 558 
entropy 4-vector, 567 
temperature, 508 
heat-flux 4-vector, 567 
pressure, 558 

primary thermodynamic potential, 561 
physical free energy, 561 
chemical free energy, 561 
equations of state, 560 
adiabatic index, 692 
viscosity coefficients, 567 
laws and equations of 
law of heat conduction, 559 
Maxwell relations, 561, 564 
See also Conservation laws, baryons; 
First law of thermodynamics; 
Second law of thermodynamics; 
Equation of state 

extension of formalism when nuclear 
burning occurs. 558 
some applications and processes 
injection energy, 561 f 
shock waves, 559 

pair production at high temperatures, 
558 

chemical potential for an ideal Fermi 
gas, 565 

See also Hydrodynamics 
Thin-sandwich conjecture, 534 
Thomas precession, 175-176, 1118, 1146f 
Three-geometry 
of initial and final spacelike 
hypersurfaces, 488 

as carrier of information about time, 488 
533 

fixed at surface in ADM formulation, 522 
conformal 

in York’s formulation of initial-value 
problem, 540-541 

pure spin-2 representation via York 
curvature, 541 

YES vs. NO vs. quantum probability for, 

1184f 

See also Initial value, Spacelike slice 
Three-plus-one split of spacetime, 486, 505 
sandwiches and rigidification, 506 
via 3-metric plus lapse and shift, 506-507 
4-metric vs. 3-metric in, 508 
choice of slicing doesn’t matter, 526 
Tidal forces, 823, 860ff. See also Curvature 
of spacetime, Geodesic deviation 
Tides, produced by sun and moon, 38, 44 
391 f 


Time 

standards of, 23-29 
defined so motion looks simple, 23-29 
end of, in gravitational collapse, Chap. 44 
many-fingered, 495, 498, 527 
proper. See Interval 
imaginary coordinate for, not used, 51 
Newtonian universal, 40, 299 
See also Bubble-time derivative, Clocks. 
Day 

Time dilation, experimental tests of, 1054f 
Tired light, 775 
Tolman universe, 733 
Topology, point-set, 241, 926n 
Topology of spacetime 
various possibilities for Schwarzschild 
geometry. 837-840 
various possibilities for Friedmann 
cosmological models, 725 
Einstein vs. flat space views of, 437 
See also Differential topology 
Torque of sun and moon on Earth, 391-392 
Torsion 

not present in affine connection if 
equivalence principle is valid, 250 
vanishes in Riemannian geometry, 378 
possible incorporation into general 
relativity, 1049, 1068 
Tortoise coordinate, 663, 665-666 
Torus, three-dimensional, 284, 725 
Transformations 
active vs. passive, 1140 
of tensors, 201-204 
of connection coefficients, 262 
of spinors, 1149f 
Transpose of tensor, 83 
Transverse-traceless gauge 
in linearized theory, 946-950 
in a curved background, 969 
Trapped surface, 934, 936 
Trivector, defined, 83 
Tubes of force, 102, 114 
Twin “paradox,” 167 
Two-length-scale expansion, 571f 
Twistors, 937 

Two-form. See under Forms, differential 

U 

Unified theory of electricity and gravitation, 
Riemann’s unsuccessful search for, 32, 
221 

Uniqueness of free fall (“weightlessness,” 
“weak equivalence principle”), 13-19, 
197, 1050-1054 

formulation of this book, 1050 
Einstein’s 1908 formulation, 5 
contained in Einstein’s 1911 equivalence 
principle, 17 

experimental tests of 13-17, 1051-1054. 
See also Eotvos-Dicke experiment 
Universal Time (UT0, UT1, UT2), 28 
Universe. See Cosmological models, 
Cosmology 


Variational principles for spin-0, spin-1, 
and spin-2 theories of gravity in flat 
spacetime, 178-181 

Variational principles for test particle motion 
extremal proper time, 314-324 
“dynamic” principle, 322f 
Variational principles for geometrodynamics 
Hilbert’s, 434 

in exienso, Chap. 21 

in brief, 418, 485 

what fixed at limits, 485 

scalar curvature as integrand in, 491, 

519 

grounded in quantum character of 
physics, 499f 

in space-plus-time split. 519f 
put into ADM form. 520 
Sakharov renormalization of, 426 
Hilbert’s, by Palatini's method 
sketched, 491 

analogy with mechanics, 491-495 
analogy with electrodynamics, 495-498 
connection as independently variable 
in, 492 

Arnowitt, Deser, Misner 
in simplest form, 521 
exploited, 526 

specialized to mixmaster cosmology, 

808f 

thin-sandwich, for lapse and shift, 538 
in shortwave approximation, 927f 
in Regge calculus, 1170 
in superspace formulation, 1186 
Vector, tangent 
introduced, 8-13 
definitions of 
as arrow. 49 

as parametrized straight line, 49 
as derivative of point, 49, 205, 226-229 
as directional derivative operator, 205, 
227-230 

manipulations summarized. See under 
Tensor 

formalism of, in global Lorentz frames, 
timelike, null, and spacelike, 53 
defining directional derivative, 59f 
correspondence to 1-form, 58ff 
from 1-form by raising index, 62 
test for linear dependence, 83 
transition to curved spacetime, 201-207, 
230f I 

commutator, 204 

formula for determining components of, 
232 

transformation laws for, 230ff 
comparison by parallel transport, 245-263 
correspondence of, with spinors, 1150ff 
covariant components from spinor 
analysis, 1153 
Vector, /7-vector, 91 
Vectors, three-dimensional (spatial), 
introduced, 64 
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Vector potential for electromagnetic field, 
in curved spacetime, 569f 
Velocity 4-vector 
defined, 49 
components, 50 
unit length, 54 

related to energy-momentum 4-vector, 
53f 

Velocity parameter, in Lorentz 
transformation, 67 
Vierbein. See Tetrad 
Virial theorems in Newtonian theory of 
gravity, 1078 

evaluated for spherical stars, 607 
Viscosity and viscous stresses 
formalism for, 567 

damping of primordial chaos by, 769, 
802ff 

Vlasoff*equation, 680. See also Collisionless 
Boltzman equation 

Volume, Hero-Tartaglia formula for, 307 
Volume 1-form, 133-137 
4-volume, 147 
volume integrals, 147-151 
Volume in phase space, 584-587, 590 

W 

“Wave-dominated detector,” defined, 1019 
Wave vector, 573ff 

Wave function. See Probability amplitude 


Wave operators 

d’Alembertian, 89, 120, 177, 183 
de Rham, for vector fields, 569 
Lichnerowicz-de Rham, for tensor fields 
382 

conformally invariant, 542 
Wave number 1-form, 55-58 
related to energy-momentum 4-vector, 57 
Weak equivalence principle. See 
Uniqueness of free fall. 

Wedge product 

of tensors and of forms, 83, 9If 
rules for addition and multiplication, 92 
other names for, 83, 99 
of permutation tensors, 128f 
Weight, of a tensor density, 501 
Weightlessness. See Uniqueness of free fall 
White-dwarf matter 
thermal pressures negligible, 599-600 
electron capture in, 619 
equation of state for, 624ff 
White-dwarf stars, 619 
models for, 625-626 
Chandrasekhar mass limit, 619 
stability of, 605, 619, 626f, 696 
Whitehead’s theory of gravity. See under 
Gravitation, theories of 
“Wire up” source to field, 367ff, 405 
World line, 4, 13. See also Geodesic 
World sheet, swept out by one face of 
3-volume, 133 


World tube, analysis of balance of 
4-momentum in, 473 
Wormholes in space 
and Betti numbers, 221 
of Schwarzschild geometry, 837ff, 842 
of Reissner-Nordstram geometry, 921 
probably do not exist in real universe at 
classical level, 842f 
at quantum level, 1200f, 1203 
electric charge as lines of force trapped 
in, 1200 


X 

X-ray diffraction, related to 1-forms, 232 

X-ray sources, 761-762 

X-rays from gas around black holes, ix, 885 

Y 

York momenta measure deformation, 
recoordinatization, and scaling, 542 
York’s curvature, 541 
Young symmetry diagrams 86 


Zero-point energy, 426ff 


{ ' - T<. » « j 

| Ir-.C.' 1 .. 

\ ' 
v v . 




